NLP Flashcards
What NLP services does Azure provide?
Text Analytics
Translator
Speech
Language Understanding Intelligent Service (LUIS)
What would you use Translator Text for?
Automatically translating spoken or written word between languages.
What would you use LUIS for?
Interpreting commands and determining appropriate actions.
What is Text Analytics?
A cloud-based service that provides advanced natural language processing over raw text for sentiment analysis, key phrase extraction, named entity recognition and language detection.
What would you use Text Analytics for?
Analysing and interpreting text in documents, email messages, and other sources.
What is frequency analysis?
Counting how often each word appears in text.
What is an N-gram?
A contiguous sequence of n items from a given sample of text or speech.
What is an entity in Text Analytics?
People, places, organisations or everyday items like dates, times, quantities, and so on.
What does the language capability of Text Analytics detect for each document submitted to it?
The language (e.g. ‘English’)
The ISO 6391 code (e.g. ‘en’)
A score indicating the level of confidence in the language detection.
What happens if a text with a mix of languages is submitted to Text Analytics service?
The service will focus on the predominant language in the text, and may return a confidence score less than 1.0
What could happen if you submitted ambiguous content to the Text Analytics service?
The service would return the language name and the language identifier as unknown and a score of NaN. (not a number)
What would you use the language detection capability of Text Analytics for?
To identify the language a text is written in.
What would you use the sentiment analysis capability of Text Analytics for?
Evaluating text to return sentiment scores and labels for each sentence.
What is the range of the sentiment score?
0 - 1
What is a positive sentiment score?
Values close to 1.
What is a negative sentiment score?
Values close to 0.
What is a neutral or indeterminate sentiment score?
0.5
What could cause an indeterminate sentiment score?
A sentence without structure (like a list of words), or using the wrong language code. (e.g. telling the service a document is en(glish) but its actually fr(ench))
What would you use the key phrase extraction capability of Text Analytics for?
Identifying the main talking points of a document(s).
What would you use the entity recognition capability of Text Analytics for?
Getting a list of entities from a piece of text. The service can also provide links to more information about that entity on the web.
What is Entity Linking?
The ability to disambiguate entities by linking to a specific reference. (Wikipedia)