04-Natural Language Processing Flashcards
What is Natural Language Processing
Area of AI that deals with creating software that understands written and spoken language
What software can Natural Language Processing allow you to create
- Analyze text to extract key phrases and recognize entities (such as places, dates, or people)
- Perform sentiment analysis to determine how positive or negative the language used in a document is
- Interpret spoken language, and synthesize speech responses
- Automatically translate spoken or written phrases between languages
- Interpret commands and determine appropriate actions
Which cognitive services build natural language processing solutions
- Text analytics
- Translator text
- Speech
- Language Understanding
What is Text analytics
Analyzes text and extracts key phrases, detects entities (such as places, dates, and people), and evaluates sentiment (how positive or negative a document is)
What is Translator text
Translate between more than 60 languages
What is Speech service
RECOGNIZE and SYNTHESIZE speech, and TRANSLATE spoken languages
What is Language Understanding
Train a language model that can understand spoken or text-based commands
What will Language detection detect
- Language name, e.g. “English”
- ISO 639-1 language code, e.g. “en”
- Score indicating level of confidence in the language detection
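A hedged sketch of the kind of result language detection returns; the field names (detectedLanguage, iso6391Name, confidenceScore) follow the shape of recent Text Analytics responses but may differ by API version, so treat this as an illustration only:

```python
# Hypothetical language detection result: language name, ISO 639-1 code,
# and a confidence score. The exact JSON shape varies by API version.
detection_result = {
    "detectedLanguage": {
        "name": "English",
        "iso6391Name": "en",
        "confidenceScore": 0.99,
    }
}

# Pull out the three pieces of information the card lists.
lang = detection_result["detectedLanguage"]
print(f"{lang['name']} ({lang['iso6391Name']}): {lang['confidenceScore']}")
```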
What score is assigned to text that is ambiguous or has mixed language content
NaN
How does sentiment analysis evaluate text and return sentiment scores and labels
Uses a pre-built ML classification model. Returns a sentiment score in the range of 0 to 1, with 1 being most positive and 0 being most negative.
What score will sentiment analysis return if you pass French text but tell the service the language code is en for English
The service will return a score of precisely 0.5
What is Key Phrase Extraction
Evaluates text and identifies the key phrases that capture its main points.
What is Entity Recognition
An item of a particular type or category, and in some cases subtype, such as Person, Location, Organization, Quantity, etc.
Service also supports entity linking to help disambiguate entities by linking to a specific reference.
What is Speech recognition
Ability to detect and interpret spoken input
What is Speech synthesis
Ability to generate spoken output
What APIs are offered through the Speech cognitive service
- Speech-to-Text
- Text-to-Speech
What can you use speech-to-text for
Perform real-time or batch transcription of audio into text format. The audio source can be a real-time stream or an audio file
What model is used by the speech-to-text API
A model based on the Universal Language Model trained by Microsoft.
Optimized for conversational and dictation scenarios. Users can also create and train their own custom models, including acoustic, language, and pronunciation models.
What is real-time transcription
Used for presentations, demos, or any other scenario where a person is speaking.
Application needs to listen for incoming audio from a microphone or other audio input source such as audio file.
Application code streams the audio to the service, which returns the transcribed text.
What is batch transcription
Previously recorded and stored audio files are transcribed.
Can have audio stored. You can point to audio files with shared access signature (SAS) URI and asynchronously receive transcription results.
Runs in an asynchronous manner because batch jobs are scheduled on a best-effort basis. A job will normally execute within minutes of the request, but there is no estimate for when it changes into the running state
What is text-to-speech API
Enables you to convert TEXT input to audible SPEECH, which can either be played directly through computer speaker or written to an audio file
What is Speech synthesis voices
Pre-defined voices with support for multiple languages and regional pronunciation.
Includes standard voices as well as neural voices that leverage neural networks to overcome common limitations in speech synthesis with regard to intonation, resulting in a more natural-sounding voice.
What is a neural network
A set of algorithms that tries to recognize relationships in a set of data by imitating the way the human brain works. In this sense, neural networks refer to systems of neurons, either organic or artificial in nature.
What is machine translation
Automated translation to convert one language to another. This enables collaboration with people of other cultures and geographic locations.
What is literal translation
Each word is translated to the corresponding word in the target language
What is semantic
Relating to meaning in language or logic
What is text translation used for
Used to translate documents from one language to another, translate email communications that come from foreign governments, and even provide the ability to translate web pages on the Internet
What is speech translation used for
Used to translate between spoken languages, sometimes directly (speech-to-speech translation) and sometimes by translating to an intermediary text format (speech-to-text translation)
What Translation services does Azure provide
- Translator Text - text-to-text translation
- Speech - speech-to-text and speech-to-speech translation
What service does Translator Text use
Uses a Neural Machine Translation (NMT) model for translation, which analyzes the semantic context of the text and renders a more accurate and complete translation as a result
How do you specify language you are translating from and to in the Text Translator service
Use the ISO 639-1 language code, e.g. en for English, fr for French.
How do you specify cultural variant of language
Use the ISO 3166-1 culture code, e.g. en-US for US English, en-GB for British English.
What are optional configurations for Translator Text API
- Profanity filtering
- Selective translation - tag content so it isn’t translated
What APIs does Speech service include
- Speech-to-text
- Text-to-speech
- Speech translation
What are 3 core concepts of Language Understanding
- Utterances
- Entities
- Intents
What are utterances
What a user might say, e.g. “Switch the fan on”
What are entities
An item to which an utterance refers, e.g. “fan” in “Switch the FAN on”
What are intents
The purpose, or goal, expressed in a user’s utterance, e.g. “TurnOn” for “Switch the FAN on”
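The three concepts can be illustrated with a toy keyword matcher. This is not the Language Understanding service itself; the DEVICES list and the intent rules are invented for the sketch:

```python
# Toy illustration of utterances, entities, and intents: a keyword-based
# matcher that maps an utterance to an intent and extracts a device entity.
DEVICES = {"fan", "light", "lamp"}

def predict(utterance: str):
    # Normalize the utterance into lowercase words without end punctuation.
    words = utterance.lower().rstrip(".!?").split()
    # Intent: the purpose expressed in the utterance.
    intent = "None"  # fallback, like the None intent
    if "on" in words:
        intent = "TurnOn"
    elif "off" in words:
        intent = "TurnOff"
    # Entities: items the utterance refers to.
    entities = [w for w in words if w in DEVICES]
    return intent, entities

print(predict("Switch the fan on"))  # ('TurnOn', ['fan'])
```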
When should you use the None intent
To handle utterances that do not map to any of the intents you have created. It is considered a fallback, and is used to provide a generic response to users when their requests don’t match any other intent.
What two main tasks are involved with creating a language understanding application
- Define entities, intents, and utterances with which to train the language model. Referred to as AUTHORING the model.
- Publish the model so the client applications can use it for intent and entity PREDICTION based on user input
What is the Language Understanding portal
Web-based interface for creating and managing Language Understanding applications
What are 4 types of entities
- Machine Learned - entities that are learned by your model during training from context in the sample utterances you provide
- List - entities that are defined as a hierarchy of lists and sublists, e.g. a device list might include sublists for light and fan. For each list entry, you can specify synonyms, such as lamp for light
- RegEx - a regular expression that describes a pattern, e.g. [0-9]{3}-[0-9]{3}-[0-9]{4} for phone numbers in the form 555-123-4567
- Pattern.any - entities that are used with patterns to define complex entities that may be hard to extract from sample utterances
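The RegEx entity type can be sketched directly in Python, using the exact phone-number pattern from the list above:

```python
import re

# The RegEx entity pattern from the card, used to pull phone-number
# entities out of free text.
PHONE = re.compile(r"[0-9]{3}-[0-9]{3}-[0-9]{4}")

text = "Call me at 555-123-4567 or 555-987-6543 tomorrow."
print(PHONE.findall(text))  # ['555-123-4567', '555-987-6543']
```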
What is training the model
The process of using sample utterances to teach your model to match natural language expressions that a user might say to probable intents and entities.
Training and testing is an iterative process.
What is predicting
After training and testing, publish Language Understanding application to prediction resource.
Predictions are returned to client application
What are some techniques to build software to analyze text
- Statistical analysis
- Extending frequency analysis to multi-term phrases
- Apply stemming or lemmatization algorithms
- Apply linguistic structure rules to analyze sentences
- Encode words or terms as numeric features that can be used to train a ML model
- Create vectorized models to capture semantic relationships
What is statistical analysis of terms used in text
Remove common “stop words”, e.g. “the” or “a”. Then perform FREQUENCY ANALYSIS of the remaining words (how often each word appears). This provides clues about the main subject of the text.
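A minimal sketch of this statistical analysis in Python; the stop-word list is a small invented sample, not a standard list:

```python
from collections import Counter

# Drop stop words, then count how often each remaining word appears.
STOP_WORDS = {"the", "a", "an", "is", "of", "and", "to"}

def frequency_analysis(text: str) -> Counter:
    # Strip simple punctuation and lowercase each word before counting.
    words = [w.strip(".,!?").lower() for w in text.split()]
    return Counter(w for w in words if w and w not in STOP_WORDS)

counts = frequency_analysis("The fox and the dog. The fox ran.")
print(counts.most_common(1))  # 'fox' appears most often
```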
What does extending frequency analysis to multi-term phrases involve
These phrases are known as N-grams (a two-word phrase is a bi-gram, a three-word phrase is a tri-gram, etc).
The frequency analysis is then applied to these phrases as well as to single words.
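N-gram extraction can be sketched as a sliding window of n consecutive words:

```python
# Build N-grams by sliding a window of n words across the text.
def ngrams(text: str, n: int):
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]

# Bi-grams (n=2) of a short sentence.
print(ngrams("natural language processing is fun", 2))
# ['natural language', 'language processing', 'processing is', 'is fun']
```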
What is stemming or lemmatization algorithms
These NORMALIZE WORDS before counting them, e.g. “power”, “powered”, and “powerful” are interpreted as being the same word.
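A toy suffix-stripping “stemmer” illustrates the idea. This is not a real algorithm such as Porter’s; the suffix list is invented for the example:

```python
# Naive stemmer: strip a few common suffixes so related word forms
# collapse to the same stem before counting.
def stem(word: str) -> str:
    for suffix in ("ful", "ing", "ed", "s"):
        # Only strip when a reasonably long stem remains.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: len(word) - len(suffix)]
    return word

print([stem(w) for w in ["power", "powered", "powerful"]])
# all three normalize to 'power'
```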
What does applying linguistic structure rules to analyze sentences involve
For example, break down sentence into tree-like structure such as noun phrase, which itself contains nouns, verbs, adjectives, and so on
What does encoding words or terms as numeric features to train ML model involve
This technique is often used to perform SENTIMENT ANALYSIS, in which a document is classified as positive or negative.
For example, classify a text document based on the terms it contains.
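A minimal bag-of-words encoding shows how documents become numeric feature vectors that a classifier could be trained on; the example documents are invented:

```python
# Each document becomes a vector of term counts over a shared vocabulary.
docs = ["great movie great acting", "terrible movie"]
vocab = sorted({w for d in docs for w in d.split()})

def vectorize(doc: str) -> list:
    words = doc.split()
    # One count per vocabulary term, in sorted vocabulary order.
    return [words.count(term) for term in vocab]

vectors = [vectorize(d) for d in docs]
print(vocab)     # ['acting', 'great', 'movie', 'terrible']
print(vectors)   # [[1, 2, 1, 0], [0, 0, 1, 1]]
```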
What does creating vectorized models to capture semantic relationship between words involve
Capture semantic relationship between words by assigning them to locations in n-dimensional space.
For example, assign values to words “flower” and “plant” that locate them close to one another, while “skateboard” might be given a value that positions it much further away
How does Text Analytics cognitive service simplify application development
It uses pre-trained models that can
- Determine the language of a document or text (e.g. French or English)
- Perform sentiment analysis to determine how positive or negative the language in the text is
- Extract key phrases from text that might indicate its main talking points
- Identify and categorize entities in the text. Entities can be people, places, organizations, or even everyday items such as dates, times, quantities, etc
How can Text Analytics be applied
- Social media feed analyzer to detect sentiment around a political campaign or a product in market
- Document search application that extracts key phrases to help summarize the main subject matter of documents in a catalog.
- Extract brand information or company names from documents or other text for identification purposes
What will Language detection capability of Text Analytics return
- Language name, e.g. “English”
- ISO 639-1 language code, e.g. “en”
- Score indicating level of confidence in the language detection
For AI to accept vocal commands and provide spoken resources, what capabilities must it support
- Speech recognition - ability to detect and interpret spoken input
- Speech synthesis - ability to generate spoken output
How can software analyze speech patterns to determine recognizable patterns that are mapped to words
Uses
- An acoustic model that converts audio signal into phonemes (representations of specific sounds)
- A language model that maps phonemes to words, usually using a statistical algorithm that predicts the most probable sequence of words based on phonemes
What are phonemes
Representations of specific sounds.
A phoneme is the smallest unit of sound in speech. When teaching reading, children learn which letters represent those sounds. For example, the word “hat” has 3 phonemes: “h”, “a”, and “t”.
What are recognized words that are typically converted to text used for
- Provide closed captions for recorded or live videos
- Create transcript of a phone call or meeting
- Automated note dictation
- Determine intended user input for further processing
How is speech synthesis the reverse of speech recognition
Concerned with vocalizing data, usually by converting text to speech. Speech synthesis solution typically requires
- Text to be spoken
- A voice to be used to vocalize the speech
How does a system synthesize speech
- Tokenizes the text to break it into individual words
- Assigns phonetic sounds to each word
- Breaks the phonetic transcription into prosodic units (such as phrases, clauses, or sentences) to create phonemes that will be converted to audio format.
- The phonemes are then synthesized as audio by applying a voice, which determines parameters such as pitch and timbre, and by generating an audio waveform that can be output to a speaker or written to a file
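The first two steps above can be sketched as a toy text-to-speech front end. The phoneme dictionary is invented for illustration; real systems use pronunciation lexicons and trained models:

```python
# Toy TTS front end: tokenize the text, then assign phonetic symbols
# to each word. Unknown words fall back to their letters.
PHONEME_DICT = {
    "hat": ["h", "a", "t"],
    "the": ["dh", "ah"],
}

def to_phonemes(text: str):
    tokens = text.lower().split()                          # step 1: tokenize
    return [PHONEME_DICT.get(t, list(t)) for t in tokens]  # step 2: phonetics

print(to_phonemes("the hat"))  # [['dh', 'ah'], ['h', 'a', 't']]
```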
What are prosodic units
Phrases, clauses, or sentences
How can you use output of speech synthesis
- Generate spoken responses to user input
- Create voice menus for telephone systems
- Read email or text messages aloud in hands-free scenarios
- Broadcast announcements in public locations, such as railway stations or airports
What cognitive service supports key phrase extraction
The Text Analytics cognitive service
What does Translator Text API offer to fine-tune results
- Profanity filtering
- Selective translation
What does Speech service APIs include
- Speech-to-text
- Text-to-speech
- Speech Translation
Can Speech service translate from audio source to text
Yes, it can
What does Translator Text Support
Text-to-text translation
What does Speech service support
Enables speech-to-text and speech-to-speech translation