CHAPTER 1 Flashcards

1
Q

………………is a field of computer science, artificial intelligence, and computational linguistics concerned with the interactions between computers and human (natural) languages.

A

Natural language processing (NLP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

NLP devoted to creating computers models that use natural language as…………and/or …………..

A

input and/or output.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

T/F NLP is about trying to get a machine brain to work the same way as a human brain when it comes to language.

A

T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why NLP is important?

A

To interact with computing devices using human (natural) languages.

To access (large amount of) information and knowledge stored in the form of human languages quickly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Practical examples:

A

Building intelligent robots (AI).
Enabling voice-controlled operation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the challenges?

A

-Ambiguity in language

-Human language is astoundingly complex and diverse

-Errors relevant to Speech and Text

-Apathy towards Vertical-Specific Lingo

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Ambiguity can be of 2 types:

A

-Lexical – multiple meanings of words: It is dealt with in “lexical semantics

2-Structural: It is dealt with in parsing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Language Processing Levels

A

L1: Speech sound(Phonetics & Phonology )
L2: Words & their forms(Morphology, Lexicon)
L3: Structure of sentences(Syntax , Parsing)
L4: Meaning of sentences(semantics)
L5:Meaning in context & for a purpose (pragmatics)
L6:Connected sentence processing in a larger body of text(Discourse)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Examples of Levels:

A

L1 : sound
L2 : Dog - Dog(s), Dog(ged)
Lady – Lad(ies)

L3 : Ram goes to market (right)
goes Ram to the market (wrong)

L4 : translation from unstructured to structured representation

L5 : User situation & context
“Is that water?” – the action to be performed is different in a chemistry lab and on a dining table.

L6 : Backward & forward references –
Coreference resolution
“The man went near the dog. It bit him.”
Often co reference & ambiguity go together as in
“The dog went near the cat. It bit it.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Three prominent application areas:(Some NLP Applications)

A

-Text analytics/mining (from “unstructured data”):
Sentiment analysis
Topic identification

-Conversational agents :Siri, Cortana, Amazon Alexa, Google Assistant
Chatbot

Machine translation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Some NLP Applications (cont.)

A

-Text Analytics
-Conversational Agents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is text analytics ?

A

Data-mining of weblogs, microblogs, discussion forums, user reviews, and other forms of user-generated media.

Typically this involves the extraction of limited kinds of semantic and pragmatic information from texts
-Entity mentions
-Concept identification
-Sentiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is Sentiment Analysis

A

The field of sentiment analysis deals with categorization (or classification) of opinions expressed in textual documents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is Conversational Agents?

A

Combine
-Speech recognition
-Question answering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is Machine Translation

A

The automatic translation of texts between languages is one of the oldest non-numerical applications in Computer Science.

Translating documents from one language to another (for example, producing automobile repair manuals in many different languages)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How does NLP work?

A

NLP is similar to how humans think and learn. Humans start by learning the basics. We first learn words, then sentences. From there, a level of complexity is built. The same can be said for NLP.

This type of technology is all about statistics and probability. From the meaning in the language of the text, we extract the value for each word and we extract the value between the relationship of the words.

17
Q

NLP applications require several NLP analysis:

A

-Sentence and/or Word tokenization
-Part-of-speech tagging (abbreviated as PoS tagging)
-Named Entity (NE) recognition
-Parsing
-Stemming
-Lemmatization
-Stemming &Lemmatization
-Stopword Removal
-Semantic analysis

18
Q

…………………….taggers with high level of accuracy can solve Word’s syntactic ambiguity. On the other hand, the problem of resolving semantic ambiguity is called WSD (word sense disambiguation).

A

Part-of-speech (POS) taggers

19
Q

…………is useful for identifying relationships between words and, therefore, understand the meaning of sentences.

A

PoS tagging

20
Q

…………….. is the process of determining the syntactic structure of a text by analyzing its constituent words based on an underlying grammar (of the language).

21
Q

about parsing?

A

Breaking a sentence as per grammar:

Like a Sentence is broken into Noun Phrase and Verb Phrase.

Noun Phrase could be again Article + Noun

22
Q

example of parsing

A

Parsing (cont.)

Example:
The boy ate the pancakes.

This sentence has the following structure:

The boy: Noun Phrase
ate: Verb
the pancakes: Noun Phrase (Determiner + Noun)

23
Q

what is stemming

A

consists of extracting the root of the word.

24
Q

wha is Lemmatization

A

allows reducing a word to its canonical form (the lemma). It is also called the “dictionary” form.

25
what is Stemming & Lemmatization
For example, in the sentence: “This is better” The word “better” is transformed into the word “good” by a lemmatizer but is unchanged by stemming.
26
what is Stopword Removal
It involves filtering out high-frequency words that add little or no semantic value to a sentence, for example, which, to, at, for, is, etc. You can even customize lists of stopwords to include words that you want to ignore.
27
what is Semantic analysis
The process of identifying which sense of a word is meant in a sentence or other segment of context.
28
about semantic
The occurrence of the word bass clearly denotes the distinct meaning. In first sentence, it means frequency and in second, it means fish. Hence, if it would be disambiguated by WSD then the correct meaning to the above sentences can be assigned as follows: I can hear bass/frequency sound. He likes to eat grilled bass/fish.
29
T/F Part-of-speech (POS) taggers with high level of accuracy can solve Word’s syntactic ambiguity. On the other hand, the problem of resolving semantic ambiguity is called WSD (word sense disambiguation).
T
30
...........The process that assigns labels to words or phrases in a sentence that indicates their semantic role in the sentence, such as that of an agent, goal, or result.
Semantic Role Labelling (SRL)