Week 11 UAS Flashcards

1
Q

The difference between Temporal-Difference Learning and Adaptive Dynamic Programming?

A
  • TDL is to match estimation utility value of state with follow state to reach and makes one matching the value of utility estimation of each transition state.
  • ADP is to match estimation utility value of state with all sate to reach and makes some matching value needed for consistent between utility estimation value with environment modelling.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Approaches in Active Reinforcement Learning?

A
  1. Reflex agent: learn policy directly, function
    mapping from states to actions
  2. Q-learning: learns an action-value function, or Q function, giving the expected utility of taking a given action in a given state
  3. Utility-based agent: learn utility values for states, use it to select actions that maximize the expected outcome utility
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The agent in Active Reinforcement Learning can get profit value from 2 method

A
  1. Greedy approach: maximize utility by using estimation model
  2. Wacky approach: random for all environment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Direct Utility Estimation?

A

Using least mean squares concept (reward to go) which use Bellman Equations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Active reinforcement Learning?

A

learn about optimizing mapping from states and actions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Temporal Difference Learning?

A

update utility value appropriately which is affected by final state

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Natural Language Processing?

A

The Agent which want to add the information needs to understand (at least partially) of the human language which is sometimes ambiguous and unclear.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

3 ways to find information based on information retrieval perspective in NLP?

A
  1. Text classification
  2. information retrieval
  3. information extraction
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

N-gram character models

A

N-gram character models is defined as a Markov chain of order that in a Markov chain the probability of character depends only on the immediately preceding characters, not on any other characters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Smoothing approach N-gram models?

A

Linear Interpolation smoothing (backoff model) combines unigram, bigram, and trigram using linear interpolation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Text classification also known as?

A

Categorization: given a text of some kind, decide which of a predefined set of classes it belongs to. Language identification and genre classification are examples of text classification, as is sentiment analysis (classifying a movie or product review as positive or negative) and spam detection (classifying an email message as spam or not-spam).
can be done with Naive bayes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Information retrieval?

A

Information retrieval (searching information like Google) is the task of finding documents that are relevant to a user’s need for information. The best-known examples of information retrieval systems are search engines on the World Wide Web.
characteristics:
1. A collection of writings (document).
The system must determine which one want to be considered as a document (paper). Example: a paragraph, a page, etc.
2. User Query
The query is a formula used to find the information needed by the user.
In its simplest form, a query is a keyword and documents that contain the keywords are the searched documents.
3. Set of Results
The results from the queries. A part of the documents
in which is relevant to the query.
4. Display of result sets
Can be a list of results in a ranking of the title documents.

previously works with boolean models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Information extraction?

A
Information extraction is the process of acquiring knowledge by skimming a text and looking for occurrences of a particular class of object and for relationships among objects.
The simplest type of information extraction system is an attribute-based extraction system TEMPLATE REGULAR EXPRESSION that assumes that the entire text refers to a single object and the task is to extract attributes of that object.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

4 approximation for Information Extraction?

A

– Deterministic to stochastic
– Domain-spesific to general
– Hand-crafted to learned
– Small-scale to large-scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Relational-based extraction system?

A

FASTUS.
divided to be 5 stages:
1. Tokenization: divide the characters into a token
2. Complex-word handling: handle words that contains grammatical rules
3. Basic-group handling: sort by morphological of the words
4. Complex-phrase handling: merge basic group to be an arrangement of words
5. Structure merging: combining structure that have been resulted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Machine Translation?

A

System that could read on its own and build up its own database. Such a system would he relation-independent; would work for any relation.

17
Q

A representative machine-reading system is

A

TEXTRUNNER

18
Q

Information Retrieval applications?

A
  1. PageRank Algorithm
  2. HITS (Hyperlink-Induced Topic Search) Algorithm
  3. Question Answering, when the query type is questions, then the result is not a ranking list of the documents, but the form of short response, it could be a sentence or phrase.