E7 Flashcards

1
Q

What is text mining?

A

Finding interesting information in texts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Cleaning and preprocessing text

A
  1. Case normalization
  2. Removing punctuation
  3. Removing numbers
  4. Removing stopwords
  5. Word stemming and stem completion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A token/term

A

e.g., a word or a group of words

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A document

A

One piece of text

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A corpus

A

A collection of documents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly