Conceptual Flashcards

1
Q

What does conceptual analytics focus on?

A

Related concepts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Conceptual analytics helps reveal the facts of a case by doing the following:

A
  • Giving users an overview of the document collection through clustering
  • Helping users find similar documents with a right-click
  • Allowing users to build example sets of key issues
  • Running advanced keyword analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What type of indexing does a Conceptual index use?

A

Latent Semantic Indexing (LSI)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What type of indexing does a Classification index use?

A

Support Vector Machine (SVM)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Only documents in this are returned when using the Analytics index

A

Data source

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

You get diminishing returns when using more than this number of dimensions

A

300

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the maximum number of recommended documents in a Classification source

A

9 million

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What type of content should the training data contain?

A

Authored (i.e. extracted text)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What type of fields should not be included in the training set?

A

Metadata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Documents with less than this amount of text should be excluded from the training source

A

0.2 KB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

True or False: words starting with numbers are not including in the index

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

True or False: words ending with or with embedded numbers are not included in the index

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Optimize training set excludes documents of low quality, such as:

A
  • Very short documents
  • Very long documents
  • Lists containing a significant amount of numbers
  • Spreadsheet-like documents
  • System log files
  • Text resulting from processing errors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Optimize training set evaluates documents based on:

A
  • Word count
  • Uniqueness
  • Number count
  • Punctuation marks
  • Words with many characters (50+)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Typically these fields are returned to be indexed:

A

Extracted text

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Do not index these types of fields:

A
  • single choice
  • multiple choice
  • multiple object
17
Q

What field contains whether a document is included as a data source or training data source for the index?

A

Conceptual Index

18
Q

What does cancelling a population cause?

A

The next build needs to be a full build

19
Q

What must you do to add documents previously excluded from the training set to the training set?

A

Disable “Optimize training set” and perform a full build

20
Q

Documents larger than this size are excluded from the index?

A

30MB