Conceptual Flashcards
What does conceptual analytics focus on?
Related concepts
Conceptual analytics helps reveal the facts of a case by doing the following:
- Giving users an overview of the document collection through clustering
- Helping users find similar documents with a right-click
- Allowing users to build example sets of key issues
- Running advanced keyword analysis
What type of indexing does a Conceptual index use?
Latent Semantic Indexing (LSI)
What type of indexing does a Classification index use?
Support Vector Machine (SVM)
Only documents in this are returned when using the Analytics index
Data source
You get diminishing returns when using more than this number of dimensions
300
What is the maximum number of recommended documents in a Classification source
9 million
What type of content should the training data contain?
Authored (i.e. extracted text)
What type of fields should not be included in the training set?
Metadata
Documents with less than this amount of text should be excluded from the training source
0.2 KB
True or False: words starting with numbers are not including in the index
True
True or False: words ending with or with embedded numbers are not included in the index
False
Optimize training set excludes documents of low quality, such as:
- Very short documents
- Very long documents
- Lists containing a significant amount of numbers
- Spreadsheet-like documents
- System log files
- Text resulting from processing errors
Optimize training set evaluates documents based on:
- Word count
- Uniqueness
- Number count
- Punctuation marks
- Words with many characters (50+)
Typically these fields are returned to be indexed:
Extracted text