USING ANALYTICS FOR REVIEW Flashcards

1
Q

What are the two types of emails in Structured Analytics?

A

Inclusive

Non-Inclusive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is an inclusive email?

A

An email that contains unique content not included in any other email, and thus, must be reviewed. An email with no replies or forwards is by definition inclusive. The last email in a thread is also by definition inclusive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an non-inclusive email?

A

an email whose text and attachments are fully contained in other (inclusive) emails

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why would you review only inclusive emails?

A

Because they contain everything that needs to be reviewed.

By reviewing only inclusive emails and skipping duplicates, your review process will be much more efficient.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the common Inclusive emails?

A
  • The last email in a thread
  • The end of attachments
  • Change of text
  • Change of sender or time
  • Duplication
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Are duplicate emails with blank attachments considered Inclusive or non-inclusive?

A

Blank attachments are always considered unique and therefore inclusive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What can email threading visualisation be used for?

A
  • Quickly see the story of an email conversation

- Optimise your QC process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What field is required for email thread visualisation?

A

Email Author Date ID must be present for the emails.

This field is only available for emails run through a full analysis using structured analytics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If you have multiple Structured Analytics Sets how do you set which one to use for your current email threading visualisation?

A

Click display options and choose which SA set to display

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Textual Near Duplicate Identification

What does it do?

A
  • Takes contents of Extracted Text Field for all docs with 30MB or less text
  • Operates on text-only and converts everything to lower case
  • Sorts all docs from largest to smallest and then defines groups of documents organised around a principle document for each group.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Textual Near Duplicate Identification

If no groups are matched for a document what happens?

A

Nothing happens.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Textual Near Duplicate Identification

What happens for documents that are not textually similar to other documents or do not contain text?

A

Analyzed documents that are not textually similar enough to any other documents will not have fields populated for Textual Near Duplicate Principal or Textual Near Duplicate Group.

Docs that only contain numbers or are blank will have the Textual Near Duplicate Group set to numbers only or empty, respectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Textual Near Duplicate Identification

What is the minimum similarity percentage parameter for?

A

This parameter indicates how similar a document must be to a principal document to be placed into that principal’s group.

The higher the setting the faster the process will run because fewer comparisons have to be made

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Textual Near Duplicate Identification

What fields does it create when you run Textual Near Duplicates Identification?

A

-::Textual Near Duplicate Principal
Identifies the principal document with a “Yes” value.

-::Textual Near Duplicate Similarity
The percent value of similarity between the near duplicate document and its principal document.

-::Textual Near Duplicate Group
Acts as the identifier for a given group of textual near-duplicate documents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Textual Near Duplicate Identification

After running TNDI what are the next recommended actions?

A
  • Setting up a Textual Near Duplicates view for a structured analytics set
  • Assessing similarities and differences with Document Compare
  • Viewing the Textual Near Duplicates Summary
  • Viewing Near Dup Groups in related items pane
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Textual Near Duplicate Identification

How do you set up a view for TNDI?

A

Create new view
In the OTHER tab
GROUP BY the Destination Textual Near Duplicate Group

Apply condition
Textual Near Duplicate Group : is set

Use these output fields for the view:

  • Textual Near Duplicate Principal
  • Textual Near Duplicate Similarity
  • Textual Near Duplicate Group

You now have a view that shows all TNDI documents in their groups

17
Q

Textual Near Duplicate Identification

Can I view Near Dup Groups in the Related Items Pane?

A

Yes

There is a button for it

18
Q

Textual Near Duplicate Identification

Propagation For Responsiveness Field
Is this a good idea?

A

Nope.

It’s a bad idea.
if you mark an email within a group as not responsive, other potentially responsive emails within the group could be automatically coded as not responsive.

19
Q

Name Normalisation

What is it?

A

Analyses email headers to identify aliases and the entities those aliases belong to.

Entities are merged with those created by Legal Hold, Processing or Case Dynamics

20
Q

Name Normalisation

How does it work?

A

Creates Aliases the first time is sees an email address or name. Then associates the aliases with appropriate entity.

21
Q

Name Normalisation

Should you run Name Normalisation in the same Structure Analytics set as email threading or Textual Near Duplicate Identification.

A

No!

Best to run in it’s only SA set for maximum flexibility. Makes it easier to make modifications if needed in future.

22
Q

Name Normalisation

What fields are needed to run Name Normalisation?

A

Must have at least one From field and one other email header field such as To, CC or BCC, Subject or Date Sent.

23
Q

Name Normalisation

If you don’t add Alias values prior to running name normalisation can you add them after?

A

Yes!

But you have to use the Merge Mass operation to consolidate duplicate entities

24
Q

Cluster Visualisation

What is it?

A

Renders your cluster data as an interactive map allowing you to see a quick overview of your cluster sets and quickly drill into each cluster set to view subclusters and conceptually-related clusters

25
Q

Clustering

What is it?

A

Analytics uses clustering to create groups of conceptually similar documents.

Unlike categorization, clustering doesn’t require much user input.

26
Q

Concept Searching

What is it?

A

Concept Searching finds information without precisely phrased query by applying a block of text to find docs that have similar conceptual content

27
Q

Concept Searching

How do you run a concept search or a “Find Similar Documents” Search

A

In the viewer right-click on some text and Conceptual Search.

Conceptual Search does the highlighted Text
Find Similar Documents uses the entire document