CDS Legal - 52 eDiscovery Terms Flashcards

1
Q

Evidence that is allowable in court.

A

Admissible

2
Q

The term used to refer to the various technologies used to provide multiple views into the data set.

A

Analytics

3
Q

Long term repository for the storage of records and files.

A

Archive

4
Q

A document or file that is connected to another document or file either externally, e.g. a document connected to an email, or embedded, e.g. an image in a word processing document.

A

Attachment

5
Q

Both the action of and the result of creating a copy of data as a precaution against the loss or damage of the original data.

A

Backup

6
Q

Portable media used to store copies of data that are created as precaution against the loss or damage of the original data.

A

Backup tape

7
Q

The processing of a large amount of ESI in a single step.

A

Batch Processing

8
Q

The process of tracking and recording the movement, handling and location of electronic evidence chronologically from collection to production. It is used to verify the authenticity of the ESI.

A

Chain-of-Custody

9
Q

A file that is attached to another communication file, e.g. the attachment to an email or a spreadsheet embedded in a word processing document.

A

Child Document

10
Q

A single file containing multiple documents and/or files, usually in a compressed format; e.g. zip, rar, pst.

A

Container File
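
As an illustration of the idea, the sketch below builds a small zip container in memory and then lists and reads the files inside it. It uses Python's standard `zipfile` module; the file names and contents are made up for the example.

```python
import io
import zipfile

# Build a small zip container in memory (hypothetical file names and contents).
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
    archive.writestr("memo.txt", "Draft memo")
    archive.writestr("budget.csv", "item,cost\nlaptop,1200\n")

# Reopen the container and look at the individual files it holds.
with zipfile.ZipFile(buffer) as archive:
    names = archive.namelist()
    memo = archive.read("memo.txt").decode("utf-8")

print(names)
print(memo)
```

The single container holds two separate documents, each of which has to be extracted before it can be reviewed.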

11
Q

Most often refers to the individual from whose file systems a group of records were extracted. This person is not necessarily the author of the documents.

A

Custodian

12
Q

The process of parsing data from electronic documents to identify their metadata and body contents.

A

Data Extraction
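
A minimal sketch of what this can look like for a single email, using Python's standard `email` package. The message text and addresses are invented for the example, and real processing tools handle many more file types and fields.

```python
from email import message_from_string

# A tiny, made-up email in its raw text form: headers, blank line, body.
raw = (
    "From: alice@example.com\n"
    "To: bob@example.com\n"
    "Subject: Q3 numbers\n"
    "\n"
    "See the figures below.\n"
)

msg = message_from_string(raw)

# Metadata: information describing the message.
metadata = {"from": msg["From"], "to": msg["To"], "subject": msg["Subject"]}

# Body contents: the text of the message itself.
body = msg.get_payload()

print(metadata)
print(body)
```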

13
Q

The process of identifying and recording the location and types of ESI within an organization’s network, and policies and procedures related to that ESI.

A

Data Mapping

14
Q

The process of comparing the characteristics of electronic documents to identify and/or remove duplicate records to reduce review time and increase coding consistency.

A

De-duplication, De-duping.
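
One common way to compare documents is by a hash of their content. The sketch below assumes that approach and keeps only the first copy of each distinct content value; `dedupe` and the sample file names are hypothetical, and real tools may compare other characteristics as well.

```python
import hashlib

def dedupe(documents):
    """Return the names of documents whose content has not been seen before."""
    seen = set()
    unique = []
    for name, content in documents:
        digest = hashlib.sha256(content).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(name)
    return unique

# Hypothetical data set: b.txt is an exact copy of a.txt.
docs = [
    ("a.txt", b"quarterly report"),
    ("b.txt", b"quarterly report"),
    ("c.txt", b"meeting notes"),
]
unique_docs = dedupe(docs)
print(unique_docs)  # ['a.txt', 'c.txt']
```

Reviewers then see the report once instead of twice, which is where the savings in review time come from.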

15
Q

The process of separating documents generated by a computer system from those created by a user. This automated process utilizes a reference list of known system files maintained by the National Institute of Standards and Technology.

A

De-NIST

16
Q

The process of identifying, securing and reviewing information that is potentially relevant to the matter, and producing information that can be utilized as evidence in the legal process.

A

Discovery

17
Q

All parts of a group of documents that are connected to each other for purposes of communication; e.g. an email and its attachments.

A

Document Family

18
Q

The eDiscovery process as it is practiced in the European Union.

A

e-Disclosure

19
Q

The process of identifying, preserving, collecting, preparing, reviewing and producing ESI in the context of a legal or investigative process.

A

eDiscovery, e-discovery, electronic discovery

20
Q

Information that is stored in an electronic format. This is used to prove or disprove the facts of a legal matter.

A

Electronic evidence

21
Q

An electronic communication sent or received via a data application designed for that purpose (e.g. MS Outlook, Lotus Notes, Google Gmail).

A

Email

22
Q

Electronically stored information.

A

ESI

23
Q

The process of applying specific parameters to remove groups of documents that do not fit those parameters, in order to reduce the volume of the data set, e.g. date ranges and keywords.

A

Filtering
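
A minimal sketch of the two example parameters, a date range and keywords, applied together. The `filter_documents` function and the document fields (`id`, `date`, `text`) are assumptions made for the example, not the layout of any particular review platform.

```python
from datetime import date

def filter_documents(documents, start, end, keywords):
    """Return ids of documents dated within [start, end] that contain any keyword."""
    kept = []
    for doc in documents:
        in_range = start <= doc["date"] <= end
        text = doc["text"].lower()
        has_keyword = any(keyword.lower() in text for keyword in keywords)
        if in_range and has_keyword:
            kept.append(doc["id"])
    return kept

# Hypothetical data set.
docs = [
    {"id": 1, "date": date(2020, 1, 15), "text": "Contract renewal terms"},
    {"id": 2, "date": date(2019, 6, 1), "text": "Contract draft"},
    {"id": 3, "date": date(2020, 3, 2), "text": "Lunch plans"},
]
kept = filter_documents(docs, date(2020, 1, 1), date(2020, 12, 31), ["contract"])
print(kept)  # [1]
```

Document 2 falls outside the date range and document 3 has no matching keyword, so both are removed from the data set.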

24
Q

The handling of ESI including collection, examination and analysis, in a manner that ensures its authenticity, so as to provide for its admission as evidence in a court of law.

A

Forensics

25
Q

The rules that govern ediscovery and other aspects of the civil legal process.

A

Federal Rules of Civil Procedure, FRCP

26
Q

An algorithm that generates a unique value for each document. It is referred to as a digital fingerprint and is used to authenticate documents and to identify duplicate documents.

A

Hash
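
To make the "digital fingerprint" idea concrete, the sketch below uses SHA-256 from Python's standard `hashlib` module. The choice of SHA-256 and the sample text are assumptions for the example. Note that hash values are unique in practice rather than in theory: two different files producing the same value is possible but extremely unlikely with a modern algorithm.

```python
import hashlib

def fingerprint(data: bytes) -> str:
    """Return the SHA-256 hash of the data as a hexadecimal string."""
    return hashlib.sha256(data).hexdigest()

original = b"Please review the attached agreement."
copy = b"Please review the attached agreement."
altered = b"Please review the attached agreement!"

# An exact copy produces the same value, which identifies it as a duplicate.
same = fingerprint(original) == fingerprint(copy)

# Changing a single character produces a different value, which reveals the alteration.
changed = fingerprint(original) != fingerprint(altered)

print(same, changed)  # True True
```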

27
Q

To make an identical copy of a drive including its empty space, “mirror image”.

A

Image (Drive)

28
Q

To make a picture copy of a document. The most common image formats in ediscovery are TIFF and PDF.

A

Image (File)

29
Q

Data whose format has become obsolete.

A

Legacy Data

30
Q

A communication requesting the preservation of information that is potentially relevant to a current or reasonably anticipated legal matter, and the resulting preservation.

A

Legal Hold

31
Q

A file used to import data into an ediscovery system. It defines document parameters for imaged documents and often contains metadata for all ESI it relates to.

A

Load File

32
Q

The devices used to store electronic information, e.g. hard drives, backup tapes, DVDs.

A

Media

33
Q

Often referred to as data about data. It is the information that describes the characteristics of ESI, e.g. sender, recipient, author, date. Much of the metadata is not accessible to non-technical users.

A

Metadata

34
Q

A file that is maintained in the format in which it was created. This format preserves metadata and details about the data that might be lost when the documents are converted to image format, e.g. pivot tables in spreadsheets.

A

Native Format

35
Q

Two or more files that contain a specified percentage of similarity. Also, the process used to identify those nearly-identical files.

A

Near-duplicate
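
The "specified percentage of similarity" can be measured in many ways. The sketch below assumes one simple measure, the Jaccard similarity of the two documents' word sets, and a threshold of 70%; both the measure and the threshold are illustrative choices, not what any particular tool uses.

```python
def jaccard(text_a, text_b):
    """Share of distinct words that the two texts have in common (0.0 to 1.0)."""
    words_a = set(text_a.lower().split())
    words_b = set(text_b.lower().split())
    if not words_a and not words_b:
        return 1.0
    return len(words_a & words_b) / len(words_a | words_b)

def is_near_duplicate(text_a, text_b, threshold):
    """True when the similarity meets or exceeds the specified threshold."""
    return jaccard(text_a, text_b) >= threshold

a = "the contract is due on friday"
b = "the contract is due on monday"
c = "lunch is on me"

print(is_near_duplicate(a, b, 0.7))  # True: 5 shared words out of 7 distinct words
print(is_near_duplicate(a, c, 0.7))  # False: 2 shared words out of 8 distinct words
```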

36
Q

Reformatting data so that it is stored in a standardized format.

A

Normalization
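
Dates are a typical case, since different systems write them differently. The sketch below converts a few date styles to the single ISO format YYYY-MM-DD. The list of accepted styles and the `normalize_date` name are assumptions for the example.

```python
from datetime import datetime

# Date styles this sketch knows how to read (an illustrative, incomplete list).
KNOWN_FORMATS = ["%m/%d/%Y", "%d.%m.%Y", "%Y-%m-%d"]

def normalize_date(raw):
    """Return the date as YYYY-MM-DD, or None if no known format matches."""
    for fmt in KNOWN_FORMATS:
        try:
            return datetime.strptime(raw.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None

raw_dates = ["03/14/2021", "14.03.2021", "2021-03-14"]
normalized = [normalize_date(value) for value in raw_dates]
print(normalized)  # ['2021-03-14', '2021-03-14', '2021-03-14']
```

Once the three values share one format, they can be sorted, filtered and compared as the same date.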

37
Q

The process of converting images of printed pages into electronic text.

A

Optical Character Recognition, OCR

38
Q

A document to which other documents/files are attached.

A

Parent Document

39
Q

A document categorization process that extrapolates the tagging decisions of an expert reviewer across a data set. It is an iterative process that increases accuracy with multiple training passes.

A

Predictive Coding

40
Q

In search results analysis, _____ is the measure of how relevant the results set is to the query, i.e. the percentage of the returned documents that are relevant.

A

Precision
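
As a worked example, precision can be computed as the number of relevant documents in the results set divided by the total number of documents returned. The document ids below are made up.

```python
def precision(returned, relevant):
    """Fraction of the returned documents that are relevant."""
    if not returned:
        return 0.0
    return len(returned & relevant) / len(returned)

# Hypothetical search: four documents returned, two of them relevant.
returned_ids = {1, 2, 3, 4}
relevant_ids = {2, 4, 5, 6, 7}

search_precision = precision(returned_ids, relevant_ids)
print(search_precision)  # 0.5
```

Half of what the search returned was relevant, so a reviewer would spend half the review effort on irrelevant documents.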

41
Q

The ediscovery workflow which ingests data, extracts text and metadata, and normalizes the data. Some systems include the data indexing and de-duplication in their processing workflow.

A

Processing

42
Q

The delivery, to the requesting party, of documents and ESI that meet the criteria of the discovery request.

A

Production

43
Q

In search results analysis, _____ is the percentage of the total number of relevant documents in the corpus that is returned in the results set.

A

Recall
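
As a worked example, recall can be computed as the number of relevant documents in the results set divided by the total number of relevant documents in the corpus. The document ids below are made up.

```python
def recall(returned, relevant):
    """Fraction of all relevant documents that the search returned."""
    if not relevant:
        return 0.0
    return len(returned & relevant) / len(relevant)

# Hypothetical corpus: five relevant documents exist, the search found two of them.
returned_ids = {1, 2, 3, 4}
relevant_ids = {2, 4, 5, 6, 7}

search_recall = recall(returned_ids, relevant_ids)
print(search_recall)  # 0.4
```

The search missed three of the five relevant documents, which is the kind of gap a recall measurement is meant to expose.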

44
Q

To intentionally conceal, usually via an overlay, portions of a document considered privileged, proprietary or confidential.

A

Redact

45
Q

The process of looking within a data set using specific criteria (a query). There are several types of search ranging from simple keyword to concept searches that identify documents related to the query even when the query term is not present in the document.

A

Search

46
Q

The unused portion of a disk that exists when the data does not completely fill the space allotted for it. This space can be examined for otherwise unavailable data.

A

Slack space

47
Q

The destruction or alteration of data that might be relevant to a legal matter.

A

Spoliation

48
Q

Data stored in a structured format such as a database. _______ can create challenges in ediscovery.

A

Structured Data

49
Q

_________ is a common graphic file format. The file extension related to this format is ___.

A

TIFF, .tif

50
Q

Most often, this is space created on a hard drive when a file is marked for deletion. This space is no longer allocated to a specific file. Until it is overwritten, it still contains the previous data and can often be retrieved.

A

Unallocated space

51
Q

The code standard that provides for uniform representation of character sets for all languages. It is also referred to as a _____________.

A

Unicode, double-byte language.

52
Q

Data that is not stored in a structured format, e.g. word processing documents and presentations.

A

Unstructured data