Concepts Flashcards
Learn about digital preservation concepts.
What is…
Format Identification
Recognizing a file’s format based on markers inside the binary stream. The first stage of digital preservation recognised as ‘knowing what you’ve got’.
What is a…
File Format Extension
File extensions are the characters following the last full-stop in a filename, e.g. ‘.txt’, ‘.exe’, ‘.xls’. A clue to the file’s format but not a guarantee.
What is a…
File Format Signature
Binary markers inside a file that indicate its format and version. To find a signature means reading a file’s content and so it is a more sure way of finding what a file is.
What is a…
Magic Number
✨Magic✨ number is often used as a synonym for file format signature.
The etymology for the term dates back to the seventh version of the Unix operating system (1979).
The use of magic numbers grew as requirements for them did. The use of the phrase file format signature seems to have come about through the maturisation of the field of digital preservation.
What is…
Web Archiving
The process of crawling a website in its current form and duplicating it, and its resorces (images, sound files, etc.) offline, or simply, elsewhere, that is, in a web-archive.
What is a…
Memento
- Memento is first a project to support web archiving and finding of web archives in a standardised way.
- Memento is second, a synonym for the snapshot of a website - an archived website may be called a Memento.
- Memento is an intitiative from Los Alamos National Laboratory and Old Dominion University, in the US.
What is…
Web Crawling
Automation of the web-archiving process. A tool crawls a website by looking at all of the links stemming from it and then visiting those one-by-one, potentially doing the same at the next site - figuratively, crawling.
What is…
Fixity
Properties of an object, digital or otherwise, that prove it to be fixed - its state hasn’t changed. The last-modified date of a file is a potential measure of fixity. A file’s checksum value is a more robust measure of fixity.
What is…
Risk
From risk management, the formal statement of a risk is as follows:
“Because of x there is a risk that y which will result in z. “
The statement enables us to think about risk in terms of its impact and therefore steers us away from the concept of risk as in fear.
Impacts should be measurable, and real.
What is…
Obsolescence
The process of technology becoming unreadable ot unusable. Digital preservation requires monitoring of potential obsolesence.
What is…
Data Obfuscation
Hiding data from plain-view or use such that it is obfuscated, e.g. encryption, redaction, and password protection.
What is…
Metadata
Data about another item, for example, the number of pages in a book, and the number of words. Metadata about a digital object can be anything that describes that file or something in the file, for example, a digital image’s resolution (number of pixels along the x and y axis.)
What is…
Metadata Extraction
The extraction of metadata from a digital object, often using tools that can read the file and export the information in a machine-readable form such as XML or JSON.
What is…
Characterization
Characterization is whereby metadata crucial to the preservation of the digital object is recorded.
This information may describe the object itself or part of its technical environment.
What are…
Significant Properties
Properties of individual records or groups of records that may be prioritised for preservation, and used as a measure of a successful ‘preservation action’, e.g. if the number of pages in a record is considered to be important, it is a significant property we need to monitor and measure.
What is…
Metadata Mapping
The process of selecting metadata about a digital object and encoding it into an alternative schema, e.g. for archival description, or preservation.
What are…
Digital Humanities
The use of digital techniques to support the scholarly study of the humanities (Literature, Archaeology, Architecture etc.).