AIIM CIP Deck Flashcards
ANSI/NISO 239.19-2005
Definition of Taxonomy
ISO 15489-1:2001
metadata definition and categorization (records management generally
ISO/IEC 15948:2004
PNG
ISO/IEC 10918-1
JPEG
ISO 19005
PDF/A
ISO 32000
PDF 2nd iteration
ISO 23081
metadata accrues overtime: metadata generally
Capture
The process of (1) getting information from its source into some type of more formal information management environment (input) (2) then recording its existence in the system (tagging)
Sources of information
- PCs
- Laptops
- Tablets
- phones
- other new technology
- Fileshares
- local storage drives
- disks
- USB drives
- hosted applications
- paper
- business apps (CRM HR)
Structured information
- Consists of fundamentally spreadsheet data in a table over many linked tables
- has a fixed structure
- usually database
- most information repositories are combinations of structured data and some place to store binary files associated with them
Unstructured Information
Variable in: * format *content Word, Excel, Project, PDF, Scanned TIFF, email Might have rules associated with content
Capturing structured
input annually by entry or system
extracted from on syntax to others.
Capturing unstructured
a formal procedure so…
(1) info can be controlled
(2) filed in structured content with related items
to give context, protect, retain, and search.
File Format Considerations
Can be highly proprietary only accessible with special software or tools. Consider: -audience -regulatory requirements -value of information overtime
JPEG
Joint Photographic Experts Group
Very good at compression of continuous tone images.
Lossy compression - data loss in compression so repeated conversions may end up in data loss
ISO/IEC 10918-1
PDF/A - archive ISO 19005, ISO 32000
PDF/Engineering
PDF/x prepress digital exchange
PDF/UA - Universal Accessibility
PNG
Portable Network Graphics Lossless Compression 32 bit color a W3c standard '96 ISCO/IEC 15948:2004
TIFF
Tagged Image File Format
-most scanned
lossless compression, good for bi-tonal(black and white)
also support multiple pages but not all TIFF viewers support all options, and not all browsers have TIFF viewers.
ECM
Enterprise Content Manager
Core capability
-Document Management - check out, tracking, version
-Record Management - Formal content based rules and specific retention
-Workflow-take specifications based on metadata
-search
-web content management
-capture/scanning - applying metadata
-collaboration
-publishing - content available on multiple platforms
-archiving - for collaborative communication
Digital Asset Management
ECM for media
rich media, audio, video, digital photographs, design documents, logos
May be a dedicated system or added on to ECM
- Tracks copyright license restrictions
-specialized metadata
EFSS
Enterprise File Synching and Sharing Solutions
- mostly cloud based
- allow users to share and sync documents over multiple devices
BUT come from consumer base and lack many enterprise functions like central control, security, metadata, lifecycle management
Capture - taking control
input side of information management
Everything is tagged: for every object ask “wher does this go?”
Microsoft Office Integration
ECM Typically allow for integration that intercept the file save menu.
ECM - importing existing content
- Most ECM have simple tools
- system admins also may have batch import utilities and can automate BUT some prep is needed including cleanup of dups and junk but follow the IG rules for the company
Automating Information Capture
Ways to Auto
- by role or user
- by content type
- by work process
- through workflows
- through metadata values
- through bulk import
- through analytics
AutoCapture by role or User
- id users most likely to create records-then capture everything
Usually
*senior management
*assistants to senior management
*specific roles, legal staff or personnel/ HR staff
*anyone making business decisions
AutoCapture by Content Type
identify specific types of documents e.g. contracts, invoices, personnel records. Many ECMs have definition of content type or record class and may associate metadata fields or values, business rules, blank templates.
AutoCapture by Work Process
those inherently decision or transaction oriented - by default when the final version is approved or signed normally
- contracts
- invoices
- wage statements
- financial statements
AutoCapture through workflows
rules can be defined that at a certain step in workflow a record is created, identified then processed
Like contract review when approved it is a record then when executed the executed copy associated with the record.
AutoCapture through metadata Values
used with bulk import could use tool to crawl metadata -file formats -dates -location
Autocapture through Bulk import
taking legacy information and building into new system
- maybe too expensive to do one at a time
- at need to be formatted to flat file format to import to new system
- others have utilities to do this
- to be valuable metadata should be included in import
AutoCapture through Analytics
relies on text, metadata, rules, etc. SME's help define -more scalable -doesn't rely on humans -more consistent -even if wrong -transparent
forms processing
most of form unwanted 1. scan form 2. is it readable 3. which standard form is it form recognition software easily places information into a database