Informatics 2- Data Information and Knowledge Flashcards
data
- observations
- may or may not be meaningful
- computers do not understand
- input, store, process
- output zero (off) and one (on)
- each zero or one is known as a bit
- a series of eight bits is called a byte
- ex. 112134493 (pt id number)
information
meaningful data to draw conclusions
- take data and make conclusion
- ex. patient ID, interpreting diagnoses codes
knowledge
information justifiably believed to be true
-ex. smokers are more likely to develop lung cancer
bit
- only zero’s and ones’
- only language a computer understands
- computers do not understand you (they are dumb!)
byte
a series of eight bits is a byte
-10011101
integers
- numbers
- type of data
floating point numbes
- type of data
- 3.5456
- decimal
character
- 8 bytes is a character
- ex. “a” and “z”
strings
- putting characters together makes strings
- “hello” or “ball”
file formats
from least to most storage:
- image files
- text files
- sound files
- video files
image files
- JPG
- GIF
- PNG
- more clarity -> more storage*
text files
- txt
- doc
sound files
- WAV
- MP3
video files
MPG
informatics vs. IT and computer scientists
- computer science- software
- informatics- broad spectrum, hardware, software, combine them into a relative way
- IT search or sort data more efficiently
- takes data and makes it easier to use (sort, filter)
- informatics manipulates information (tools vary, could be computers)
- information retrieval:
- relationship between aspirin and heart attack -> finding correlations
- problem is identifying documents that contain certain meaning
data to information
- vocabularies help convert data into information
- ICD-10 162.9 is meaningless datum
- interpreting ICD-10-CM as “Lung neoplasm, not otherwise specified” turns datum into a unit of information
- human interpretation is necessary
- interoperability- transmission of information
- consistency of interpretation
information to knowledge
- information produces knowledge
- in clinical world, evidence exists that knowledge is true rather than proven fact
you cant convert data to knowledge
-it needs to put into something meaningful first (information)
clinical data warehouse (CDW)
- clinical data are collected via electronic health records (EHRs)
- clinical records composed of:
- structured data- billing codes, lab results, ICD-9-CM 162.9 = Lung Neoplasm, medication lists etc… -> easier to manage and retrieve
- unstructured or (free text)- clinical notes, natural language, but difficult to process -> natural language processing (NLP)
- shared database that collects, integrates, and stores clinical data from a variety of sources including electronic health records, radiology and other information systems
- staging: extract, transform and load
- EHR designed for real time updating of individual data
- CDW supports queries (a search) for groups
- take all the information and make a diagnosis
- cant be deleted after it enters the data warehouse
patients charts are made up of
-structured and unstructured data
van der Lei
- data shall be used only for the purpose for which they were collected
- this law has a collateral: if no purpose was defined prior to the collection of the data, then the data should not be used
- if there is no purpose why collect it?
- waste of time! we dont have time
query
- search
- looking for something
- query the x-ray
CDW as a clinical resource
- monitor quality to query for specific quality measures in specific pt populations
- all the people in the background are communicating with pt, looking that everything is being handled and input correctly
- clinical and translational researchers to identify trends and link research with clinical practice
- hospital infection control specialists track pathogens
- public health agencies conduct surveillance for natural or man-made illnesses
- informatics for integrating biology and the bedside (i2b2) project by Harvard
use of aggregated clinical data
- recognize records for pts with specific conditions
- could be use of billing codes (controlled vocab -> ICD-10-CM)
- concept extraction- identifying concepts within unstructured data -> extracting information from free text clinical notes (discharge summaries or pathology reports)
- need to map between terms or phases and controlled vocab with accuracy
- good notes -> better care