BMI2207C CA2 Flashcards by Benecia Thia

Difference between the EHR and EMR.

Electronic Medical Records (EMR) :
- Digital version of the paper charts and structured data in hospitals or clinical office
- System to capture medical records of a patient
- Does not travel easily out of hospitals
- Examples : EPIC and Allscripts

Electronic Health Records (EHR) :
- Total health of the patient going beyond standard clinical data collected in the provider’s office or hospital
- Reach out beyond the health organisation that originally collects the information
- Includes information from all clinicians in the patient care
- Built to share information with other healthcare providers like labs and specialists
- Examples : NEHR

How well did you know this?

Not at all

Perfectly

Explain what the term “Meaningful use” means.

It is a term to get HCPs to begin sorting and sharing of health data electronically to be able to better improve clinical processes and health outcomes for patients

How well did you know this?

Not at all

Perfectly

Functions of “meaningful use”.

Improve quality of patient care
Engaging patients in health
Easier to coordinate care
Improve overall health of a given patient population
Secure and protects people’s health information

How well did you know this?

Not at all

Perfectly

Explain the stages in meaningful use, and state which stage Singapore’s EHR is in.

Stage 1 : Focused on getting healthcare providers to adopt EHRs and store clinical data electronically

Stage 2 : Encourage healthcare professionals and institutions to then use the data and technology to improve the quality of care for their patients and make it easier to exchange information within and between organisations

Stage 3 : Centred on leveraging EHRs and clinical data to improve health outcomes, and ease reporting requirements to align with other government health programs

**Singapore’s EHR meets Stage 1 definition of “meaningful use”

How well did you know this?

Not at all

Perfectly

Explain what “Directed Exchange” means.

Health information is sent directly to other providers over an encrypted secure connection.

How well did you know this?

Not at all

Perfectly

Explain what “Query-based exchange” means.

Data is requested by providers to a central Health information exchange (HIE).

How well did you know this?

Not at all

Perfectly

Explain what “Consumer mediated exchange” means.

Patients are involved in the collection and transmission of healthcare data to providers.

How well did you know this?

Not at all

Perfectly

List the 12 persistent risks in AI.

Disinformation
Safety and security
Black box problem (cannot see what process AI has undertaken before it has generated data output)
Ethical concerns (Non-maleficence, Beneficence, Autonomy, Manipulation)
Bias
Instability
Hallucinations in LLMs (AI giving false outputs and causing unnecessary panic)
Unknown unknowns (we are not really sure how AI is going to react, blindspots preventing anticipation of AI behaviour)
Job loss and social inequalities
Environmental impacts
Industry concentration
State overreach

How well did you know this?

Not at all

Perfectly

Explain the relationship between AI, ML, DL, NLP, LLM and Conversational AI.

Artificial Intelligence includes ML, DL, Conv. AI, LLM, NLP
ML includes DL, LLM, Conv. AI, and can intersect with NLP
DL can intersect with Conv. AI and LLM
Conv. AI and LLM are both subsets of ML and NLP
NLP overlaps with ML, but has independent parts as well
**Data science, data management, descriptive analytics and visualisation is NOT AI and is a complete other subset

How well did you know this?

Not at all

Perfectly

List the 2 roadblocks of modern AI.

Common sense problems and implicit knowledge – Cognitive and contextual limitations require excessive exceptions to be inputted to model interactions
Lack of training/labelled data – Amount of inputs are never enough, especially for cellular images

How well did you know this?

Not at all

Perfectly

Differentiate supervised and unsupervised learning in machine learning.

Supervised learning :
- Classification is the problem of predicting the correct category/label for a input object
- Regression is the prediction of continuous response

Unsupervised learning :
- Clustering is the problem of identifying implicit groupings in the data

How well did you know this?

Not at all

Perfectly

Explain regression analysis.

Regression analysis is a set of statistical processes for estimating the relationship between a dependent variable and one or more independent variable

Linear regression : Models relationship b/w 2 variables and estimates the value of a response by using line of best fit
Multiple regression : Models relationship between variables (with >1 independent variable) and method of least squares is used to find p-dimensional plane
Logistic regression : Predicts a binary outcome and is a sigmoid function to map predictions to probabilities ; independent variables can be categorical/numeric, but dependent must ALWAYS BE CATEGORICAL due to binary output

How well did you know this?

Not at all

Perfectly

Define artificial neural network (ANN).

ANN represents a prediction model as a series of multi-layered interconnected nodes functioning similar to neurons.

Learning in ANN involves adjusting the weights of various inputs to minimise errors in output
Model will back propagate to correct weights until the error rate converges

How well did you know this?

Not at all

Perfectly

Explain Deep Learning.

Subset of ANN that uses multi-layered neural networks
Allows for more complex feature detections and requires more training data
Image recognition, natural language, processing and autonomous driving

How well did you know this?

Not at all

Perfectly

Define clustering and the 2 categories.

Clustering is defined as the classifying of data into points based on similarities, extracting underlying groupings without labels.

K means clustering : Clustering based on centroids, which changes its centroids until intra-cluster distance is minimised and inter-cluster data is maximised

Density based clustering (DBscan)

How well did you know this?

Not at all

Perfectly

How do we conduct evaluation of ML models?

Choose the best model in the market and use it as the benchmark for comparison with other models
Ensures models are generalised well, not overtrained or under-trained
Hyper-parameter tuning where hyperparameters can significantly affect model performance
Interpreting pitfalls of models

How well did you know this?

Not at all

Perfectly

Explain the train-validation-test split.

Study These Flashcards

Refers to the segregation of an entire dataset into various percentages to train, validate and test the learning model.
- 70% of entire database to help ML model learn and train
- 15% of database to help validate (fine tune model hyperparameters)
- 15% of data to test the model to see if it is working

Explain what cross validation for model fitting means.

Study These Flashcards

The N-fold cross validation is used to evaluate the performance of a ML model through a series of training and testing with random sets.

List the metrics for evaluating ML.

Study These Flashcards

Accuracy
Precision
Recall/sensitivity (proportion of correct predictions from all positive instances)
F-1 measure
True positive, False positives
Area under curve

Interpret the area under curve.

Study These Flashcards

Residuals : Errors and deviations between actual and prediction values, hence we use line of best fit

Overfitting : Model “learns too much” and is unable to give a generalised trend

Underfitting : Model “learns too little” and gives a low prediction accuracy

Applications of AI in healthcare

Study These Flashcards

Enhances diagnostic accuracy
Improve treatment outcome and process
Reduce healthcare cost
Expand access to healthcare services
BUT, Ethical, regulatory and privacy concerns

What are Language Models?

Study These Flashcards

Creates a circuit that guesses an output word given a bunch of input words.

What does self-supervision mean?

Study These Flashcards

Encoder, decoder network is trained to output the same word as the input
Encoders are circuits that are able to take in much more words using point vectors

What does a masked language model mean?

Study These Flashcards

Masking part of the sentences and testing the LLMs to see whether they can fill up the mask with the same output
However, when the mask is at the end of the sentence, the model would keep guessing for the next word in the sequence and form a autoregressive model instead

Unique points of LLMs

- Uses transformer architecture - Self-attention mechanism - Embeddings - Positional encoding - Multi-Head Attention - Feedforward Neural Networks - Training processes by learning language (pre-training on large amount of texts and fine-tuning by training on specific tasks)

**Things to know about LLMs

1. Training on random information 2. Model is data biased (would only generate content it is trained on) 3. Model cannot differentiate what is right or wrong 4. LLMs may make mistakes and hallucinate, or generate certain preferences 5. Trust deficit therefore requires validation 6. Quality of response is directly proportional to quality of input prompt 7. Model cannot undergo a real conversation as it does not remember everything that was said

What is a "prompt" in the context of generative AI?

Text or input provided by the user to initiate a response or action from the AI

Parameter to change how random and creative AI generated response is?

Temperature. The higher the temperature, the more random, diverse and less predictable the outcome is.

What are tokens in LLMs?

A token is a word, punctuation mark or symbol that represents the smallest unit of text that the LLM can process.

Name the framework for prompt generation.

CO-STAR framework

Explain the CO-STAR acronym.

- Context (background info of scenario) - Objective (clear and specific task definition) - Style (writing style of response) - Tone (sentiment) - Audience (intended audience of the response) - Response (response format)

6 common task specific prompts to get LLM-powered AIs to do?

- Rewriting - Extracting - Classifying - Clustering - Summarising - Generating

What is the purpose of thought-based prompting?

Getting the AI to generate a list of steps as to how it come to a certain conclusion.

Explain the data challenges in medical AI.

1. Data Quality and Availability 2. Privacy and Security Concerns 3. Data bias and representation

Explain the technical and integration challenges in medical AI.

1. Integration with existing systems 2. Scalability issues 3. Complexity of Medical Conditions

Explain the ethical and legal considerations in Medical AI.

1. Ethical concerns 2. Regulatory challenges 3. Liability & Accountability

Implications and importance of medical data security.

- Leads to identity theft, financial loss - Has life-threatening implications if medical records are altered or stolen - Results in hefty fines and reputational damage on organisations

List the practices in data security.

1. Two-factor authentication 2. Access control 3. PDPA guidelines 4. Guide staff on data handling, access and protection 5. Have clear SOPs on how to respond to breaches 6. Monitor and conduct audits to ensure compliance with policies and regulations 7. Encrypting data in transit Future works : - Decentralised data storages - Privacy considerations from the outset

List the regulatory framework for healthcare data privacy and security.

- Cyber and Data security guidelines (CDSG) - Personal Data Protection Act (PDPA) - Health Information Bill (HIB)

BMI2207C CA2 Flashcards

(40 cards)