10. NLP for Clinical Text Flashcards

1
Q

what is NER and how is it used in health

A

NER = named entity recognition

usage = clinical NER to scan clinical documents, research papers & categorise entities such as treatments, drugs and diagnoses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is clinical de-identification & how does it satisfy the governance around patient privacy

A

HIPAA requires hcp to protect patient medical information from disclosure with the only exception of disclosing data without their consent or knowledge if it is de-identified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how does NLP apply to HIPAA requirements on HCP

A

can use NER to scan medical documents & identified protected hc info (PHI) including patient names & phone #

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

give examples of NLP in hc

A
  • clinical documentation
  • speech recognition/dictation
  • computer assisted coding
  • clinical decision support
  • virtual scribe/chatbot
  • clinical trial matching
  • computation phenotyping
  • EMR dictation
  • root cause analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is the EHR & it’s architecture

A

structured coded form of lab results, discharge diagnoses, pharmacy orders etc.

uses XML standard model to form CDA (clinical data architecture)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is the NLP workflow

A
  • obtain data
  • preprocessing
  • tokenisation
  • word embedding & representation
  • build & train model
  • evaluation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is stemming

A

crude chopping of affixes to reduce terms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is lemmatisation

A

reducing inflections or variants to base form

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

give an example of the diff. between stemming & lemmatisation

A

lem: changing = change
stem: changing = chang

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is the purpose of word embedding

A

representing words in a vectorised format of fixed numbers

this allows calculation of similarities between two sets of text using the cosine distance, which measures the angle between two vectors when represented in space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is a common embedding model

A

Word2Vec

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is lstm

A

long short term memory

modified version of RNN that make it easier to keep past data in memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is BERT

A

bidirectional encoder representations from transformers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the BERT procedure

A
  • uses an MLM (masked language model)
  • MLM masks a word and uses words on either side to predict the masked word
  • uses a transformer architecture
  • transformers create differential weights to identify the most important words in sentences that should be processed
  • the transformer layer is often called the encoder
  • transformers can be good for processing a lot of unsupervised data efficiently
  • BERT does not have a decoder
  • decoders are used to predict target output
  • also uses next sentence prediction where pairs of sentences are used for training, so the subsequent sentence in the document can be predicted by the model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

equation for precision

A

TP/TP+ FP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

equation for recall

A

TP/TP+ FN

17
Q

what are some NLP eval metrics

A

basic = precision, recall, F score
BLEU = bilingual evaluation understudy

18
Q

what is BLEU

A

an NLP evaluation metric