Measurment and Descriptive Analysis Flashcards
Why is it important to classify the type of data?
-It determines the type of statistical test that is going to be used
-the type of data will determine how it is described
-when analyzing an article the type of data needs to be determined
Data used for Qualitative data
-Categorial (Nominal data)
-Ordinal data
How are qualitative data described?
-Qualitative
-No mathematical data
-fall into distinct and discrete categories (finite number of categories)
-Gender (1=male, 2=female)
-Pass/fail
-Race
-Eye color
-Clinical diagnosis (1=heart failure, 2=renal failure,..)
Characteristics of categorical data
-Qualitative
-There is no natural order between categories (eye color, dead or alive, male or female)
What are dichotomous data?
If there are only 2 groups, data are
dichotomous (e.g., male/female)
What is an Ordinal data?
-Qualitative
-data with natural order
-Values/observations can be ranked (put
in order) or have a rating scale attached (f.e. rate your experience from good to bad)
-Numbers are not arbitrary in ordinal data (it has a meaning, f.e. the higher the better)
What are examples of Ordinal data?
-Pain scale (ranked, but not continuous)
-Likert scale (Strongly agree=5, agree=4, undecided=3)
-both are not continuous
The average score of an ordinal data (from 1 to 5) is 4.75, what is wrong with that statement?
4.75 does not represent a given category
it is better to use the median (middlemost value) rather than the average, bc the median (=3) fits into a category
What is Quantitative data?
-have mathematical meaning
-derived from counts or measurements
-most biological systems are represented in quantitative data
What type of data is used for Quantitative data?
-Continuous data
-values can take on any number (also fractions)
-biomedical values are continuous
-temperature, blood pressure, weight, LDL, age
What is the Baseline Characteristics important for?
-Internal Validity: ensure that both groups are similar, thereby preventing cofounding
-External Validity: are the results generalizable to another location?
The duration of treatment:
The number of patients who are treated < 4 wk in (%):
Drug A: 60 (25.1%) n=239
Drug B: 44 (18.5%) n=238
Drug C: 55 (22%) n=250
What type of data is that?
-Dichotomous: YES OR NOR
-> The question is: are pt for a Drug treated for less than 4 weeks / YES OR NO
for Drug A: 60 were treated for < 4 weeks: YES (179 were not)
for Drug B: 44 were treated for <4 weeks
for Drug C: 55 were treated for <4 weeks
Concomitant psychotropic treamtment with
Trazodone: 23
Anxiolytics: 44
Seative or hypnotics: 19
What type of data?
Categorial bc it can be put into buckets, which are Trazodone/Anxiolytics/Sedative or hypnotics
How many patients had Fever/Cough/Ronny nose?
Fever: 213
Cough: 163
Runny nose:78
What type of data?
Categorial (Nominal)
-can be put in buckets
Which type of data do percentages often fall into?
-Qualitative data
-Categorial, Nominal, dichotomous
What is Descriptive Data?
-Measures of Central Tendency (values around the mean)
-Measure of Variability: How scattered, dispersed are the data
What does “Measures of Central Tendency” mean?
The data has the tendency to convert on the most central value (Median and Mean)
How is the Measures of Variability expressed?
-Standard deviation SD
-standard error of the mean SEM
-confidence intervals
-range
-percentile
-interquartile range
What is the purpose of descriptive data?
-Describe, organize, or summarize actual data
-No statistical conclusions are drawn
What is a “Mean”?
-Arithmetic average of the data
-Affected by outliers (extreme values of
data distribution
-often used to describe normally distributed continuous data
Which value is less affected by outliers?
-The median
-bc we have more values in the normal range which are near to the median and outliers far away from the median -> thereby less affected by outliers -> giving a better picture of the average
What is the Median?
-Mid-most value (50th percentile)
-Half the data points are above and below
-Unaffected by outliers
-Often used to describe non-normally
distributed continuous data
-often used to describe ordinal data (Pain scale, Likert scale)
What is the median out of these values?
1, 2, 3, 4, 5 -> 3
1, 2, 3, 4, 5, 6, 7, 8 -> calculate the average of the middlemost values -> 4+5 = 9/2 = 4.5