Biostatistics and its Application to Health Science Research Flashcards

1
Q

Statistics is the field concern with four things, and what information is drawn from a subgroup ?

A

Statistics definition

•Field of study concerned with:

1) The collection, organization, summarization, and analysis of data.
2) The drawing of inferences about a body of data when only a part of the data is observed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a population Vs sample? What is a unit of analysis ? What are variables ?

A

Basic Concepts

Population: set of individuals on which a given characteristic is studied

–Sample: a subset of a population that is used to represent the entire group.

Unit of analysis: is the “who” or the “what” that you are analyzing for your study. –Variables: properties, attributes or characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are parameters ? Statistics ? What is an estimator ?

A

Basic Concepts •Parameters: numbers that summarize data for an entire population. •Statistics: numbers that summarize data from a sample –If a statistic is used to approximate a parameter it is also often called estimator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What’s the difference between population and sample ?(diagram)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are dependent vs independent variable.

A

Type of variables

•Dependent (outcome)

–is the variable being tested in a scientific experiment

–those that depend on the value that is assigned to other phenomena or variables

•Independent (exposures, predictors)

–a factor or phenomenon that causes or influences another associated factor or phenomenon

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Nominal ? Ordinal ?.

A

Type of variables

•Categorical Variables (qualitatives)

Nominal: Two or more mutually exclusive categories without having any kind of natural order. They are variables with no numeric value.

nominal sounds like name

..Names.basically refers to categorically discrete data such as name of your school, type of car you drive or name of a book. This one is easy to remember because nominal sounds like name (they have the same Latin root).

–Ordinal: The order matters but not the difference between values.

Ordinal refers to quantities that have a natural ordering. like people in the line

ordinal sounds like order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the difference between discrete vs Continuous ?

A

Type of variables

•Numeric Variables (quantitative)

–Discrete: can only takes on a finite number of values

•Ex: # of beds at the hospital, # persons in a room

–Continuous: can take on any value in a certain range.

•Ex: Age, Weight, Height

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are four type of measurable scales ?

Why are they assigned this way ? (NO Irvin) measure this

A

Measurement Scales

•There are 4 types of measurement scales

–Nominal

–Ordinal

–Interval

–Ratio

•The 4 types of scale are arranged so that all the above scales have properties from the previous scale plus an additional property

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

.What is is nominal used for ? what is an example with smokers and non smokers.
Do binary data count ?

A

Nominal

  • Use names to establish categories
  • You can use numbers, but these are of a symbolic nature

Example:

Dichotomous / binary data

•Gender –> Male = 1 / Female = 0

Categorical data

•Civil Status–> Single = 0 /Married = 1 / Widow = 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is an example of a psychiatric evaluation of ordinal scale ?

What is ordinal ? does it number matter ?

A

Is the following example of nominal scale?

yes, it is just setting catagerous,

Symptoms of depression in psychiatric evaluation

  • None = 0
  • Mild = 1
  • Moderate = 2
  • Severe = 3

Ordinal

•When the order of the categories is important

–Numbers are used to place the categories in an order

•You can not set distance between two points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is an interval ? What does zero mean ?

A

Intervals

  • Meets the above characteristics
  • Numerically records the distance between two points
  • The zero does not indicate the absence of the characteristic and it is arbitrary

Example: temperature (0° C does not express an absence of temperature instead it is a state of the variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is ratio ? What does zero mean ? Where is this method used in medicine ?

A

Ratio

  • It corresponds to the more complete level of measurement.
  • Zero indicates the absence of the characteristic and is absolute
  • The difference between two values is of known magnitude
  • Most of the measures used in medicine (anthropometrics and quantitative from the labs) use this type of scale.

Example: Weight – 0 pounds state an absence of weight Distance – 0 Km state an absence of distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

.What is descriptive statistics ? What kind of conclusion do you reach ?

A

Descriptive Statistics

  • Numeric or graphical summary of data
  • You reach statistical inferences using descriptive statistics

Like a scatterd plot

–Process to generalize the collected data by the researcher (sample) to all possible observations of interest (population)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

.What is Frequency ? frequency distribution?

A

•Frequency: the number of times that something happens during a particular period

Frequency Distribution

•An arrangement of statistical data that exhibits the frequency of the occurrence of the values of a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

.What are the three frequency in frequency distribution?

A

Frequency Distribution

  • Absolute Frequency
  • Relative Frequency
  • Cumulative Frequency

–Absolute

–Relative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is an absolute freqency ?

A

The absolute frequency is simply the total number of observations or trials within a given range. For example, assume there is a collection of grouped data for the percentage returns for a particular stock, which is ranged from lowest to highest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is relative frequency ?

A

how many times on thing happen/ total times all other thigns happen.

5 red apples were picked / 20 total apples were picked

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is relative frequency ?

Cumulative Relative Frequency

A

Relative Frequency. How often something happens divided by all outcomes. Example: Your team has won 9 games from a total of 12 games played: theFrequency of winning is 9. the Relative Frequency of winning is 9/12 = 75%

The Cumulative Relative Frequency is the sum of the relative frequencies for all values that are less than or equal to the given value.Aug 24, 2010

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

.

A

Measures of Central Tendency

  • Numbers representing a central value in which the data appear to be clustered around
  • Mean
  • Median
  • Mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is a mean? what is the formula ?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is a medium ?

A

Median

  • It is less affected by extreme values than the average
  • It is the number that is exactly in the middle of a list of values that has been ordered
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

.What is a mode ? unimodel ? Bimodel ?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is an example of a dot plot ?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is skewness and kurtosis ?

A

Skewness and Kurtosis

  • The measures of central tendencies obtain a representative value of the data.
  • From the measures of variability (dispersion), we can know that whether most of the items of the data are close to or away from these central tendencies.
  • Statistical means and measures are not enough to draw sufficient inferences about the data.

–This symmetry is well knowledge of the skewness

–Still one more aspect that we need to know is its flatness or otherwise its top. This is understood what is known as Kurtosis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Review mean , median, mode grapth.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is leptokurtic, mesokurtic, and platykurtic ?

A

lep·to·kur·tic

ˌleptəˈkərdik/

adjective

STATISTICS

(of a frequency distribution or its graphical representation) having greater kurtosis than the normal distribution; more concentrated about the mean.

mes·o·kur·tic

ˌmezəˈkərtik,ˌmē-/

adjective

STATISTICS

(of a frequency distribution or its graphical representation) having the same kurtosis as the normal distribution.

Platykurtic is a type of statistical distribution where the points along the X-axis are highly dispersed, resulting in a lower peak (lower kurtosis) than the curvature found in a normal distribution.

Lep-high

meso-middle-normal

platy-platued,down.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

.What is a histogram, and what is another name for it ?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is the six degree of variations of observations.

A

vartiations is a type of srick

Measures of Dispersion

•Represents the degree of variation of the observations

–Amplitude or Range

–Interquartile range

–Standard Deviation and Variance

–Coefficient of variation

30
Q

What is amplitude and range? Is it useful , when is it not ?

A

Amplitude or Range

•The difference between the highest and lowest values

13, 15,17, 18, 19, 21, 21, 25, 28, 28

𝐴𝑚𝑝𝑙𝑖𝑡𝑢𝑑𝑒=28−13=15

  • Not very useful since it is limited to the extreme values
  • Sensitive to very large and very small values
31
Q

What is interquartile range (IQR) ? What percent does it indicates the disperson in the central ?

is it easily influenced by extreme values ?

What is the formula ?

A

Interquartile Range (IQR)

  • It is not easily influenced by extreme values
  • It indicates the dispersion in the central 50% of the distribution
  • IQR is the difference of the third (Q3) and the first quartile (Q1) of the data

IQR = 𝑄3−𝑄1

32
Q

.What is interquartile range ? (IQR)

A
33
Q

What is the formula for interquartile range ? what do each mean ?.

A
34
Q

Turn it around and the the problem of interquartile range ?.

A
35
Q

Examples of interquartile range . Turn in around

A
36
Q
A
37
Q

What is variance ? The formula ? What is the same too ?

A
38
Q

Turn around the variance example ?

A
39
Q

Turn around example of variance ?.

A
40
Q

What is standard deviation, and what is the formula ?.

A
41
Q

What is standard deviation for ?

A
42
Q

What is coefficient of variations used for ? What is the ratio of ?

A
43
Q

When does the coeeficiant variations less useful ? When CV is greater than 26% what is it considered ?

why does this make sense ?

A

Is it used to compare 2 data sts, when the unit is differetnt

standards deviaiton/ mean

Coefficient of variation (CV)

  • CV is less useful when the average approaches zero.
  • CV greater than 26% is considered very heterogeneous

–0-10% = very homogeneous

–11-15% = homogeneous

–16- 25% = heterogeneous

–>25% = very heterogeneous

44
Q

Which is the most and less homogenious ?.

A
45
Q

what is a hypothesis ? What is a null hypothesis ? Alternative hypothesis ?

A
46
Q

.

A

What is HO, and HA in two sided , and the one-sided ?

47
Q

.Where is rejection region or critical value approach ?

A
48
Q

What is a P value ? What happens if your P value is less or equel to significance value ?

A
49
Q

What happens when you have signicance level of alpha of 5% ? Would you reject the Ho ? What is signfiance level also mean ?

A
50
Q

.Turn over and explain true and false negative or positive

A
51
Q

.Label what is statistical power, reject null, and fail to reject null

A
52
Q
A
53
Q

.What is statistical hypothesis ? Empirical statement , and how can it be proven right or wrong ?

A

A statistical hypothesis is an assumption on the parameter of a population.

–Prediction of what study will find.

  • Empirical statement – verified based upon observation or experience.
  • Testable to be true or false through the research study findings.
54
Q

Chi-sqaured test and fishers’s exact , which one is parametric, and one which is not ?

do all people like chinese food ? how about fish ?

practice the example of Ho and Ha between males who will be obese compared to females

A
55
Q

Independent vs dependent variables. How would you draw this ?

A
56
Q

.what is used for unpaired T test ? Is it parametric or non-parametric ?

What is indepdence ? Normal distribution ?

Homogeneity of variance ?

Is Mann-Whitney test parametric ?

A
57
Q

Examples of how to use Ho and Ha

A
58
Q

.examples of indep and dependent variable ?

A
59
Q

What do you use for unmatched data ? is it parametic ? and what other usesis there ?

Is keuskal-Wallis Test parametric ?

A
60
Q

.Examples of indep- and depedent

A
61
Q

Examples of independent and dependent variable .

A
62
Q

What is Pearson correlation ? It must be normally distrubuted, therefore it cannot have any…

The values are between ? What do the ranges mean ?

Is it possible to predict? and what are accepted scale of measurements ??

A
63
Q

What happens to pearson correlation when p=0 or p=/ 0

What happens if a null hypothesis is rejected ?

A
64
Q

Is Spearman correlation sensitive to outliers ? Are avariables normally distributed ?

Is it non linear ?

What are the ranges ? from # to # ?

What happens if Rs is =1 , -1, or 0

What are accepted variable scale ?

A
65
Q

What is simple linear regression ? what is this simialr to ?

A

Outcome/Predictor = continuous

•Simple linear regression

–Prediction - we ‘re predicting what will be the value Y when the value of X is given.

–Estimation - estimate the mean of Y values that it is assumed exists for a given value X.

𝑦=𝛽0+𝛽1𝑋+𝜖

66
Q

In simple linear regression, what does B, Y nd X tell you ? What does a negative sign mean? What happens of B1=0?

A

Simple Linear Regression

  • The coefficient b ( β ) tells us how much the dependent variable ( Y) increases by increasing the predictor variable (X ) in one unit.
  • The sign is negative if the correlation is negative (and must speak of decline and not increase) .
  • If β1 = 0 , it is said that there is no linear relationship between the two variables and that these are independent.
67
Q

What is logistic regression used for ? It is a measure between ? It represents the odd that ? It can be used to determine ?

it can be used to compare the magnitude of ?.

A
68
Q

.

A
69
Q

.understanding simple logistic regression

A

The odds of being obese among males is 2.6 times higher (95%CI: 1.18-5.79) the odds of being obese among females.

How the OR should be interpreted if it was -1.50?

What would happen if the 95%CI includes the value 1?

70
Q

what is the diffrences in even and odd N values during interquartile range ?

A

If even, take the number and the next one, while off just take on number to determine the Q3-Q1