Inferential Statistics Flashcards
Inferential Statistics
Criterion of “TRUTH”
Validity
The percentage of people with the disease who are detected by the test
% SENSITIVITY
TP ÷ [TP + FN] x 100
% SENSITIVITY
% SENSITIVITY, higher the sensitivity the better?
ye
what does % SENSITIVITY measures?
TRUE POSITIVE
yung mga tunay na may sakit if ever
is the percentage of people with the disease who are not detected by the test, complement of sensitivity
% FALSE NEGATIVE
FN ÷ [TP + FN] x 100
% FALSE NEGATIVE
Counterpart of %sensitivity
% FALSE NEGATIVE
T or F
Higher the sensitivity, the lower the false negative
T
inversely proportional sila with %sensitivity
is the percentage of people without the disease who are correctly labelled by the test as not diseased.
% SPECIFICITY
TN ÷ [FP + TN] x 100
% SPECIFICITY
T or F
Higher the specificity the better – mababa false positive
T
is the percentage of people without the disease who are incorrectly labelled by the test as having disease, complement of specificity.
% FALSE POSITIVE
inversely proportional with %specificity
FP ÷ [FP + TN] x 100
% FALSE POSITIVE
T or F
yes to false positive and false negative
F
NO DAPAT
is defined as the likelihood that an individual with a positive test has the disease.
PREDICTIVE VALUE OF A POSITIVE TEST
TP ÷ [TP + FP] x 100
PREDICTIVE VALUE OF A POSITIVE TEST
Lahat ng positive result to get who are TRULY POSITIVE
is defined as the likelihood that a person with a negative test does not have the disease.
PREDICTIVE VALUE OF A NEGATIVE TEST
TN ÷ [FN + TN] x 1004
PREDICTIVE VALUE OF A NEGATIVE TEST
All of the negative result to get who are FALSE NEGATIVE talaga
The ratio of the chance of the test being positive if having the condition compared to the chance of testing positive if not having the condition
Positive Likelihood Ratio +LR
The ratio of the chance of the test being negative if having the condition compared to the chance of testing negative in not having the condition.
Negative Likelihood ratio -LR
if u see this card
practice the example of maam given for the Indices to Evaluate Accuracy of a Test or Diagnostic Examination
go na
Also termed as “reproducibility” or “repeatability”
Reliability
Na ulit yung test then same result = reliability – CONSISTENT
Validity = nearest to true value
Refers to the stability or consistency of information
Reliability
The extent to which similar information is supplied when measurements are performed more than once.
Reliability
T or F
A key goal in applied biostatistics is to make inferences about unknown population parameters based on sample statistics.
TRUE
what is the difference for parameter and statistics when it comes to mean, SD, and Proportion
Paramerter = Population
Statistic = Sample
this means that kung anong TESTING used for sample, and popluation yun lang din gagamiting sa parameter
There are two broad areas of statistical inference,
- Estimation
- Hypothesis Testing
The process of determining a likely value for a population parameter (e.g., the true population mean or population proportion) based on a random sample.
Estimation – APPROXIMATION
Estimation - T or F
In practice, we select a sample from the target population and use sample statistics (e.g., the sample mean or sample proportion) as estimates of the unknown parameter
T
Estimation - T or F
The sample should be representative of the population, with participants selected at random from the population.
T
alam niyo nayan very ez
Estimation - T or F
In generating estimates, it is also important to quantify the precision of estimates from different samples.
T
Estimation
Point Estimate =
Single number
e.g.: 1, 2, and 69
Estimation
Interval Estimate (Confidence Interval Estimate) =
may decimals (2 values lower and upper limit with confidence intervals)
a range of values, derived from sample statistics, that is likely to contain the value of an unknown population parameter.
Confidence Interval
Estimation - confidence interval
Because of their _ _ _ _ _ _ _ _ _ _ _ _ , it is unlikely that two samples from a particular population will yield identical confidence intervals.
Random Nature
Estimation - confidence interval: T OR F
But if you repeated your sample many times, a certain percentage of the resulting confidence intervals would contain the unknown population parameter.
T
diko parin gets to
If you see this card
go over the inferential statistics, check the estimation interval pls
There are a number of population parameters of potential interest when one is estimating health outcomes (or “endpoints”).
Parameter Estimation
Parameter Estimation
Many of the outcomes we are interested in estimating are either
continuous or dichotomous variables
, although there are other types.
Parameter Estimation
The parameters to be estimated depend not only on whether the endpoint is continuous or dichotomous, but also on the ?
number of groups being studied.
Parameter Estimation
When 2 groups are being compared what you need to establish between the groups?
- Independent (e.g., men versus women)
- Dependent (i.e., matched or paired, such as a before and after comparison).
Parameters to estimate in health-related studies
One sample - Continuos varible
Mean
Parameters to estimate in health-related studies
One sample - dichotomous variable
Proportion or Rate
yung mga prevelance,incidence rate …
Parameters to estimate in health-related studies
2 Independent Samples - Cont. Variable
Difference in MEAN
Parameters to estimate in health-related studies
2 Independents Samples - Dichoto. Variable
Difference in proportion or rates
pag 2 independent samples, lagi difference okay? okay
Parameters to estimate in health-related studies
2 Dependent, Matched Samples - Cont. Variable
Mean Difference
iba ang difference in means sa mean difference okay? okay
Confidence Intervals
Two types of estimated for each population parameter
- Point estimate
- Confidence interval (CI) estimate.
What is the difference between Cont and Dichotomous Variable?
Cont is all about MEAN, while Dicho is proportions or rate
okay? OKAY
Confidence Intervals
one first computes the point estimate from a sample?
Ye
para makuha mo Confidence intervals
Confidence Interval - T or F
Sample means and sample proportions are unbiased estimates of the corresponding population parameters.
True
If you see this card
go over the PRINCIPLES of confidence interval
need siya understood, not memorized
The confidence interval estimate (CI) is a range of likely values for the population parameter based on
the point estimate, e.g., the sample mean
Confidence Intervals Estimate - T or F
In practice, we select one random sample and generate one confidence interval, which may or may not contain the true mean. The observed interval may over- or underestimate μ.
True
Confidence Intervals Estimate - T or F
The confidence interval does not reflect the variability in the unknown parameter.
T
Confidence Intervals Estimate - T or F
what does confidence interval estimate REFLECTS
amount of random error in the sample and provides a range of values
likely to include the unknown parameter. sa range of values
if u see this card
araling formula sa confidence interval
Confidence Interval
For n >= 30, T or Z table?
Z table
Confidence Interval
For n < 30
Use the t-table with df-n-1
Point Estimate Z SE
where is the Z values got from?
the standard normal distribution for the selected confidence level
(e.g., for a 95% confidence level, Z=1.96).
In practice, we often do not know the value of the population standard deviation (σ).
However, if the sample size is large (n > 30), then the sample standard deviations can be used to estimate the population standard deviation.
Point Estimate (+-) Z SE
With smaller samples (n< 30) the Central Limit Theorem does not apply, and another distribution called
T-distribution
Confidence Intervals Estimate for Smaller Samples
Similar to the standard normal distribution but takes a slightly different shape depending on the sample size.
T-distribution
T-Distribution - T or F
In a sense, one could think of the t distribution as a family of distributions for larger samples.
F
smaller, n <30 - LESSSSSSSSSSSS THAN
T - distribution - T or F
It produces smaller margins of error
F
It produces LARGER, because small samples are less precise
t values are listed by?
degrees of freedom (df)
T-Distribution - T or F
Just as with large samples, the t distribution assumes that the outcome of interest is approximately normally distributed.
T
If u see this card
go over the example for Confidence intervals pls pls
PLEASE
PUHLEASE
The sample proportion
p̂ (p hat)
confidence interval can be computed by this
p hat formula
so go over it
if u see this card
go over the example of Confidence Intervals Estimate for Population Proportion
a contention or assumption made concerning a population characteristics.
STATISTICAL HYPOTHESIS
. It is usually concerned with the parameters of the population about which the statement is made.
STATISTICAL HYPOTHESIS
NOT YET TRUE – ipprove palang
The purpose of the research is to provide evidence to support or refute the null hypothesis
Hypothesis Testing
Hypothesis testing comprises a set of what?
set of procedures
Hypothesis Testing - T or F
A hypothesis is either rejected or not based on the probability of occurrence of the sample results if the null hypothesis were true.
T
how is Statistical Hypothesis validated?
if calculated probability of results exceeds a prespecified value of alpha
Hypothesis
If calculated probability is less than or equal to alpha?
hypothesis is rejected, therefore, result is statistically significant.
< (less than) = (equal) baka di mo alam eh
This is the hypothesis of “no difference”. Statement of equality
Null Hypothesis (Ho)
This is the hypothesis of “no relationship”.
Null Hypothesis (Ho)
Asserts that population parameter is some value other than one hypothesized.
Alternative Hypothesis (H1 or HA)
Usually the research hypothesis, the hypothesis the investigator believes in.
Alternative Hypothesis (H1 or HA)
Null hypothesis should always be framed in hopes of being able to reject it so that the alternative hypothesis could be accepted.
oo
Include values of statistics leading to rejection of null hypothesis. Usually called alpha or tail of the curve.
Critical Region or Region of Rejection
These values are those whose probability of occurrence is less than (<) or equal to the level of sig/nificance, α.
Critical Region or Region of Rejection
The probability level that is considered too low to warrant support of the hypothesis being tested.
Level of Significance or ALPHA Level
Basis for inferring the operation of non-chance factors (0.05, 0.01, 0.1)
Level of Significance or ALPHA Level
omegaverse????
if u see this card
MASTER the decision table
yung may Ho true, Ho false
Region of Acceptance
alpha of the curve, greater than or equal to level of significance
1
Region of Acceptance
When Ho is rejected?
statistically significant and the observed difference may not be attributed to sampling variation
Region of Acceptance
If Ho is not rejected
ot statistically significant and may be due to sampling variation
what is Ho?
null hypo
When HA asserts that population parameter is different from one hypothesized (2-tailed test)
NON-DIRECTIONAL Ha
Asserts the direction of the difference ( 1-tailed test)
DIRECTIONAL Ha
Steps in Hypothesis Testing
1st step
Determine whether a 2-tailed or a 1-tailed test be made.
Steps in Hypothesis Testing
2 step, what do you need to assume?
Ho and Ha
Null and Alternative
Steps in Hypothesis Testing
3rd step, after the hypothesis
Choose alpha, the arbitrary level of significance
here is the basis of rejection AFTER the computation
Steps in Hypothesis Testing
4th and 5th step
- Determine critical region
- Determine appropriate test
Steps in Hypothesis Testing
6th and 7th the last
- Solution
- Conclusion
Conparison of Parameters or Indicators
Single Population
what interval/ration testi used
Z or T Test
Conparison of Parameters or Indicators
Single Population
what ordinal test is used
- Kolmogorov
- SMirnov one sample test
Conparison of Parameters or Indicators
Single Population
what nominal test is used
- Z test
- Chi Square Test
Conparison of Parameters or Indicators
2 Population: Related Samples
what interval/ratio test is used
Paired t test
Conparison of Parameters or Indicators
2 Population: Related Samples
what ordinal est is used
- Wilcoxon
- Matched pairs
- SIgned ranks test
Conparison of Parameters or Indicators
2 Population: Related Samples
what nominal est is used
McNemar’s Test
Conparison of Parameters or Indicators
2 Population: independent Samples
Interval/Ratio
Independent T-test
Conparison of Parameters or Indicators
2 Population: independent Samples
Ordinal
Mann whitney U test
Conparison of Parameters or Indicators
2 Population: independent Samples
Nominal
- fishers exact
- probability test
- Chi square test
Conparison of Parameters or Indicators
3 or More population: Related samples
interval/ratio
F-test: 2 way analysis of Variance
Conparison of Parameters or Indicators
3 or More population: Related samples
Ordinal
Friedman’s Analysis of Variance
Conparison of Parameters or Indicators
3 or More population: Related samples
Nominal
Cochran’s Q test
Conparison of Parameters or Indicators
3 or More population: Independent
Nominal
Chi square test
Conparison of Parameters or Indicators
3 or More population: Independent
Ordinal
Kruskali wallis one way ANOVA
Conparison of Parameters or Indicators
3 or More population: Independent
Interval/Ratio
F-Test: one way ANOVA
Study of Relationship Between Variables
Interval/Ratio
- Regression
- Correlation
Study of Relationship Between Variables
Ordinal
- Spearman Rank
- Correlation
- Coefficient
Study of Relationship Between Variables
Nominal
- Kappa Test
- Contingenct
- Coefficient Test
What statistical test
if u see this card
go over the relationship for the Independent and dependt and what statistical test will be used
what statistical test is used when
- Independent - Qualitative
- Dependent - Qualitative
Chi square test
what statistical test is used when
- Independent - Qualitative
- Dependent - Quantitative
T,Z, ANOVA
what statistical test is used when
- Independent - Quantitative
- Dependent - Quantitative
Linear Regression
what statistical test is used when
- Independent - Quantitative
- Dependent - Qualitative
Logistic Regression
If u see this card
Please go over the CASE, example yan ha
also read the nte