CH17 Biostatistics Flashcards

1
Q

______ is the use of data analysis and interpretation in health care research.

A

Biostatistics

2
Q

______ involves the application of statistical tests to the data in order to organize, describe, summarize, and analyze it to answer a research question or test a hypothesis.

It also explains results and requires that ________ be used to explain the meaning and application of the findings, identifies possible factors that could have influenced the results, and draws inferences to the population.

A

Data Analysis; critical thinking

3
Q

Dental hygienists should know the research process in order to understand the epidemiology of disease, practice therapies, implement programs, and practice _________ dentistry.

A

evidence-based

4
Q

An insufficient number of subjects, too short a duration, and the use of incorrect measurement instruments, incorrect procedures, or incorrect statistical tests are all causes of ________.

A

invalid research

5
Q

What are some examples of Nominal scale data?

(Unordered categories)

A

Male/Female

Smoker/Non-smoker

(Qualitative categories)

6
Q

What are some examples of Ordinal scale data?

(Ordered categories)

A

Mutually exclusive categories:

1, 2, 3, 4, 5

IOTN

Minimal, Moderate, Severe, Unbearable pain

(Each of the above have data that exclude all other data in the data set)

7
Q

_______ data are a scale of measurements that contain all of the characteristics of the preceding scales.

This data is quantitative and has an absolute zero point (0 means there is an absence).

Some examples are height, weight, duration, and number of teeth/sealants.

A

Ratio Scale Data

8
Q

Data that is represented by numbers would be considered _________. This data can be expressed as counts, percentages, and means of something.

Examples of this in DH are pocket depths, # of DMFT, and time spent scaling.

A

quantitative data

Asks the question HOW MANY

9
Q

Data focused on information that reflects the quality or nature of variables and cannot be expressed numerically is called ________ data. It is expressed as outcomes or states, can be counted for reporting, and its variables can be rank ordered.

Examples of this in DH are tissue color, tenacity of calculus, and what patients liked most & least about their visit.

A

Qualitative Data

Asks the question, WHAT KIND?

10
Q

What are some examples of Continuous Variables?

A

Height in cm

pocket depth in mm

Age

Time

(Example of age: 25 years, 10 months, 2 days, 5 hours, 4 seconds, 4 milliseconds, 8 nanoseconds, 99 picoseconds…and so on.)

11
Q

What are some examples of data that are Discrete Variables?

A

Number of visits to the dentist

DMF

12
Q

______ is a type of data that has no numeric representation; therefore, it is qualitative in nature.

Ex: male/female, freshman/sophomore/jr/sr, eye color, race

A

Categorical Variable Data

13
Q

_______ data are categorical variable data that place subjects into ONLY two groups/categories. They take on one of only two possible values when observed or measured and are qualitative in nature.

Ex: M/F, yes/no, T/F

A

Dichotomous Variable Data

14
Q

What are the 3 Categorical Data Categories?

(Qualitative Data Categories)

Double Check!

A
  1. Nominal
  2. Ordinal
  3. Dichotomous
15
Q

Name the 4 Numerical Data Categories.

(Quantitative Data Categories)

A
  1. Discrete
  2. Continuous
  3. Interval
  4. Ratio
16
Q

__________ allows raw data to be organized and summarized in a meaningful way so that a pattern can emerge.

This type of statistics always precedes ________.

A

Descriptive statistics; inferential statistics

(If raw data was just presented it would be hard to visualize what was being seen. By using descriptive statistics we can see data in a meaningful way.)

17
Q

_______ are used to study something when researchers do not have access to the entire population (or total). The results are ________.

Because of this limitation, a sample of the population is taken and studied.

A

Inferential Statistics data; generalizations

18
Q

What measure of central tendency is an average used with continuous data?

It is appropriately used for ratio and interval data.

A

Mean

19
Q

What measure of central tendency is a midpoint of data when placed in ascending or descending order?

If there is an even number of values, the ____ of the two middle numbers must be taken.

Its appropriate use is for ordinal data.

A

Median; mean

20
Q

Calculate the Mean of the following:

2, 3, 3, 5, 7, 10 (sum = 30)

A

30 ÷ 6 = 5

Mean = 5
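
The calculation on this card can be checked with a short Python sketch. It assumes only the standard library's `statistics` module; the variable names are illustrative and not part of the original card.

```python
from statistics import mean

scores = [2, 3, 3, 5, 7, 10]

# Step 1: add up all the scores (the card gives this sum as 30).
total = sum(scores)

# Step 2: divide the sum by the number of scores (30 / 6).
card_mean = total / len(scores)

# The library function should agree with the by-hand steps.
library_mean = mean(scores)

print(total, card_mean)
```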

21
Q

Calculate the median of the following numbers:

3, 2, 5, 10, 3, 7

A

In order to calculate the median the numbers must be placed in ascending order.

2, 3, 3, 5, 7, 10

(the median point is when ½ the data is above and ½ the data is below)

There is no single midpoint, so take the mean of the two middle numbers:

(3 + 5) ÷ 2 = 8 ÷ 2 = 4

Median = 4

22
Q

Calculate the median of the following numbers:

7, 3, 2, 3, 5, 4, 10

A

In order to calculate the median the numbers must be placed in ascending order.

2, 3, 3, 4, 5, 7, 10

Median = 4

(the midpoint)
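
Both median cards (the even-sized and the odd-sized set) follow the same steps, which can be sketched in Python. The helper name `median_by_hand` is hypothetical and only mirrors the card's procedure; the standard library's `statistics.median` is used as a cross-check.

```python
from statistics import median

def median_by_hand(values):
    """Sort the values, then take the middle one (or the mean of the two middle ones)."""
    ordered = sorted(values)
    mid = len(ordered) // 2
    if len(ordered) % 2 == 1:
        # Odd count: a single midpoint exists.
        return ordered[mid]
    # Even count: no single midpoint, so average the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2

even_set = [3, 2, 5, 10, 3, 7]     # six values
odd_set = [7, 3, 2, 3, 5, 4, 10]   # seven values

even_median = median_by_hand(even_set)  # (3 + 5) / 2
odd_median = median_by_hand(odd_set)    # middle value of 2, 3, 3, 4, 5, 7, 10

print(even_median, odd_median)
```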

23
Q

What measure of central tendency is concerned with the value that occurs most often? It can be used with all types of data.

Its appropriate use is for nominal data.

A

mode

24
Q

Calculate the Mode of the following numbers:

2, 3, 3, 5, 7, 10

A

2, 3, 3, 5, 7, 10

Mode = 3
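
As a rough check, the mode can be found in Python with the standard library. The second data set below is a made-up illustration (not from the cards) of a set with two modes.

```python
from statistics import mode, multimode

scores = [2, 3, 3, 5, 7, 10]

# The value that occurs most often.
most_common = mode(scores)

# Hypothetical example: 1 and 3 each occur twice, so there are two modes.
two_modes = multimode([1, 1, 2, 3, 3])

print(most_common, two_modes)
```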

25
Q

What is the goal of using the measures of central tendency?

A

To take a collection of data and identify the middle of the data collected.

A measure of central tendency is a single value that attempts to describe a set of data by identifying the central position within that set of data.

26
Q

Name the 3 Measures of Central Tendency.

A

Mean, Median and Mode

27
Q

What two data categories are numerical?

A

Discrete and Continuous

28
Q

Define: Discrete Variable

A

Discrete variables are counted a finite number of times (in whole numbers).

29
Q

Descriptive Statistics are used to summarize data in a meaningful way. There are generally two MAIN types of statistics used to describe data. Name them.

A
  1. Measures of Central Tendency (Mean, Median, Mode)
  2. Measures of Dispersion (Range, Variance, Standard Deviation)

Though not a statistic type, graphs, histograms, and charts are also used to describe and summarize data.
30
Q

_______ communicates how much variation is present in a group of data.

In statistics, this is a way of describing how spread out a set of data is.

(Range, Variance, Standard Deviation)

A

Measures of Dispersion

(aka Measures of variability)

31
Q

Measures of dispersion communicate how much variation is present in a group of data. Name the three measures that are used to describe the dispersion of a group of data.

A
  1. Range
  2. Variance
  3. Standard deviation
32
Q

What measure of dispersion is determined by subtracting the lowest score from the highest score?

It is the simplest and least helpful measurement and is usually reported with the median.

A

Range

33
Q

What represents the average squared distance of each score from the mean, is associated with standard deviation, and is the most common and useful measure of dispersion?

It is usually reported with the mean to calculate data intervals.

Its value or the SD in relation to the mean depicts the distribution of scores.

A

Variance

(It measures how far each number in the set is from the mean and therefore from every other number in the set.)

Square root of the variance = standard deviation

Define: Variance

34
Q

When is Standard Deviation used and how is it determined?

A

Standard deviation is used when determining how spread out the numbers are around the mean.

(Used with Quantitative Continuous Data)

Square root of the variance = standard deviation
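
The three measures of dispersion from the preceding cards can be worked through in a short Python sketch. It reuses the set 2, 3, 3, 5, 7, 10 from the mean card as an assumed example. It also shows both the population variance (divide by n) and the sample variance (divide by n − 1), since the cards do not say which one is intended.

```python
import math
from statistics import mean, pvariance, variance

scores = [2, 3, 3, 5, 7, 10]

# Range: highest score minus lowest score.
data_range = max(scores) - min(scores)

# Variance: average of the squared distances from the mean.
center = mean(scores)
squared_distances = [(x - center) ** 2 for x in scores]
population_variance = sum(squared_distances) / len(scores)    # divide by n
sample_variance = sum(squared_distances) / (len(scores) - 1)  # divide by n - 1

# Standard deviation: square root of the variance.
sample_sd = math.sqrt(sample_variance)

# Cross-check the by-hand steps against the standard library.
matches_library = (
    math.isclose(population_variance, pvariance(scores))
    and math.isclose(sample_variance, variance(scores))
)

print(data_range, sample_variance, round(sample_sd, 2), matches_library)
```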

35
Q

Define: Standard Error of the Mean

I need your help with this one Jesse!

A

Standard Error of the Mean is used to determine how accurately the sample mean estimates (or generalizes to) the mean of the entire population.
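
One common way to compute it is to divide the sample standard deviation by the square root of the sample size. The sketch below assumes that formula, which the card itself does not state, and reuses the illustrative set 2, 3, 3, 5, 7, 10.

```python
import math
from statistics import stdev

scores = [2, 3, 3, 5, 7, 10]   # illustrative sample, not from this card

sample_sd = stdev(scores)       # sample standard deviation (n - 1 in the denominator)
sample_size = len(scores)

# Assumed formula: SEM = SD / sqrt(n). A larger sample gives a smaller SEM.
sem = sample_sd / math.sqrt(sample_size)

print(round(sem, 2))
```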

36
Q

A _________ is an asymmetrical curve distorted by a few extreme scores.

A

Skewed Distribution

37
Q

A ________ shows how often something happened in a specific category.

These tables may be _______ or _______.

Example: how many times does the number 9 occur?

1, 2, 3, 4, 6, 9, 9, 8, 5, 1, 1, 9, 9, 0, 6, 9.

A

Frequency Distribution Table; grouped or ungrouped
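
The example in the question can be tallied with a small Python sketch using `collections.Counter`. The two intervals used for the grouped version (0 to 4 and 5 to 9) are an assumption chosen for illustration, not something the card specifies.

```python
from collections import Counter

data = [1, 2, 3, 4, 6, 9, 9, 8, 5, 1, 1, 9, 9, 0, 6, 9]

# Ungrouped table: one row per distinct score.
ungrouped = Counter(data)
nines = ungrouped[9]

# Grouped table: scores are placed into mutually exclusive intervals.
grouped = Counter("0-4" if score <= 4 else "5-9" for score in data)

for score in sorted(ungrouped):
    print(score, ungrouped[score])
print(dict(grouped))
```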

38
Q

Is this frequency distribution table grouped or ungrouped?

A

Grouped

39
Q

Characteristics of Effective Tables

A
  1. Accuracy
  2. Simplicity
  3. Clarity
  4. Appearance
  5. Well-Designed Structure
40
Q

________ is a relationship or association between variables that can be measured mathematically.

A

Correlation

41
Q

A _______ is a relationship between two variables in which both variables move in the same direction.

A

Positive Correlation

42
Q

A ________, or inverse correlation, is a relationship between two variables that move in opposite directions.

A

Negative Correlation

43
Q

“_” signifies the correlation coefficient. Its value communicates the ______ and strength of the association.

A

“r”; direction

44
Q

Hypothesis testing

______ is a formal decision-making process of testing a hypothesis using statistical significance and inference, followed by interpreting the statistical results

A

Hypothesis testing

45
Q

A ________ is an initial negative statement of belief about the value of a population parameter. It rejects the research or alternative hypothesis.

A

Null hypothesis

46
Q

Probability

A

expressed as “p” value

(AKA alpha 𝛼 level)

47
Q

A ______ is also called an alpha (α) error. It occurs when the null hypothesis is rejected but is actually true, so it should have been accepted.

The probability of committing this error is the same as the alpha level.

Researchers can control a type I error by setting the alpha level low.

This type of error can be very costly.

A

Type I Error

48
Q

A _____ error is also called a beta (β) error. It occurs when the null hypothesis is accepted but is actually false, so it should have been rejected. The exact probability of committing this type of error is generally unknown.

They are caused by using too small a sample, unreliable measuring devices, or imprecise research methods.

A

type II

49
Q

Chi-square test

A
50
Q

External validity

A
51
Q

Fewer than ____ subjects would make a research project invalid.

A

30

52
Q

A _______ is expressed by a large or infinite number of measures along a continuum and can be expressed in fractions or decimals.

This type of data is considered quantitative and can be converted into nominal or ordinal scales.

A

continuous variable

53
Q

_________ are data made up of distinct and separate units or categories that are counted only in whole numbers. This data is quantitative in nature because it is represented numerically. It can be converted to nominal or ordinal scale.

A

Discrete Variable Data

54
Q

What type of categorical variable data organizes its data into mutually exclusive categories that have no rank order, value, or numeric relationship between the different classifications?

Ex: L/R handed, M/F, hair color

A

Nominal Scale

55
Q

What type of categorical data organizes data into mutually exclusive categories that are rank ordered based on a criterion?

In this type of data, the difference in rank is not equal in value.

Ex: Poor/fair/good/excellent, shades of whiteness of teeth, calc class A-B-C-D

A

Ordinal Scale

56
Q

What type of data has the characteristics of the ordinal scale and an equal distance between any two adjacent units of measurement?

This type of data is quantitative in nature and does not have a meaningful zero point.

Ex: temperature (0 degrees is colder than 90 degrees)

A

Interval Scale Data

57
Q

Data summaries such as bar graphs, histograms, and pie charts; measures of central tendency such as mean, median, and mode; and measures of variability such as range, variance, and standard deviation are all considered _______ Statistics.

A

Descriptive

58
Q

A mode value can be either _____ (consisting of 2 modes) or _____ (consisting of more than 2 modes).

A

bimodal; multimodal

59
Q

The _________, also referred to as ________, forms the theoretical foundation for comparisons and making statistical decisions.

It is a symmetrical, unimodal, bell-shaped curve that explains why random variables tend to be normally distributed.

The mean, median, and mode are equal in value.

A

Normal Distribution; Gaussian Distribution

60
Q

The ______ provides an estimation of the spread of data given the mean and the standard deviation of a data set that follows the standard normal distribution.

A

Empirical Rule

61
Q

The Empirical rule says that ___% of data fall within one SD of the mean, ___% within two SD of the mean, and ___% within three SD of the mean.

A

–68%

–95%

–99.7%
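
These percentages can be reproduced from the standard normal curve. The sketch below uses `statistics.NormalDist` from the Python standard library; the helper name `share_within` is hypothetical.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def share_within(k):
    """Proportion of a normal distribution lying within k SD of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

within_one = share_within(1)
within_two = share_within(2)
within_three = share_within(3)

for k in (1, 2, 3):
    print(k, round(share_within(k) * 100, 1))
```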

62
Q

________ is the foundation of the ________.

A

Normal distribution; central limit theorem

63
Q

What factor is most affected by a skewed distribution?

A

the mean

64
Q

Skewed distributions can be _____ or ______.

A

positive or negative

65
Q

Is this frequency distribution table grouped or ungrouped?

A

Ungrouped

66
Q

An ________ frequency distribution table includes all the scores in the distribution; it is good for fewer than 30 observations.

A

Ungrouped

67
Q

A ____ frequency distribution table groups a set number of scores into mutually exclusive intervals, usually 5-10 intervals, which makes it easier to understand. (Example: those who got A’s, those who got B’s…)

A

grouped

68
Q

A ____ is used to represent categorical data. Its length corresponds with the frequency of the value.

A

Bar graph

69
Q

A _______ is similar to a bar graph, but the bars appear side by side and are touching. It is used to represent interval or ratio variables, grouped & ungrouped frequencies, and ordinal data that is treated as continuous data.

A

histogram

70
Q

A _________ is a line graph that represents frequency data that are continuous in nature. It is drawn by connecting the midpoints of the bars of a histogram, then extending the line at both ends to imaginary midpoints at the right and left of the histogram.

This graph can represent grouped or ungrouped frequencies and can also represent frequency, percent, cumulative frequency, or cumulative percent.

A

frequency polygon

71
Q

A _______ is a line graph used to plot a variable over time.

A

Polygon

72
Q

A ______ shows the relationship between two variables and how the level of one variable varies as the level of the other variable changes.

A

Scattergram

73
Q

As it relates to correlation, the “r” value indicates the ______ of relationship.

If the value moves closer to +1 or -1, there is a stronger relationship. When it is closer to 0, there is a weaker relationship.

+1 or -1 indicates a PERFECT relationship, while 0 indicates ZERO relationship.

A

strength
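
The direction and strength read from "r" can be illustrated with a by-hand Pearson correlation in Python. The function name `pearson_r` and the three small data sets are made up for illustration; they are not from the cards.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient computed from deviations about the means."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sum_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sum_xx = sum((x - mean_x) ** 2 for x in xs)
    sum_yy = sum((y - mean_y) ** 2 for y in ys)
    return sum_xy / math.sqrt(sum_xx * sum_yy)

x = [1, 2, 3, 4, 5]
rises_with_x = [2, 4, 6, 8, 10]   # moves in the same direction as x
falls_with_x = [10, 8, 6, 4, 2]   # moves in the opposite direction
noisy = [2, 1, 4, 3, 5]           # same direction, but not a perfect line

r_positive = pearson_r(x, rises_with_x)   # perfect positive relationship
r_negative = pearson_r(x, falls_with_x)   # perfect negative relationship
r_weaker = pearson_r(x, noisy)            # positive, but weaker than perfect

print(r_positive, r_negative, round(r_weaker, 2))
```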

74
Q

A _________ can be used to quantify the relationship of two variables, and expresses the functional relationship between the variables.

It is used to predict the score of one variable based on the score of another

Example: National board scores based on students’ GPA

A

regression analysis
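
A minimal sketch of the idea, assuming a simple least-squares line with one predictor. The GPA and board-score numbers are invented for illustration only, and the function names are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for predicting y from x."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sum_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sum_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = sum_xy / sum_xx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Invented data: student GPA and national board score.
gpa = [2.0, 3.0, 4.0]
board_score = [70, 80, 90]

slope, intercept = fit_line(gpa, board_score)

def predict(new_gpa):
    """Predicted board score for a given GPA, using the fitted line."""
    return intercept + slope * new_gpa

predicted = predict(3.5)
print(slope, intercept, predicted)
```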

75
Q

A _________ provides a mathematical model that gives the strength or ability of two or more variables to predict another variable.

Examples: SAT scores, GPA strength

A

Multiple Regression Analysis

76
Q

A _____ is called the alternative or positive hypothesis. It is the logical opposite of the null hypothesis and can indicate a direction of difference.

Example: One brand of sealants does differ from another brand of sealants.

A

Research Hypothesis

77
Q

The ______ is a probability value, also called alpha value or significance value. It represents the probability that the findings from the study are due to chance. In oral health research, the commonly accepted threshold is 0.05: when the value is equal to or smaller than 0.05 (p ≤ .05), we reject the null hypothesis because we are confident that the statistical decision is correct.

If this value is ______ than 0.05, the results are said to be not statistically significant, so we do not reject the null hypothesis.

A

p-value; larger

78
Q

_________ are used for hypothesis testing when the data meet certain assumptions.

The data must be classified as continuous (includes ratio, interval, and ordinal data).

A

Parametric Inferential Statistics

79
Q

What are the types of parametric statistics?

A

–Student t-test

–Analysis of variance (ANOVA)

80
Q

The ______ determines if a statistically significant difference exists between two mean scores.

A

T-test
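
As a rough illustration, the statistic for comparing two mean scores can be computed by hand. The sketch assumes an independent-samples t-test with pooled variance (one of several t-test variants, and not specified by the card). The two groups are invented data and the function name is hypothetical.

```python
import math
from statistics import mean, variance

def pooled_t(group_a, group_b):
    """t statistic and degrees of freedom for two independent groups (pooled variance)."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / df
    std_err = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean(group_a) - mean(group_b)) / std_err, df

# Invented scores for two groups.
brand_a = [1, 2, 3, 4, 5]
brand_b = [3, 4, 5, 6, 7]

t_stat, df = pooled_t(brand_a, brand_b)

# Two-tailed critical value from a t table for df = 8 at alpha = .05.
critical_value = 2.306
significant = abs(t_stat) > critical_value

print(t_stat, df, significant)
```

With these invented numbers, the difference between the means is not statistically significant at the .05 level, so the null hypothesis would not be rejected.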

81
Q

_____ determines if statistically significant differences occur when comparing more than two mean scores and tells researchers that there is a difference among groups.

It does not, however, specify which group is different.

A

ANOVA

(Analysis of variance)