Stats definitions Flashcards

1
Q

Define ‘Statistical Inference’

A

To make inferences about a population from data contained within a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define ‘Medical Statistics’

A

Assesses the size and strength of the influence of one or more exposure variables (risk factors or treatments) on the outcome variable of interest (such as occurrence of disease or survival)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define ‘Evidence-Based medicine’

A

Appraises the evidence based on the average effect of a treatment assessed on a large number of people and judging it’s relevance to the management of a particular patient.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name the 5 steps in the PPDAC cycle in order

A
  1. Problem
  2. Plan
  3. Data
  4. Analysis
  5. Conclusion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the difference between the ‘Treatment group’ and a ‘Control group’

A

The control group does not include the thing being tested on while the treatment group does. i.e 1 group has a heart valve implanted while the other does not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define a ‘Variable’

A

Characteristic or attribute that can take on different values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define a ‘random variable’

A

A variable whose values occur due to some random process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define ‘Data’

A

Observed values that the variable takes on

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define ‘Datasets’

A

Collections of data on several variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define a ‘Population’

A

The complete group of subjects that are being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define a ‘Sample’

A

Group of subjects chosen from the population i.e a subset of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Name the 2 subsets of Variables

A
  1. Numerical
  2. Categorical
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Name the 2 types of numerical variable explaining each with an example

A
  1. Continuous - A variable that can take on any value i.e temperature time distance
  2. Discrete - A variable that is counted in steps numbers i.e counting sheep before you sleep or the amount of money collected
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Name the 2 types of Categorical variable explaining each with an example

A
  1. Ordinal - Variables with a pre-existing order however can’t be compared mathematically i.e Education as masters>bachelors>nat 5
  2. Nominal - Variables that have no set order i.e ethnicity can’t say one is superior so it’s undefined
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define a ‘population parameter’

A

a quantity or statistical measure that, for a given population, is fixed and that is used as the value of a variable in some general distribution or frequency function to make it descriptive of that population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define a ‘Sample Statistic’ (also known as an estimator)

A

A value that can vary but is known

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Define an ‘explanatory’ and ‘response’ variable

A
  1. Explanatory variable - a fixed value
  2. Response variable - a random variable that might be affected by the explanatory variable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Define a ‘Confounding Variable’

A

A variable that is correlated with both explanatory and response variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Define ‘Simple Random Sampling (SRS)’

A

Srs is where every unit within the population has, in theory, an equal chance to be included in the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Define ‘Stratified sampling’

A

Divide a group into smaller groups ‘strata’ based on some group characteristic. Then another sampling method is employed within each stratum.

21
Q

Define ‘Systematic Sampling’

A

Take every kth unit when sampling.

22
Q

Define ‘Cluster Sampling’

A

Population is split into many groups that are representative of a population called clusters and a fixed number of clusters are sampled.

23
Q

Define an ‘Observational Study’

A

A study where researchers simply observe a variable of interest with no intervention .

24
Q

Explain The difference between a ‘Cross-Sectional study’ and a ‘longitudinal study’

A

A cross sectional study studies a group of individuals at a SPECIFIC point in time.
A longitudinal study studies a group of subjects over a period of time and measurements are recorded at set time points.

25
Q

Define a ‘Cohort Study’

A

Also known as a prospective study, this is when a cohort is divided into groups by the factor of interest and other factors, after some time they are inspected to see what changed in these groups.

26
Q

Define a ‘case control study’

A

Also known as a retrospective study, is when a group that had a disease are compared to a group that did not to ascertain what caused it.

27
Q

Name the 3 types of ‘Categorical graphs’ and explain how many groups can be studied for each

A
  1. Bar chart - single sample
  2. Grouped bar chart - two or more groups
  3. Pie chart - single sample
28
Q

Name the 3 types of ‘Numerical graphs’ and explain how many groups can be studied for each

A
  1. Histogram - single sample
  2. Box-plot/Dot pot - single or groups
  3. Scatter plot - relationship between variables
29
Q

Define a ‘Partition’

A

A partition is a set of Disjoint events/outcomes and the probability of these events sums to 1

30
Q

Define an ‘Experiment’

A

Any process that requires some action to be performed and has an outcome that can be recorded

31
Q

Define an ‘Outcome’

A

Any single result of an experiment

32
Q

Define a ‘Sample space’

A

A set(collection) of possible outcomes of an experiment

33
Q

Define an ‘Event’

A

A collection or set of outcomes from the sample space. Or as subset of the sample space

34
Q

Define a ‘Null event’

A

An event that doesn’t have any outcomes

35
Q

Define the ‘Complement of an event’

A

The sum of the outcomes in the sample space where the event did not occur.

36
Q

Explain the difference between an ‘independent’ set of events and a ‘dependent’ set of events

A

Independent events have no influence on the probability of each other given one has occurred dependent are the opposite and P(x|y) =! P(x)

37
Q

Define ‘Mutually exclusive’ events and state if independent variables can be described as this

A

Mutually exclusive events are 2 events that cannot occur within 1 outcome e.g rolling an even AND odd number .

37
Q

Define ‘Mutually exclusive’ events and state if independent variables can be described as this

A

Mutually exclusive events are 2 events that cannot occur within 1 outcome e.g rolling an even AND odd number .

Independent variables are NEVER mutually exclusive .

Mutually exclusive events are however always dependent BUT NOT ALL DEPENDENT events are mutually exclusive

38
Q

Define a ‘partition’

A

A partition of a sample space is a set of disjoint events/outcomes

39
Q

Define a ‘Numerical random variable’

A

A numeric representation of the result from an experiment

40
Q

State the 3 rules of a ‘Probability Mass function’

A
  1. The outcomes in the sample space are disjoint
  2. 0 < p(x) 1 for each outcome x
  3. sum of p(x) = 1
41
Q

State the 3 properties of the ‘cumulative distribution function’

A
  1. F(- inf) = P(X <= - inf) = 0
    1. F(inf) = P(X <= inf) = 1
  2. If a<=b then F(a) <= F(b) since F is non decreasing
42
Q

Define the ‘Discrete uniform distribution’

A

Assigns equal probabilities to each outcome in the same space

43
Q

When can the Bernoulli Distribution be applied

A

When the sample space can be divided into successes and failures

44
Q

Define a “continuous random variable’

A

A random variable whose sample space has infinite outcomes

45
Q

What are the properties of a good estimator

A

Unbiasedness - i.e. when the sample statistic expectation is equal to the parameter being estimated

Consistency - When the value of the estimator tends towards the value of the parameter with an increase in sample size

46
Q

What is ‘Sampling Error’ a measure of ?

A

Measures how much the estimator tends to vary from sample to sample

47
Q

define a ‘population proportion’

A

A fraction of the population that has a certain characteristic