Introduction to statistics - Data Flashcards

1
Q

Data collection techniques

A
  • Observations
  • Tests and assessments
  • Surveys
  • Document analysis (published articles)
  • Interviews
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are Secondary data?

A

data that someone else has collected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are Primary data?

A

data that you collect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Secondary Data Sources

A
  • Country & local city departments -> e.g.,
    Ministère-Direction de la Santé, STATEC (Statistic portal of Luxembourg)
  • Hospitals, Clinics & Schools
  • Research institutions e.g., Luxembourg Institute of Health
  • Journal Articles
  • International institutions e.g., OECD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Secondary Data disavantages

A
  • May be out of date for what you want to analyze.
  • May not have been collected long enough for detecting trends.
  • There may be missing information on some observations
  • May be incomplete
  • You have no control over data quality
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Secondary data advantages

A
  • Saves time
  • Saves money
  • Easily Accessible
  • Increases the feasibility of multicenter/ international collaboration
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Challenges of primary data

A
  • Can be expensive to collect
  • Selection of population or sample
  • Difficulty recruiting participants
  • Pretesting/piloting the instrument to determine the presence or absence of measurement bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Who/What do we collect data from?

A
  1. Patients
  2. Therapists
  3. Published research
  4. Electronic devices
  5. Public & Private organisations
  6. Anything that exists can be a source of data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Defenition Population

A

A population is a group (e.g. patients) that have something in common (e.g. back pain).
(-> Target population)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Defenition Sample

A

A sample is a smaller group with similar characteristics from within that population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What’s a sample bias?

A

When the sample shows a higher % of a special caractéristics, or a type of person compared to the % in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which are the probalistic sampling methods?

A
  • simple random
  • statified random
  • systematic random
  • clustered random
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Non probabilistic sampling methods?

A
  • convenience
  • purposive
  • snowball
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are descriptive statistics used for?

A
  • summarise data
  • describe data
  • present data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Types of descriptive statistics

A
  1. Measures of Frequency
  2. Measures of Centeal Tendency
  3. Measures of Dispersion or variability
  4. Measures of position & rank
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Measures of Frequency

A

Count, Percent & Frequency
Shows how often an observation occurs.

17
Q

Measures of Centeal Tendency

A

Mean, Median & Mode
-> Locates the distribution by variouspoints.
-> Shows the average or the most common score.

18
Q

Measures of Dispersion or variability

A

Range, Variance, Standard Deviation & Interquartile Range.
- Identifies the spread of scores by stating intervals
- Range = high/low points in the dataset
- Variance or Standard Deviation = difference between observed score and the mean
- Shows how “spread out” the data are. It is helpful to know when data are spread out as this can affect the average.
- The Interquartile range is the difference between the third quartile and the first quartile.

19
Q

Measures of position & rank

A

Percentile Ranks, Quartile
-> Describes how scores fall in relation to one another. Relies on standardized scores
-> Compares scores to a normalized score (e.g. a national norm)

20
Q

Mean

A
21
Q

By what can be affected the Mean?

A

By outliers
-> can make the mean a bad measure of central tendency

22
Q

What’s the median

A

When data are listed in order, the median is the point at which 50% of the cases are above and 50% below it.
This is the same as the 50th percentile.

23
Q

What’s the mode?

A

The observation with the highest frequency

24
Q

What’s the range?

A

The range is the difference between the lowest value and the highest value in a dataset

Range = (maximum value – minimum value)

25
Q

What’s Q1?

A

=1st Quartile
= the value occupying the ¼ position of all values arranged in ranked order. Or median of the 1st half of all observations.

26
Q

What’s Q3?

A

= 3rd quartile
= the value occupying the ¾ position of all values arranged in ranked order. Or the median of the 2nd half of all observations.

27
Q

What’s IQR?

A

= Interquartile = Q3-Q1

28
Q

What’s Q2?

A

= median

29
Q

Variance

A

= is a measure of how close together or far apart the values in a databest

->The larger the variance, the further the individual values are from the mean.

->The smaller the variance, the closer the individual values are to the mean.