Lecture 6 Flashcards

mean, SE, SD, median and plots

1
Q

What can stats provide?

A

provide objective criteria for evaluating hypothesis, synthesize information, help detect patterns in data, help to critically evaluate arguments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what stats cannot provide?

A

tell the truth, compensate for poor design,

indicate clinical significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

examples of narrative data

A

gender, social status, profession, colour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

numerical measurements examples

A

lab results, vital signs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is a population

A

A population is any entire collection of people, animals, plants, or things from which we may collect data. It is the entire group we are interested in, which we wish to describe or draw conclusions about.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is a sample

A

A sample is a group of units selected from a larger group (the population) for the study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

why we select samples for data analysis

A

• A sample is generally selected for study because the population is too large to study in its entirety. The sample should be representative of the general population. This is often best achieved by random sampling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is descriptive statistics

A

brief descriptive coefficients that summarize a given data set, which can be either a representation of the entire population or a sample of a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

why descriptive statistics are important

A

we get acquainted with the data, calculate the outliers, asses assumptions needed to check statistical hypothesis, check that the data doesnt have any errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

describe nominal (categorical/qualitative) variable

A

values are in arbitrary (sutartinis)categories and no units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

describe ordinal (categorical/qualitative) variable

A

values in ordered categories and no units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

describe discrete (metric/quantitative) variable

A

integer (sveiki skaiciai), counted units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

describe continuous (metric/quantitative) variable

A

continuous values, measured units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

how can we systemise descriptive statistics

A

frequency and contingency tables, charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how to summarise metric data

A

Mode, median, mean, quartile, variance, standard deviation

Asymmetry, kurtosis , Charts , histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is a limitation of histograms

A

they can only represent one variable

17
Q

how to write assumption of normality?

A

mean ± SD

18
Q

what is mean

A

The mean is the arithmetic average of the observations

19
Q

what is variance

A

the spread around the mean.

20
Q

standard deviation

A

a quantity expressing by how much the members of a group differ from the mean value for the group.

21
Q

why analysing only the mean is not accurate

A

• Analysing just the mean values is not as accurate, due to various values that could be present in the data set. Therefore we need to look at the variance of the data and SD to evaluate the accuracy.

22
Q

what the empirical rule states?

A

The empirical rule states that for a normal distribution (bell shaped histogram), nearly all of the data will fall within three standard deviations of the mean.

23
Q

the empirical rule

A

In particular, the empirical rule predicts that 68% of observations falls within the first standard deviation (µ ± σ), 95% within the first two standard deviations (µ ± 2σ), and 99.7% within the first three standard deviations (µ ± 3σ)

24
Q

standard error definition

A

• The SD of the sample means, known as the standard error (SE), measures the deviation of individual sample means from the population mean. It measures the variability in the sample means.

25
Q

what small and large SE values indicate

A

small - little variation between the sample means

large - much variation in the sample means

26
Q

what determines the size of SE?

A

The size of the SE depends on the variation between individuals in the population and on the sample size, and is calculated as follows: SE = s/sqrt(n), where s is the SD of the sample and n is the sample size

27
Q

why using a large sample size is beneficial?

A

Larger sample sizes lead to smaller standard errors (the denominator, n, is larger). Large sample sizes produce more precise estimates of the population value in question.

28
Q

does 95% means probability?

A

no

29
Q

what 95% means

A

95% mean that 5% of intervals will not include the population mean

30
Q

why SE is useful

A

Standard error is a useful measure of uncertainty of the mean. However, it is better to use confidence intervals instead

31
Q

define the median

A

The median is the middle observation, that is, the point at which half the observations are smaller and half are larger. Symbolized by Md or M.

32
Q

is median sensitive to extreme values?

A

no

33
Q

symmetric histogram

A

If the mean and the median are equal, the distribution of observations is symmetric.

34
Q

skewed to the right

A

If the mean is larger than the median, the distribution is skewed to the right.

35
Q

skewed to the left

A

If the mean is smaller than the median, the distribution is skewed to the left.

36
Q

define the quartiles

A

• The quartiles of a ranked set of data values are the three points that divide the data set into four equal groups, each group comprising a quarter of the data.