Key Terms Flashcards

1
Q

What is a sample?

A

Individuals selected from a population, intended to represent the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a variable?

A

Characteristic or condition that changes or has different values for different individuals.

And is the operalization of a construct/concept.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is data? A data set?

A

A score (singular), single measurement. Data set is a collection of measurements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the two types of statistical methods?

A

Descriptive and inferential.

Descriptive are the basic info, tables and characteristics of the sample. They are used to summarise, organise and simplify the data.

Inferential is where you answer your research question. It is the techniques that allows us to study samples and then make generalisations about the populations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How can you minimise sampling errors?

A
  1. A larger sample size or 2. through different sampling methods.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are 2 sampling methods?

A
  1. Probability sampling: simple random sample, systematic sample, stratified sample, cluster sample (random subgroups first).
  2. Non Probability sampling: subjective, based on judgement of research. For example, convenience sampling or snowball sampling. It is hard to be representative.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the key aspects of a good research question?

A

It should be original: something new should be included. It should be specific, researchable/feasible and meaningful to save practical problems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How are hypothesis formed?

A

H0 - The null hypothesis, this is what is considered first.
H1 - alternative hypothesis. Competes with null hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are variables?

A

Variables are the operalization of a concept/construct.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some examples of nominal scales/categories?

A

Something with no ranking between categories. Names/Categorical/Different Names. Ex: Nationality, gender, major of study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some examples of ordinary scales/categories?

A

Something with ranking between categories. These are sets that can be ordered in a sequence, though the interval may not be equal. Ex: Ranking in sports, dress size, level of education.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are some examples of interval scales/categories?

A

There is no true “0” score, the value can go below 0. Ex: Temperature.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are some examples of ratio scales/categories?

A

Ordered categories of exactly same size, whole #s or fractions. 0 true value. Never falls below 0. ex: age, weight.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the two criteria to judge if your measurement is good?

A

Reliability: can you produce consistent results over time?

Validity: Does is measure what it is supposed to measure?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the 3 types of research methods?

A
  1. Correlational method (observe two variables as they exist naturally. Is there a relationship between them?).
  2. Experimental study (can analyse cause and effect. Difficult as you have to manipulate a condition, one variable).
  3. Longitudinal study (repeated cross sectional studies, prospective studies, retrospective studies. Anything that is studied more than once).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the correlational method of research?

A

Observing two variables as they exist naturally. Is there a relationship between them? Limits: you can’t determine cause and effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is an experimental study?

A

A study in which you manipulate a variable or condition. You can analyse cause and effect. Independent variable is the cause, which is manipulated. (The type of medicine, yoga vs cardio). The dependent variable is the effect, the outcome. This is what is observed and measured to assess the effect of the treatment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a quasi experimental method?

A

An experiment in which you are just observing and not interfering. You observe the independent variable without manipulating it, and measure the dependent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the independent variable vs the dependent variable?

A

The independent variable is manipulated, it is the cause. For example, the type of medicine, or yoga vs cardio.

The dependent variable is the effect, the outcome. This is what is observed and measured to assess the effect of the treatment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What are the three types of longitudinal studies?

A

Repeated cross sectional studies (trend studies), prospective studies (same participants followed over time) and retrospective studies (designed after some have experienced events of relevance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Speak to the shape of frequency distribution.

A

1 - the shape: is it symmetrical? Bell shape is normal. The mean, median and mode over lap in the highest part of the bell. Positive skew (the tail of the chart points toward the positive, above 0, left to right). Negative skew.
2 - the central tendency: the mean, median and mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the mean?

A

It is the average. There is a population mean vs a sample mean. The weighted mean is calculated by combining the two samples to calculate the overall mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the median?

A

It divides the set into two equal parts, and finds the middle number. Without outliers or extreme values, this is a good measure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the mode?

A

Mode is the greatest frequency. It could be multiple and is not affected by extreme values. It describes qualitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is variability?

A

Variability, also dispersion, shows us the differences between scores in a distribution and describes the degree to which the scores are spread out or clustered. If there are small differences between scores, then variability is small. If there are large differences between scores, then variability is large.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

How can we measure the difference between scores?

A

1 - The range (gives us a general idea of the difference in scores. It is a poor measurement of distribution. It is X max - X min).

2 - The deviation score (the difference between the score + the mean. It is more precise. Deviation score should add to 0. It is X - mean).

3 - Variance (the average squared distance from the mean). It takes into account the “N” or sample size.

27
Q

What is sum of squares?

A

Sum of squares is the sum of the squared deviation scores. It helps us to measure the difference between scores.

28
Q

What is standard deviation?

A

It is the square root of the variance. It shows you more clearly how close an individual score is to the mean. Helps you measure difference between scores.

29
Q

What is the coefficient of variation?

A

The coefficient of variation measures the relative variation between two samples or data sets. It compares the variation of two data sets.

30
Q

What do we use percentile ranks and percentiles?

A

We use these to see where a single score is in the overall rankings. 50th percentile = the median =P50. Based on this, we compute quartiles, and split it into 4 quartiles.

31
Q

What do we use a box plot for?

A

It gives us a better idea of distribution of scores, the concentration and symmetry of data. Ex: the median is close to the 1st quartile you can say the data is positively skewed.

32
Q

What is the outlier? Why are we interested in it?

A

Outlier = IQR. Technical definition is that it either exceeds the value of 3rd quartile by 1.5*

we are interested because one single outlier can change all measures, it impacts the mean, variants value, sum of deviation, etc.

33
Q

What does variability mean for our research question?

A

If there is a huge amount of variability you may lose data to see meaningful patterns in results. It is hard to understand what is going on and see a clear pattern.

34
Q

What are extreme values?

A

Extreme values are values that occur in the extreme parts of the distribution, which differ from 95% of the scores. This is most likely not by chance and are a significant difference.

35
Q

What is a Z score?

A

A raw score won’t provide information on the position of the score in the data, so we translate these X values into Z scores, so we can see where they are located in the distribution. Second purpose is to standardise an entire distribution, so we can directly compare it to other distributions also transformed into Z scores. We normalise the data, and have 95% of data within the 2 standard deviations.

36
Q

When do we have normal distribution? (Also called Gaussian distribution)

A

Commonly occurring shape for probability distributions. Includes 4 things:
1. It is symmetrical around its mean.
2. The mean, the median and the mode are all equal.
3. The symmetry, already mentioned, 50% of the area is to the right and 50% of the left.
4. Normal distribution is completely determined by the parameters u and o.

If kurtosis and skewness are both in the range of -1 to +1, there is normal distribution.

37
Q

What is the mean and standard deviation?

A

The mean is 80 (represented by the line in the middle of this frequency distribution graph) and the standard deviation is 8 (represented by the sigma sign in the graph).

38
Q

Identify the range and quartiles.

A
39
Q

Definition of probability?

A

The probability of a specific outcome is defined as a fraction or proportion of all the possible outcomes.

40
Q

What is a random sample?

A

A random sample means each individual has an equal chance of being selected.

41
Q

How do you find the probability for a specific score?

A
  1. Transform the X values into z-scores.
  2. Use the unit normal table to look up the proportions corresponding to the z-score values.
42
Q

How do you determine whether a treatment has an effect?

A

Researchers will compare the treated sample with the original population. If the sample scores are similar to the population mean, then we conclude that the treatment had no effect. If the scores are noticeably different, then there is evidence that the treatment does have an effect.

43
Q

What is a typical example of a nominal variable? What is a good chart to show this?

A

Gender, pie chart.

44
Q

What is a typical example of a continuous scale variable?

A

Age

45
Q

What is a typical example of an ordinal variable? What is a good chart to show this?

A

Level of Education. Bar Graph.

46
Q

In SPSS, what is a quick way to look at important descriptive statistics from your data?

A

Select the row of variable you are interested in, right click and select “descriptive statistics”.

47
Q

What are 3 characteristics of the distribution of sample means?

A

1 - The sample means should pile up around the population mean.
2 - The pile of sample means should tend to form a normal-shaped distribution.
3 - In general, the larger the sample size, the closer the sample means should be to the population mean.

Logistically, a large sample should be better representative than a small sample.

48
Q

What is the central limit theorem?

A

Central Tendency
Standard Error
Shape

Central tendency: the mean of the distribution sample means is equal to the mean of the population scores and is called the expected value of M.

Standard Error: The standard deviation of the distribution of sample means, M, is called the standard error of M. The standard error provides a measure of how much distance is expected on average between a sample mean M and the population mean given a specific sample size.

Shape: Law of large numbers states that the larger the sample size (n), the more probable it is that the sample mean is close to the population mean.

49
Q

What is the relation between standard error and sample size?

A

As the sample size is increased, there is less error between the sample mean and the population mean.

When the sample consists of a single score (n=1), the standard error is the same as the standard deviation (oM = o).

50
Q

What is the 4 steps approach to hypothesis testing?

A
  1. State the hypotheses
  2. Set the decision criteria
  3. Collect data and compute the statistics
  4. Make a decision
51
Q

Explain H0 and H1.

A
52
Q

What are the commonly used alpha levels?

A

a = .05 (5%), a = .01 (1%) and a = .001 (.1%).

You must define what is meant by low or high probability as part of the hypothesis, or the level of significance.

53
Q

How do you set the decision criteria?

A

By defining the alpha level or level of significance. The critical region is composed of the extreme sample values that are very unlikely, to be obtained if the null hypothesis is true.

54
Q

What are the 2 outcomes that are possible as you make your decision regarding the hypothesis?

A

1 - The sample data are located in the critical region. A sample value in the critical region is very unlikely to occur if the null hypothesis is true. Decision to reject the null hypothesis (H1 is supported).

  1. Sample data set are not in the critical region. In this case, the sample mean is reasonably close to the population mean specified in the null hypothesis (in the center of the distribution). Data does not provide strong evidence that the null hypothesis is wrong, our conclusion is to fail to reject the null hypothesis. This conclusion means that the treatment does not appear to have an effect. (Retain H0).
55
Q

What are the types of errors?

A

Type I Error: Researcher rejects a null hypothesis that is actually true. In a typical research situation, a Type 1 error means that the researcher concludes that a treatment does have an effect when, in fact, it has no effect. Alpha level is the probability that the test will lead to a type I error.

Type II Error: Occurs when a researcher fails to reject a null hypothesis that is really false. In a typical research situation, a Type 2 error means that the hypothesis test has failed to detect a real treatment effect. Probability of type II error is represented by the symbol B, beta.

56
Q

What is a Type I Error?

A

Type I Error: Researcher rejects a null hypothesis that is actually true. In a typical research situation, a Type 1 error means that the researcher concludes that a treatment does have an effect when, in fact, it has no effect.

Alpha level is the probability that the test will lead to a type I error.

57
Q

What is a Type II Error?

A

Type II Error: Occurs when a researcher fails to reject a null hypothesis that is really false. In a typical research situation, a Type 2 error means that the hypothesis test has failed to detect a real treatment effect. Probability of type II error is represented by the symbol B, beta.

58
Q

What is the primary concern of selecting an alpha level?

A

The primary concern when selecting an alpha level is to minimise the risk of Type I Error. By convention, the largest permissible value is .05.

Because the consequences of Type I error can be relatively serious, many prefer to use more conservative alpha levels such as .01 or .001.

59
Q

What is the effect size?

A

A measure of effect size is intended to provide a measurement of the absolute magnitude of a treatment effect, independent of the size of the sample(s) being used.

Cohen’s d evaluates the effect size.

60
Q

What is directional (one tailed) hypothesis testing?

A

This is a better test if you don’t expect results in the other tail.

In a directional hypothesis test, or a one tailed test, the statistical hypothesis H0 and H1 specify either an increase or a decrease in the population mean. They make a statement about the direction of the effect. The alpha level is not divided between two tails, it is contained entirely in one tail.

61
Q

When do you use sample variability?

A

When the variability for a population is not known, we use sample variability. The result of the new test statistic, called “t statistic”.

62
Q

When do you use the t statistic?

A

When the variability for a population is not known (OM), you substitute the estimated standard error in the denominator of the z score formula:

63
Q

What are you assessing with Pearson’s r? How is it calculated?

A

The covariability of x and y, over the variability of x and y separately.