Why lean statistics? Flashcards

Prime my brain for higher level concepts and understanding

1
Q

“Statistics” means…

A

Statistical procedures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The uses of statistics

A
  • Organize and summarize information
  • Determine exactly what conclusions are justified based on the results that were obtained
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Goals of statistical procedures

A
  • Accurate and meaningful interpretation
  • Provide standardized evaluation procedures
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Variable

A

Characteristics or condition that changers or has different values for different individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Data (Plural)

A

Measurements or observation of a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data set

A

A collections of measurements or observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A datum (singular)

A
  • A single measurement or observation
  • Score or Raw score
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Parameter

A
  • A value, usually a numerical value, that describes a population
  • Derived from measurements of the individual in the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Statistic

A
  • A value, usually a numerical value, that describes a sample
  • Derived from measurements of individuals in the sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Descriptive Statistics

A
  • Summarize data
  • Organize data
  • Simplify data

E.g.,
- Tables
- Graphs
- Averages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Inferential Statistics

A
  • Study samples to make generalizations about the population
  • Interpret environmental data

Common terminology
- “Margin of error”
- “Statistically significant”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Sampling error

A

The Sample is never identical to the population.

Sampling error is the discrepancy, or amount of error, that exists between a sample statistic and the corresponding population parameter.

Example: Margin of error in polls
“This poll was taken from a sample of registered voters and has a margin of
error of plus-or-minus 4 percentage points”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Data Structure I: The correlational Method

A
  • One group of participants
  • Measurement of two variables for each participant
  • Goal is to describe the type and magnitude of the relationship
  • Patterns in the data reveal relationships
  • Non-experimental method of study

**Can not establish causation
Characteristics:
Strength
Form (Usually linear)
Direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Data Structure II: Comparing two (or more) groups of scores

A
  • One variable defines groups
  • Scores are measured on the second variable
  • Both experimental and non-experimental studies use this structure

E.g., T-test and ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Experimental Method

A

Goal
- To demonstrate a cause and effect relationship

Manipulation
- The level of one variable (IV) is determined by the experimenter

Control - rules out influence of other variables (confounds)
- Participant variables
- Environmental variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Independent variable

A

The variable manipulated by the researcher (independent because no other variable influences its value - e.g., sex)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Dependent variable

A

The variable that is observed to assess the effect of treatment (dependent because it is thought to depend on the value of the IV)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Experimental method: Methods of control

A
  • Random assignment
  • Matching of subjects
  • Holding level of some potentially influencing variables constant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Experimental Method: Control condition

A
  • Individuals do not receive the experimental treatment.
  • They either receive no treatment or they receive a neutral, placebo treatment
  • Purpose: to provide a baseline comparison with the experimental conditon
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Experimental Method: Experimental condition

A

Those who receive the experimental treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Non experimental Methods

A

Non-equivalent Groups
- Researcher compares groups
- Researcher cannot control who goes into which group

Pre-test/Post-test
- Individuals measured at two points in time.
- Researcher cannot control influence of the passage of time

Independent variable is ‘Quasi-independent’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Quasi-independent

A

Cannot be controlled
e.g., age, sex, traits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Constucts

A
  • Internal attributes or characteristics that cannot be directly observed
  • Useful for describing and explaining behavior
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Observational Definition

A
  • Identifies the set of operations required to measure an external (observable) behavior
  • Uses the resulting measurements as both a definition AND a measurement of a hypothetical construct
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Discrete variable

A
  • Has separate, indivisible categories
  • No values can exist between two neighboring categories

Distinct numbers
e.g., 35, 1, 8, 3

26
Q

Continuous variable

A
  • Has infinite number of possible values between any two observed values
  • Every interval is divisible into an infinite number of equal parts

e.g., height, weight, temp, time

27
Q

Scales of measurement

A
  1. Nominal
  2. Ordinal
  3. Interval
  4. Ratio
28
Q

Nominal Scale

A

Characteristics:
- Label and categories
- No quantitative distinctions (no numbers just names)

Examples:
- Gender
- Diagnosis
- Experimental or control

29
Q

Ordinal Scale

A

Characteristics:
- Categorize observations
- Categories organized by size or magnitude

Examples:
- Rank in class
- Clothing sizes (S, M, L, XL)
- Olympic metals

30
Q

Interval Scale

A

Characteristic:
- Ordered categories
- Interval between categories of equal size
- Arbitrary or absent of zero point

Examples:
- Temperature
- IQ
- Golf scores (above/below par)

31
Q

Ratio Scale

A

Characteristics:
- Ordered categories
- Equal interval between categories
- Absolute zero point

Examples:
- Number of correct answer
- Time to complete task
- Gain in height since last year

32
Q

X

A

Independent variable

33
Q

Y

A

Dependent variable

34
Q

N

A

Number of scores in the population

35
Q

n

A

Number of scores in the sample

36
Q

Σ

A

Summation
- Done after operations in parenthesis, squaring, multiplication and division.

  • Done before other addition or subtraction
37
Q

What is central tendency?

A

A statistical measure
A single score to define the center of the distribution

Purpose: find the single score that is the most typical or best represents the entire group

38
Q

Central Tendency Measures

A

There’s no single concept of central tendency that always the “best”

Different distribution shapes require different conceptualizations of “center”

Choose the one which best represents the scores in a specific situation

39
Q

The mean is…

A

the sum of all scores divided by the number of scores in the data

Population:

Sample:

40
Q

Mean as a balance point

41
Q

The Weighted Mean

A
  • Combine two sets of scores
    Three steps:
    1. Determine the combined sum of all the scores.
    2. Determine the combined number of scores
    3. Divide the sum of scores by the total number of scores

Overall Mean =

42
Q

Computing the Mean from a Frequency Distribution Table

43
Q

Characteristics of the Mean

A
  • Changing the value of a score changes the mean
  • Introducing a new score or removing a score ‘changes’ the mean (unless the score added or removed is ‘exactly’ equal to the mean)
  • Adding or subtracting a constant from each score changes the mean by the same constant
  • Multiplying or dividing each score by a constant multiplies or divides the mean by that constant
44
Q

The Median

A
  • The median is the midpoint of the scores in a distribution when they are listed in order from smallest to largest
  • The median divides the scores into two groups of equal size
45
Q

Locating the Median (odd n)

A
  • Put scores in order
  • Identify the “middle” score to find median

3 5 8 10 11
“Middle” score is 8 so median = 8

46
Q

Locating the Median (even n)

A
  • Put scores in order
  • Average middle pair to find median

1 1 4 5 7 9
(4 + 5) / 2 = 4.5

47
Q

The Mode

A

The mode is the score or category that has the greatest frequency of any score in the frequency distribution
- Can be used with any scale of measurement
- Corresponds to an actual score in the data

It is possible to have more than one mode

48
Q

Bimodal Distribution

49
Q

Symmetrical Distributions

A
  • Mean and median have same value
  • If exactly one mode, it has same value as the mean and the median
  • Distribution may have more than one more or no mode at all
50
Q

Central Tendency in Skewed Distributions

A

Mean, influenced by extreme scores, is found far toward the long tail (positive or negative)

Median, in order to divide scores in half, is found toward the long tail, but not as far as the mean

Mode is found near the short tail.

If Mean – Median > 0, the distribution is positively
skewed.

If Mean – Median < 0, the distribution is negatively
skewed

51
Q

Positive Skew

52
Q

Negative Skew

53
Q

Overview of variability

A

Variability is defined as:
1. Quantitative ‘distance’ measure based on the differences between scores
2. Describes ‘distance’ of the spread of scores or ‘distance’ of a score from the mean

Purpose
- Describe the distribution
- Measure how well an individual score represents the distribution

54
Q

What are the Three Measures of variability?

A
  1. The range
  2. The variance
  3. The Standard deviation
55
Q

The range

A
  • Distance covered by the scores in the distribution (smallest value to the highest)
  • For continuous data, real limits are used
  • Based on two score, not all data (an imprecise, unreliable measure of variability)
56
Q

The most common and most important measure of variability is…

A

Standard deviation
- A measure of standard, or average, distance from the mean
- Describes whether the scores are cluster closely around the mean or widely scattered

The calculation differs for sample and population

Variance is a necessary ‘companion concept’ to standard deviation but NOT the same concept

ADD equation image

57
Q

Defining the Standard Deviation

A

Step one: determine the deviation score
X — μ

Step two: Find a sum of deviations
∑(X — μ)

Step 2 (revised):First square each deviation then sum the ‘squared’ deviations (SS)

SS= ∑(X — μ)^2

Step 3: Variance = Average the ‘squared’ deviations

Step 4: SD=SqrT(Variance)

58
Q

Variance is known as…

A

mean squared deviation

59
Q

Standard deviation

A

Averaged distance of the scores from the mean

60
Q

Sum of squares formulas

61
Q

continue