Exam 1 in-class notes Flashcards

1
Q

Population description

A

The set of all the individuals of interest in a particular study

Vary in size; often quite large

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Population notation

A

N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Sample definition

A

A set of individuals selected from a population.

Usually intended to represent the population in a research study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sample notation

A

n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Variable

A

Characteristic or condition that changes or has different values for different individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Parameter (or parameter estimate)

A

A value that describes a population

Derived from measurements of the individuals in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Statistic

A

A value that describes a sample.
Derived from measurements of the individuals in the sample.
Statistics are parameter estimates.We derive a statistic because we are trying to estimate a parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Descriptive statistics goals

A

Summarize data
Organize data
Simplify data (means, tables, graphs, etc)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Inferential statistics goals

A

Study samples to make generalizations about the population.

Interpret experimental data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Sampling error

A

the distance between a sample statistic and a population parameter.

Since the parameters are typically unknown we must estimate sampling error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Sampling error indicators

A

1) Variability. Low variability = low sampling error

2) Sample size. Large sample = low sampling error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Constructs

A

Internal attributes or characteristics that cannot be directly observed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Operational definition

A

Identifies the set of operations required to measure an external (observable) behavior.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Discrete variable

A

Has separate, indivisible categories.
No values can exist between 2 neighboring categories (i.e. cannot have 1.5 kids)

(these variable types refer to the underlying construct, not the operational definition)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Continuous variable

A

Has an infinite number of possible values between any two observed values.

(these variable types refer to the underlying construct, not the operational definition)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Nominal (named)

A

Labeled groups with no inherent quantitative relationship between each (e.g. diagnosed with disorder or not, restaurant options)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Ordinal (ordered or ranked)

A

Categorized observations by size or magnitude (i.e. S, M, L, XL shirt sizes, class ranking, placement in a race)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Interval

A

Ordered categories with equal size between categories of equal size.

Arbitrary zero point (IQ, Fahrenheit, Celsius)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Ratio

A

Ordered categories with equal size between categories of equal size

Non arbitrary zero point (height, weight, Kelvin)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Non-experimental (i.e. cross sectional) studies
Describe

IV
DV

A

examine associations between variables - no manipulation involved.

IV in these studies is called the Predictor variable.
DV is called the Outcome variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Statistical Notation

1) IV (predictor variable) is commonly referred to as __ variable
2) DV (outcome variable) is commonly referred to as the __ variable.

A

1) X

2) Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

N =

A

the number of scores within a population (the population size)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

n =

A

the number of scores within a sample (the sample size)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Sigma ∑

A

defines what needs to be summed in a formula

25
Order of Operations When is ∑ done?
Summation is done AFTER operations in parenthesis, exponents and multiplication or division. Summation is done BEFORE other addition or subtraction
26
∑f = N
Meaning you sum the NUMBER of scores, not the value of those scores. (add a pic to this slide)
27
``` Grouping Distributions - Rules of Thumb. 1. 2. 3. 4. ```
1. Intervals should all be the same width. 2. Make the lowest score of an interval a multiple of the width (e.g. 1.0-1.9, not 1.1-2.0) 3. Use roughly 10 class intervals but use your best judgement (e.g. short report format > fewer; larger research report > more) 4. Make the interval width a relatively simple number (e.g. 1, 2, 5, 10)
28
Histograms rules of thumb
1. X-axis lists intervals increasing from left to right, starting with 0 (unless negative scores) 2. Y-axis lists frequencies increasing from bottom to top, starting with 0 (unless negative scores) 3. Do not exclude intervals with a frequency of 0 4. Width corresponds to score's real limits (when ungrouped; class interval when grouped) 5. Use for interval and ratio data
29
When is it best to use a Histogram
for interval and ratio data
30
Polygons
1. Height of the bars in a histogram are replaced by a dot in polygon graphs. Dot height = frequency. 2. Connect dots with continuous line 3. Close the polygon with lines to the Y=0 point 4. Can be adapted to grouped frequency distribution data
31
When to use a bar graph
Nominal & ordinal data, to visualize frequency distributions
32
Nominal & Ordinal Data - how to display
1. Bar graph instead of histogram 2. Signifies to readers the nature of the data at a glance 3. Amount of space between bars does not matter, but they should be equal
33
Nominal scales
REMEMBER: Nominal (named) scales have no inherent order or value associated to the score. Data sets contain numbers, and usually not names so 1 = male, 2 = female, 3 = other; or 1 = Caucasian, 2 = African American, 3 = Asian/Pacific Islander, 4 = Latino/Hispanic, etc… The value associated with the response is fairly meaningless and only serves to make distinctions between the groups.
34
Ordinal scales
REMEMBER: Ordinal (ordered or ranked) scales do provide information about inherent order, but do not indicate what the distances are between the ranks. E.g., in a 100m race, the ordinal scale will tell us who finished 1st, 2nd, 3rd, etc… but this scale does not tell us how spread out or clustered together the racers were (was the person in 1st place way ahead of the rest? Or did they barely win?)
35
Interval & ratio scales
REMEMBER: Interval & ratio scales contain information about the intervals or “distance” between values. Specifically, that they are equal. These scales provide the most information
36
Nominal data doesn't contain much information. 1) what does it have? 2) Best way to display?
1) Frequency is really all we've got, and from that we can get proportions. 2) Bar charts (better for emphasizing frequency), Pie charts (better for emphasizing proportions)
37
Bar charts with nominal data are better (than pie charts) for...
emphasizing frequency
38
Pie charts with nominal data are better (than bar graphs) for000
emphasizing proportions
39
Histograms vs Bar charts
Bar charts imply distinct categories. | Histograms imply continuous variable measurements
40
Visualizing frequency distributions - 2 types
Symmetrical | Skewed
41
Symmetrical frequency distributions
Each side of the distribution is a mirror image of its counterpart
42
Skewed frequency distributions
Scores tend to pile up on one side of the distribution or the other, which creates a "tail"
43
Positive skew
Scores pile up on the LOW end of the measure (arrow on tail points towards + side)
44
Negative skew
Scores pile up on the HIGH end of the measure (arrow on tail points towards - side)
45
Central Tendency
A single score (value) that best represents the center of a distribution
46
µ = ∑x÷N
Population Frequency Distribution
47
M = ∑x÷n
Sample Frequency Distribution
48
Population Frequency Distribution Equation (Mean)
µ = ∑x÷N
49
Sample Frequency Distribution Equation (Mean)
M = ∑x÷n
50
Mean
Sum of the scores divided by the number of scores in the data. The amount each individual receives when the total is divided equally among all the individuals in the distribution
51
Median 1) What it is 2) how to calculate
MID. Midpoint of the scores in a distribution when they are listed in order from smallest to largest. It divides the distribution into two groups of equal size. Step 1. List scores from smallest to largest Step 2. Pick the middle score (if an odd number of scores). For even number of scores - divide the middle two values - i.e. if middle scores are 4 and 5, median would be 4.5.
52
Mode
Which score got the MOST. The score or category that has the greatest frequency of any score in the frequency distribution. - Corresponds to an actual score in the distribution. - It is possible to have more than 1 mode - Distributions can also have none when they represent a uniform distribution.
53
In a skewed distribution: 1) Mean 2) Median 3) Mode
1) gravitates toward the tail in a skewed distribution 2) gravitates toward the tail, but not as much as the mean 3) tends to be closer to the short tail
54
If the mean - median > 0 TEST QUESTION
the distribution is positively skewed
55
If the median - mean < 0 TEST QUESTION
the distribution is negatively skewed
56
the distribution is positively skewed if the mean - median _(greater than or less than)_ 0 TEST QUESTION
>
57
the distribution is negatively skewed if the mean - median _(greater than or less than)_ 0 TEST QUESTION
58
Vector
?data set?