Test 1 Flashcards

1
Q

Two types of observations are always discrete: __ variables and __ variables.

A

nominal, ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Nominal variables are used for observations that have __ or __ as their values.

You can’t __.

ex: stats data

u__ categories; group m__
(ex: __ of cookie

A

categories, names

You can’t order

For example, when entering data into a statistical computer program, a researcher might code male participants with the number 1 and female participants with the number 2. In this case, the numbers only identify the gender category for each participant. They do not imply any other meaning

unordered, membership, type

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Ordinal variables are used for observations that have __ (i.e., __,__,__) as their values.

You can __

examples:
(ex: __ of favorite cookie)

A

rankings

1st, 2nd, 3rd

order

top 3 favorite cookies, class rank, 3rd place in a race, hair length (short medium long)

ranking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Two types of observations can be continuous: __ variables and __ variables.

Continuous means it can have __ __.

A

interval, ratio

decimal points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Interval variables are used for observations that have __ as their values; the distance (or interval) between pairs of consecutive numbers is assumed to be __.

Interval variable have no __ __.

Ex:

A

numbers, equal

meaningful zero

Temperature in celsius, IQ score, RATING of cookie taste

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

A ratio variable is a variable that meets the criteria for an interval variable but also has a __ __point.

ex:

have a “true z__“-the a__ of what is being m__.

A

interval, meaningful zero

mass of cookie, minutes to finish homework

zero, absence, measured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A scale variable is a variable that meets the criteria for an __ variable or a __ variable.

A

interval, ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

An independent variable has at least __ levels that we either m___ or o___ to determine its effects on the ___ variable.

A

two

manipulate, observe

dependent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The dependent variable is the o___ variable that we hypothesize to be related to or caused by changes in the __ variable.

A

outcome

independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

A confounding variable is any variable that __ varies with the __ variable so that we cannot logically determine which variable is at work; also called a confound.

A

systematically

independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

A reliable measure is __.

ex: bathroom scale

A

consistent

If you were to weigh yourself on your bathroom scale now, and then again in an hour, you would expect your weight to be almost exactly the same. If your weight, as shown on the scale, remains the same when you haven’t done anything to change it, then your bathroom scale is reliable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Descriptive statistics organize, summarize, and communicate a group of numerical observations. Descriptive statistics describe large amounts of data in a s__ n__ or in just a f__ n__.

ex: shark

A

single number, few numbers

National Geographic reported on their Web site that the average adult great white shark is 15 feet long, or 4.6 meters (2015). And that’s just the average. Some great white sharks grow to 20 feet, or 6 meters long! Most dorm rooms aren’t even that big. The average length of great white sharks is a descriptive statistic because it describes the lengths of many sharks in just one number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Inferential statistics use __ data to make estimates about the larger __.

use to generalize beyond our s__ to the p__; infer c__.

A

sample

population

sample, population, characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Frequency tables are organization of number of times a __ has occurred.

List steps: 4 total

A

value

1) Find highest and lowest score
2) create 2 columns: variable and frequency
3) list full range of values (include zero frequency values)
4) count and list number of scores at each value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

A __ __ describes the pattern of a set of numbers by displaying a count or proportion for each possible value of a variable.

A

frequency distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

a __ ___ __ allows researchers to depict data visually by reporting the frequencies within a given interval rather than the frequencies for a specific value.

Use this if there are many __ __ or a __ range of data

A

grouped frequency table

decimal places
large

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Steps to make a grouped frequency table:

1) find __ and __ scores.
2) get __ range of data.
- round numbers __, s__ to get the difference, and add _.
3) Take the given interval and __ the above number by the interval to get how many intervals to use (round __)
4) Make an i__ and f__ column

A
highest, lowest
full
down, subtract, 1
divide, up
interval, frequency
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

A histogram is a graph that looks like a bar graph but depicts just __ variable, usually based on __ data, with the values of the variable on the _-__ (horizontal) and the frequencies on the _-__. (vertical)

(x is along the __, y is along the __)

A

one

scale

x-axis, y-axis

bottom, side

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

In a frequency polygon, a dot is placed __ each score or interval to indicate the frequency, and the dots are __.

A

above

connected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Stem and leaf plots allow for a __ and __ display of all data.
They allow you to regenerate __ data by looking.
You can see the __ of the data.
You can easily __ groups.

A

numerical, visual
raw
shape
compare

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

A normal distribution is a specific frequency distribution that is a __-shaped, s__, __ curve

A

bell-shaped, symmetric, unimodal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Skewed distributions are distributions in which one of the __ of the distribution is pulled away from the center.

A

tails

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

When the tail goes to the right, there is a __ skew.

May represent __ effects.

A

positive

floor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

When the tail goes to the left, there is a __ skew.

May represent __ effects.

A

negative

ceiling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

__ __, a situation in which a constraint prevents a variable from taking values below a certain point.

A

floor effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

___ __, a situation in which a constraint prevents a variable from taking on values above a given number.

A

ceiling effect

27
Q

Graphs have positive and negative uses:

  • they can a__ and s__ present information
  • they can r__ and c__ complicated data
A

accurately, succinctly

reveal, conceal

28
Q

the __ __ lie

  • scaling to skew results
    ex: word use
A

biased scale

using 3 positive words out of 5 options

29
Q

the __ __ lie

-participants are pre-selected or self-selected

A

sneaky sample

30
Q

the ___ lie

-assumes a value between 2 data points follows the same pattern.

A

interpolation

31
Q

the ___ lie

-assumes knowledge outside of study

A

extrapolation

32
Q

the __ __ lie

-uses scaling to distort portions of data

A

inaccurate values

33
Q

A scatterplot is a graph that depicts the relation between two __ variables.

Label the horizontal x-axis with the name of the ___ variable and its possible values, starting with 0 if practical.
Label the vertical y-axis with the name of the __ variable and its possible values, starting with 0 if practical.

A

scale

independent

dependent

34
Q

a linear relation between variables means that the relation between variables is best described by a __ line.

When the linear relation is __, the pattern of data points flows upward and to the right. When the linear relation is __, the pattern of data points flows downward and to the right

A

straight

positive

negative

35
Q

A nonlinear relation between variables means that the relation between variables is best described by a line that __ or __ in some way.

A

breaks, curves

36
Q

A line graph is used to illustrate the relation between two __ variables.
The first type of line graph, based on a scatterplot, is especially useful because the line of best fit minimizes the distances between all the data points from that line. That allows us to use the x value to predict the y value and make predictions based on only one piece of information.

A time plot, or time series plot, is a graph that plots a scale variable on the y -axis as it changes over an increment of time (for example, hour, day, century) labeled on the x -axis

A

scale

37
Q

we commit a Type _ error when we reject the null hypothesis but the null hypothesis is correct.

A Type _ error is like a false positive in a medical test.

A

1

1

38
Q

we commit a Type _ error when we fail to reject the null hypothesis but the null hypothesis is false. A Type _ error is like a false negative in medical testing.

A

2

2

39
Q

If you know that the probability a student is wearing another sport’s shirt is 0.05, how many students does that represent?

A

0.05 * 159 = 7.95 -> 8 students

40
Q

If there are 40,000 people who attend your school, estimate the total number of people who would be wearing a college football shirt.

A

40000 x 0.62 = 24800

41
Q

Bar graph - when to use?

A

1 nominal and 1 scale variable

42
Q

A bar graph is a visual depiction of data in which the __ variable is nominal or ordinal and the __ variable is scale.

The height of each bar typically represents the average value of the dependent variable for each category. The independent variable on the _-__ could be either __ (such as gender) or __ (such as Olympic medal winners who won gold, silver, or bronze medals).

A

independent

dependent

x-axis

nominal, ordinal

43
Q

__ chart, a type of bar graph in which the categories along the x-axis are ordered from highest bar on the __ to lowest bar on the __.

This ordering allows easier comparisons and easier identification of the most common and least common categories.

A

pareto

left, right

44
Q

A pictorial graph is a visual depiction of data typically used for an independent variable with very __ levels (categories) and a __ dependent variable. Each level uses a picture or symbol to represent its value on the scale dependent variable

A

few

scale

45
Q

A pie chart is a graph in the shape of a circle, with a slice for every __ (category) of the independent variable. The size of each slice represents the __ (or percentage) of each category.

A

level

proportion

46
Q

Put the __ __ on the x-axis (p__)

Put the __ __ on the y-axis (p__)

A
independent variable (predictor)
dependent variable (predicted)
47
Q

scatterplots are good for observing __ data point

line graphs show __

both are for __ __ variables

A

every

trends

two scale.

48
Q

Creating a perfect graph general guidelines:

  • use same __ as in body of text
  • tell entire story without forcing reader to back to __
  • use clear __ and label __
  • include _ or d__ s__ marks
A

terms
text
title, axes
0, double slash

49
Q
When is the mode useful?
-if one score \_\_
if 2 are adjacent-a\_\_
if 2 are non-adjacent \_\_-\_\_\_
n\_\_ data 
can also be useful because it's an actually \_\_ score
A
dominated
average 
bi-modal
nominal
occurring
50
Q

Why use the median?

  • it’s unaffected by __ scores
  • can use for __, __, or __ data

downsides:
- can’t use easily in __
- not very __ from sample to sample.

A

extreme
ordinal, interval, ratio

equations
stable

51
Q

Measures of central tendency tell us about the __ of __

A

center of distribution

52
Q

What about the mean?
-most c__ used

disadvantages:

  • influenced by __ scores
  • value not an actually __ score
  • interpretation requires __ or __ level data

advantages:

  • can be manipulated __
  • more __ across samples, better estimate of __ mean
A

commonly

extreme
occurring
interval, ratio

algebraically
stable, population

53
Q

Where the mode median and mean are located relative to each other depends on shape of the frequency distribution.

bimodal: describe

negatively skewed describe:

positively skewed:

unimodal:

A

bimodal: mode in center of each bell shape and median and mean directly in middle.

negatively skewed: tail to the left and from left to right mean, median, and mode

positively skewed: tail to the right and from left to right mode, median, mean

unimodal: mean=median=mode

54
Q

numbers that represent population= __
-population mean=__ this is g__

numbers that represent sample=__
-sample mean= __ or __ this is l__

A

parameters
u shaped looking greek letter

statistics
M or x with line over top. latin

55
Q

if .212 of mnm’s were blue. how many blue were there?

A

x/2561=.212

.212(2561)=542

56
Q

___=proportion expected over the long run

-decimal value

A

probability

57
Q

The _____ in probability and statistics, states that as a sample size grows, its mean gets closer to the average of the whole population.

A

law of large numbers

58
Q

___=number of successes divided by the number of trial

-decimal value

A

proportion

59
Q

__=probability of proportion multiplied by 100

A

percentage

60
Q

Discrete Variables:

variables that can only take on s__ values.
(ex: w__ numbers)

no other values can exist b__ these numbers.

A

specific, whole

between

61
Q

Continuous Variables:

variables that can take on a full r__ of v__.

e.g., numbers out to several d__ places); an i__ number of potential values exists

A

range, values

decimal, infinite

62
Q

Experiments: studies in which participants are r__ assigned to a c__ or l__ of one or more i__ variables. The d__ variable is measured.

Experiments are the gold standard of h__ testing because they are the best way to control c__ variables. Controlling confounding variables allows researchers to infer a c__–e__ relation between variables, rather than merely a systematic a__ between variables

A

randomly, condition, level, independent

dependent

hypothesis, confounding, cause-effect, association

63
Q

Correlational Studies: variables are simply m__; usually still have c__ variables that make it inappropriate to make a c__ statement.

A

measured, confounding, causal

64
Q

HOW TO INTERPRET A CORRELATION COEFFICIENT R:

Exactly –_. A perfect downhill (negative) linear relationship

–_. A strong downhill (negative) linear relationship

–_. A moderate downhill (negative) relationship

–_. A weak downhill (negative) linear relations

_. No linear relationship

+_. A weak uphill (positive) linear relationship

+_. A moderate uphill (positive) relationship

+_. A strong uphill (positive) linear relationship

Exactly +_. A perfect uphill (positive) linear relationship

A
  • 1
  • 0.70
  • 0.50
  • 0.30

0

+0.30

+0.50

+0.70

+1