Unit 1 Flashcards

1
Q

what is the study of statistics?

A

the set of methods for obtaining, organizing, summarizing, presenting, and analyzing data

2
Q

what is data?

A

a set of measurements or observations taken on a group of objects
ex. the responses of the people taking the survey

3
Q

what is a population?

A

the totality of individuals or units about which we want information
ex. all people in Manitoba taking the survey

4
Q

what is a variable?

A

a characteristic or property of an individual or unit

5
Q

what are some examples of variables?

A

hair color
height
your grade in this course
marital status

6
Q

what is a sample?

A

a subset of units in a population that we examine in order to gather information about the population

7
Q

what does categorical ordinal mean?

A

the categories follow a natural order, and that order is meaningful

8
Q

what are examples of categorical nominal variables?

A

gender (female or male)
marital status (married, widowed, divorced)

9
Q

what are quantitative variables?

A

variables whose values are counts or are obtained by measurement
it makes sense to take their average

10
Q

what are examples of quantitative variables?

A

distance run in 45 minutes
a measurement in cm taken on each student in this classroom
square footage of your house

11
Q

what does the distribution of a variable tell us?

A

what values it takes and how often it takes each of these values

12
Q

what type of charts can we use with categorical variables?

A

bar charts
pie charts

13
Q

what type of charts can we use with quantitative variables?

A

histograms
timeplots

14
Q

what is the difference between a bar chart and a histogram?

A

in a bar chart the bars don’t touch
in a histogram the bars do touch

15
Q

what do pie charts give us a visual representation of?

A

relative frequency
proportion of the observed values

16
Q

what does a frequency distribution table look like?

A

31 37 40 44 49 50 51 53 56 56
62 64 67 67 68 68 69 70 71 72
73 73 74 75 77 78 78 81 82 84
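
The values above are ordered raw data; a frequency distribution table groups them into classes and counts how many values fall in each class. A minimal sketch in Python, assuming classes of width 10 (30-39, 40-49, ...). The card does not state the classes, so `CLASS_WIDTH` and the helper `frequency_table` are illustrative choices, not part of the original card.

```python
from collections import Counter

# Ordered data copied from the card above.
data = [31, 37, 40, 44, 49, 50, 51, 53, 56, 56,
        62, 64, 67, 67, 68, 68, 69, 70, 71, 72,
        73, 73, 74, 75, 77, 78, 78, 81, 82, 84]

CLASS_WIDTH = 10  # assumption: the card does not say which classes to use


def frequency_table(values, width):
    """Return (lower, upper, frequency, relative frequency) rows.

    Each class is the interval [lower, upper). Classes that contain
    no values are not listed.
    """
    counts = Counter((v // width) * width for v in values)
    n = len(values)
    return [(low, low + width, counts[low], counts[low] / n)
            for low in sorted(counts)]


table = frequency_table(data, CLASS_WIDTH)

print("class   freq   rel. freq")
for lower, upper, freq, rel in table:
    # Labels such as 30-39 assume whole-number data.
    print(f"{lower}-{upper - 1}   {freq:4d}   {rel:.3f}")
```

The frequencies add up to the number of observations (30), and the relative frequencies add up to 1.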

17
Q

what are the two types of quantitative variables?

A

continuous variable
discrete variable

18
Q

what values can a continuous variable take?

A

any value within a given range
ex. weight and distance

19
Q

what values can a discrete variable take?

A

only a countable number of values
ex. number of children in a family and the number of days of rain in a month

20
Q

what do we look for in a histogram? (5)

A

shape
any gaps
peaks (centre)
spread (how variable the values of the data are)
outliers (observations that fall away from the overall pattern)

21
Q

what are the 3 types of shapes a histogram can have?

A

approximately symmetric
skewed to the left
skewed to the right

22
Q

why would we use a timeplot?

A

when our data are gathered in sequence over a period of time

23
Q

what is a trend?

A

a persistent long-term rise or fall in a time series

24
Q

what is seasonal variation?

A

a pattern that repeats itself at regular intervals of time

25
Q

what do we use to measure the centre of our data?

A

we use a measure of central tendency

26
Q

what are the two measures of central tendency?

A

mean
median

27
Q

mean

A

the average value: the sum of the observations divided by the number of observations

28
Q

median

A

in a set of ordered data the median is the value that splits the data into two equal parts

29
Q

what is the sum of the deviations from the mean always equal to?

A

0
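
A quick numerical check of this fact, as a sketch in Python. The five data values are made up for illustration and do not come from the course. Each deviation is an observation minus the mean, so the negative and positive deviations cancel.

```python
from statistics import mean

# Made-up sample, chosen so the mean is a whole number.
data = [2, 4, 4, 5, 10]

x_bar = mean(data)                      # 25 / 5 = 5
deviations = [x - x_bar for x in data]  # -3, -1, -1, 0, 5
total = sum(deviations)

print("mean:", x_bar)
print("deviations:", deviations)
print("sum of deviations:", total)
```

With data whose mean is not exactly representable, floating-point rounding can leave a tiny nonzero sum, so compare against zero with a tolerance (for example `math.isclose(total, 0, abs_tol=1e-9)`).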

30
Q

what is an outlier?

A

a point that falls far away from the majority of the data

31
Q

is the median robust (resistant) or not robust (not resistant) to outliers?

A

median is robust to outliers (not affected)
mean is affected by outliers
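
A small illustration in Python, using made-up numbers rather than course data: replacing the largest value with an extreme outlier moves the mean a lot and leaves the median unchanged.

```python
from statistics import mean, median

# Made-up values for illustration.
original = [10, 12, 13, 15, 20]
# Same data, with the largest value replaced by an extreme outlier.
with_outlier = [10, 12, 13, 15, 200]

mean_before, mean_after = mean(original), mean(with_outlier)
median_before, median_after = median(original), median(with_outlier)

print("mean:  ", mean_before, "->", mean_after)      # 14 -> 50
print("median:", median_before, "->", median_after)  # 13 -> 13
```

The outlier also pulls the mean above the median, which is the pattern described on the card about distributions skewed to the right.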

32
Q

how do the mean and median compare in a symmetric distribution?

A

the mean and median are equal
(approximately equal when the distribution is only approximately symmetric)

33
Q

how do the mean and median compare when a distribution is skewed to the left?

A

the mean is typically less than the median

34
Q

how do the mean and median compare when a distribution is skewed to the right?

A

the mean is typically greater than the median

35
Q

how can we measure the spread of our data?

A

one measure is the range

36
Q

what is the range?

A

written as R
a measure of spread, calculated as maximum - minimum

37
Q

what are characteristics of the range? (3)

A

the larger the value of R the more variable the data are
R measures the length of the interval containing 100% of the data
the range is affected by outliers

38
Q

what is IQR (interquartile range)?

A

Q3 - Q1, the spread of the middle 50% of the data

39
Q

when do we use the five number summary?

A

when describing our distributions with numbers

40
Q

what does the five number summary consist of?

A

minimum
first quartile (Q1)
median
third quartile (Q3)
maximum
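
A minimal Python sketch of the five-number summary, together with the range and IQR from the earlier cards. It assumes the quartile convention often used in introductory courses: Q1 is the median of the values below the median position and Q3 is the median of the values above it. Other conventions and software can give slightly different quartiles. The function name `five_number_summary` and the data are made up for illustration.

```python
from statistics import median


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum).

    Assumed convention: when n is odd, the middle observation is
    left out of both halves before taking their medians.
    Needs at least two observations.
    """
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]           # values below the median position
    upper = ordered[half + n % 2:]   # values above the median position
    return (ordered[0], median(lower), median(ordered),
            median(upper), ordered[-1])


# Made-up data (n = 9) for illustration.
data = [1, 3, 4, 7, 8, 10, 12, 15, 20]

summary = five_number_summary(data)
minimum, q1, med, q3, maximum = summary

data_range = maximum - minimum  # R = maximum - minimum
iqr = q3 - q1                   # IQR = Q3 - Q1

print("five-number summary:", summary)  # (1, 3.5, 8, 13.5, 20)
print("range:", data_range)             # 19
print("IQR:", iqr)                      # 10.0
```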

41
Q

what does a boxplot consist of?

A

a rectangle (box) drawn from Q1 to Q3 with a line at the median, and whiskers extending from the box to the minimum and maximum values

42
Q

what is the standard deviation?

A

written as s
measure of spread around the mean

43
Q

what is the variance?

A

written as s^2
the square of the standard deviation

44
Q

what is degrees of freedom?

A

n - 1, the denominator in the formula for the sample variance s^2
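
A sketch in Python of where n - 1 appears, computing s^2 and s by hand and comparing them with the standard library's `statistics.variance` and `statistics.stdev`, which also divide by n - 1. The data values are made up for illustration.

```python
import math
from statistics import mean, stdev, variance

# Made-up sample for illustration.
data = [2, 4, 4, 5, 10]

n = len(data)
x_bar = mean(data)  # 5

# Sum of squared deviations from the mean: 9 + 1 + 1 + 0 + 25 = 36
sum_sq_dev = sum((x - x_bar) ** 2 for x in data)

s_squared = sum_sq_dev / (n - 1)  # variance: 36 / 4 = 9
s = math.sqrt(s_squared)          # standard deviation: 3

print("s^2 by hand:", s_squared, " library:", variance(data))
print("s by hand:  ", s, " library:", stdev(data))
```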

45
Q

when do we use the standard deviation as a measure of spread?

A

when the mean (x̄) is the measure of centre

46
Q

when does the standard deviation equal zero?

A

when there is no spread about the mean (all observations are equal)

47
Q

is standard deviation affected by outliers?

A

yes! like the mean, the standard deviation is not robust to outliers