Everything Flashcards

1
Q

What is raw data?

A

Unprocessed data that has just been collected and needs to be ordered, grouped, rounded, or cleaned.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define qualitative data.

A

Non-numerical, descriptive data such as eye/hair colour or gender.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What type of data is easier to analyze, qualitative or quantitative?

A

Quantitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Give an example of quantitative data.

A

Height, weights, marks in an exam.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is discrete data?

A

Data that only takes particular values, such as shoe size or number of people.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is continuous data?

A

Data that can take any value, such as height or weight.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define categorical data.

A

Data that can be sorted into non-overlapping categories, such as gender.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is ordinal data?

A

Quantitative data that can be given an order or ranked on a rating scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does bivariate data involve?

A

Measuring two variables, which can be qualitative or quantitative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is multivariate data?

A

Data made up of more than two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the purpose of grouping data?

A

To make it easier to spot patterns and see how the data is distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

True or False: Discrete data can be grouped into overlapping classes.

A

False.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a primary data source?

A

Data that you have collected yourself or someone has collected on your behalf.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define secondary data.

A

Data that has already been collected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a population in statistics?

A

Everyone or everything that could be involved in the investigation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a census?

A

A survey of the entire population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Fill in the blank: A _______ is a smaller number from the population that you actually survey.

A

Sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a sampling frame?

A

A list of all the members of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is a biased sample?

A

A sample that does not represent the population fairly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Define random sampling.

A

Every item/person in the population has an equal chance of being selected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is stratified sampling?

A

Sampling where the size of each group in the sample is in proportion to the sizes of those groups in the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is systematic sampling?

A

Choosing items in the population at regular intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Define cluster sampling.

A

The population is divided into natural groups, and groups are chosen at random with every member sampled.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is quota sampling?

A

Population is grouped by characteristics and a fixed amount is sampled from every group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Fill in the blank: Opportunity sampling uses the people/items that are _______.

A

Available at the time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is judgment sampling?

A

When the researcher uses their own judgment to select a sample they think will represent the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What does the Petersen Capture-Recapture method estimate?

A

The size of large or moving populations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is an explanatory variable?

A

The variable that is changed in an experiment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Define response variable.

A

The variable that is measured in an experiment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is a sample size?

A

Size large enough and representative of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is an experiment?

A

Used when a researcher examines how changes in one variable affect another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Define Explanatory (Independent) Variable.

A

The variable that is changed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Define Response (Dependent) Variable.

A

The variable that is measured.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What are Extraneous Variables?

A

Variables not of interest but that could affect the result of your experiment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What characterizes Laboratory Experiments?

A

Researcher has full control over variables; conducted in a lab or similar environment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Give an example of a Laboratory Experiment.

A

Measuring reaction times of people of different ages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is the Explanatory variable in the laboratory example?

A

Age.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is the Response variable in the laboratory example?

A

Reaction time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

List some Extraneous variables in laboratory experiments.

A
  • Gender
  • Health condition
  • Fitness level.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What are advantages of Laboratory Experiments?

A
  • Easy to replicate
  • Extraneous variables can be controlled.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is a disadvantage of Laboratory Experiments?

A

People may behave differently under test conditions than in real life.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What are Field Experiments?

A

Carried out in the everyday environment with some control over variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Give an example of a Field Experiment.

A

Testing new methods of revision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is the Explanatory variable in the field experiment example?

A

Method of revision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is the Response variable in the field experiment example?

A

Results in exam.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

List some Extraneous variables in field experiments.

A
  • Amount of revision pupils do
  • Ability of pupils.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What are the advantages of Field Experiments?

A

More accurate; reflects real life behaviour.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What is a disadvantage of Field Experiments?

A

Cannot control extraneous variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What are Natural Experiments?

A

Carried out in everyday environments with little control over variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

Give an example of a Natural Experiment.

A

The effect of education on level of income.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What is the Explanatory variable in the natural experiment example?

A

Level of education.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

What is the Response variable in the natural experiment example?

A

Income.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

List some Extraneous variables in natural experiments.

A
  • IQ
  • Other skills individuals may have
  • Personal circumstances.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What is an advantage of Natural Experiments?

A

Reflects real life behaviour.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

What are disadvantages of Natural Experiments?

A
  • Low validity
  • Difficult to replicate.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What is a Simulation?

A

A way to model random events using random numbers and previously collected data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

What are the steps in conducting a Simulation?

A
  • Choose a suitable method for getting random numbers
  • Assign numbers to the data
  • Generate random numbers
  • Match random numbers to outcomes.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

What is a Questionnaire?

A

A set of questions used to obtain data from the population/sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

What types of questions can be included in a Questionnaire?

A
  • Open questions
  • Closed questions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

What are features of a good questionnaire?

A
  • Easy to understand
  • Uses simple language
  • Avoids leading questions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What is a problem with Questionnaires?

A

Non-response when people do not respond to the questionnaire.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

What is the Random Response Method?

A

Uses a random event to decide how to answer a question ensuring anonymity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

What is a Pilot Study?

A

A small-scale replica of the study to test the design and methods of the questionnaire.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

What is an Interview?

A

Where you question each person individually, involving specific questions or topics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

What are Outliers?

A

Values that do not fit in with the pattern or trend of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

What does Cleaning Data involve?

A
  • Identifying and correcting/removing incorrect data values or outliers
  • Putting all data in the same format.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

What is a Control Group?

A

Used in an experiment to ensure that the treatment given is causing the experimental results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

What are Matched Pairs?

A

Two groups of equally matched people used to test the effect of a particular factor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

Define Hypothesis.

A

A statement that can be tested by collecting and analysing data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

What are the stages of an Investigation?

A
  • Planning
  • Collecting Data
  • Processing and Representing data
  • Interpreting Results.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

What is the first stage of an investigation?

A

Planning

In this stage, you choose a hypothesis, decide what data to collect (variables), and determine how to record the data (data collection tables).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

What does the collecting data stage involve?

A

Choosing data sources (primary/secondary), collection methods (questionnaire/interviews), and control factors.

This stage is crucial for ensuring accurate and relevant data is gathered for analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

List the stages of an investigation.

A
  • Planning
  • Collecting Data
  • Processing and Representing Data
  • Interpreting Results
  • Evaluating Methods
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

What are databases in the context of data representation?

A

Tables with a collection of data, often secondary data that is available online.

These databases usually contain real-life statistics and are essential for interpreting data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

What is a common inconsistency found in data tables?

A

Percentages do not add up to 100% due to rounding errors.

This is often encountered when individual percentages for columns/rows in tables have been rounded.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

What type of data do two-way tables represent?

A

Bivariate data, which has information in two categories and two variables.

They are useful for analyzing relationships between two different data sets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

What is a pictogram?

A

A representation using pictures or symbols to show a particular amount of data.

It always includes a key to indicate the amount each symbol represents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

What are the key features of simple bar charts?

A
  • Bars are equal width
  • Equal gaps between bars
  • Frequency on y-axis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

What distinguishes multiple bar charts from simple bar charts?

A

They can compare two or more sets of data with more than one bar for each class represented by different colours.

This allows for a clearer comparison between different data categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

How are composite bar charts structured?

A

Single bars split into different sections for each category, used to compare different times/days/years.

The frequency of each component is calculated by subtracting the upper frequency of that component from the lower frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

Define stem and leaf diagrams.

A

A method of organizing data that retains all original data while presenting it simply, showing the shape of the distribution.

Each value is split into a ‘stem’ (first digits) and ‘leaf’ (last digit).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
82
Q

What is the purpose of pie charts?

A

To display data showing how something is shared or divided into categories, with each sector representing a proportion of the total data.

The angles in a pie chart must add up to 360 degrees.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
83
Q

True or False: Comparative pie charts can be used to compare two sets of data of different sizes.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
84
Q

What do population pyramids show?

A

Distribution of ages in a population, either in numbers or proportions/percentages.

They are used to compare two sets of data, usually genders or geographical areas.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
85
Q

What do choropleth maps represent?

A

Geographical areas split into different regions that are shaded based on frequency.

The darker the shading, the higher the frequency for that area.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
86
Q

What is cumulative frequency?

A

A running total of frequencies.

It helps in understanding the total number of occurrences up to a certain point in a dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
87
Q

What is the formula for frequency density in histograms?

A

Frequency Density = Frequency / Class Width

This reflects the concentration of values within each range of the dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
88
Q

How do you estimate the median from cumulative frequency diagrams?

A

Divide total frequency by 2, find that value on the y-axis, draw a horizontal line to the curve, and read off the value from the x-axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
89
Q

What is the formula to calculate frequency density?

A

FD = F/CW

FD stands for Frequency Density, F is Frequency, and CW is Class Width.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
90
Q

What are the steps to draw a histogram?

A
  • Calculate class widths for each class interval
  • Calculate frequency density for each class interval
  • Draw a suitable scale on y-axis labelled frequency density
  • Draw bars using frequency density data

Remember that the bars have no gaps in between.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
91
Q

What is the shape of a distribution?

A

It can be positive, negative, or symmetrical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
92
Q

What is the difference between a histogram and a frequency polygon?

A

A histogram uses bars, while a frequency polygon uses mid-points of class intervals plotted and joined with straight lines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
93
Q

True or False: To compare histograms, they need to have different class intervals.

A

False

They need to have the same class intervals and frequency density scales.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
94
Q

What is the mode in a dataset?

A

The value that appears the most.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
95
Q

How do you find the median in discrete data?

A
  • Put the numbers in order from smallest to largest
  • Find the (n + 1)th value
  • If the position is a decimal, average the two middle values.

n is the total frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
96
Q

What is the formula for the mean?

A

𝑥̅ = ∑𝑥/𝑛

Where 𝑥̅ is the mean, ∑𝑥 is the sum of data values, and 𝑛 is the number of data values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
97
Q

What is a weighted mean?

A

Used to combine different sets of data where one set is more important than another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
98
Q

Fill in the blank: The _______ is the class with the highest frequency.

A

Modal Class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
99
Q

What can cause diagrams to be misleading?

A
  • Shape of the diagram
  • Axes and scales

Examples include scales not starting at zero, missing values, or unevenly scaled axes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
100
Q

How do you estimate the median for grouped continuous data?

A

Use ½ n to find the median position and calculate using cumulative frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
101
Q

What is the geometric mean?

A

The nth root of the product of all the values.

102
Q

What happens to the mean if you add a value greater than the mean?

A

The mean increases.

103
Q

What is the first step to transforming data?

A

Take away the same large number from all the values.

104
Q

True or False: The median will always change if a new value is added.

A

False

The median may stay the same if the added value is equal to the median.

105
Q

What is a common error when drawing frequency polygons?

A

Not using midpoints.

106
Q

What should you do if the median position is a decimal?

A

Find the two values around that position and divide by 2.

107
Q

Fill in the blank: The sum of all values divided by the number of values is called the _______.

108
Q

What is the formula for estimating the median using linear interpolation?

A

Add the lower bound for the class interval to the result of multiplying the frequency for the median class.

109
Q

What happens to the mean if a value greater than the mean is added?

A

The mean increases.

110
Q

What happens to the mean if a value less than the mean is removed?

A

The mean increases.

111
Q

What happens to the mean if a value less than the mean is added?

A

The mean decreases.

112
Q

What happens to the mean if a value greater than the mean is removed?

A

The mean decreases.

113
Q

What is the mode?

A

The value that appears most frequently in the data.

114
Q

List advantages of using the mode.

A
  • Easy to use
  • Always a value in the data
  • Unaffected by extreme values
  • Can be used with quantitative and qualitative data
115
Q

List disadvantages of using the mode.

A
  • There may not be a mode or may be more than one mode
  • Cannot be used to calculate measures of spread
  • Not always representative of the data
116
Q

What is the median?

A

The middle value when the data is ordered.

117
Q

List advantages of using the median.

A
  • Easy to find when data is in order
  • Unaffected by outliers/extreme values
  • Best to use with skewed data
118
Q

List disadvantages of using the median.

A
  • May not be a data value
  • Not always representative of the data
119
Q

What is the mean?

A

The average of all the data values.

120
Q

List advantages of using the mean.

A
  • Uses all the data
  • Can be used to calculate standard deviation and skew
121
Q

List disadvantages of using the mean.

A
  • May not be a data value
  • Always affected by extreme values or outliers
122
Q

What does the range measure?

A

How spread out the data is.

123
Q

How is the range calculated?

A

Range = Largest Value - Smallest Value.

124
Q

What is the Interquartile Range (IQR)?

A

The middle 50% of the data when in order.

125
Q

How is IQR calculated?

A

IQR = Upper Quartile - Lower Quartile.

126
Q

What is the Lower Quartile (LQ)?

A

The value ¼ of the way through the data.

127
Q

What is the Upper Quartile (UQ)?

A

The value ¾ of the way through the data.

128
Q

How do you find LQ and UQ for discrete data?

A

LQ = ¼ (n+1)th value, UQ = ¾ (n+1)th value.

129
Q

What is the Interpercentile Range (IPR)?

A

The difference between two percentiles.

130
Q

What are deciles?

A

Values that divide the data into 10 equal parts.

131
Q

What is the Interdecile Range?

A

The difference between the first and ninth deciles.

132
Q

What does standard deviation (SD) measure?

A

How far all the values are from the mean value.

133
Q

What is the formula for standard deviation?

A

σ = √(1/n ∑(x - x̅)²) or σ = √(∑x²/n - (∑x)²/n²).

134
Q

What is a box plot?

A

A graphical representation of data that shows its distribution.

135
Q

What are the five pieces of information included in a box plot?

A
  • Minimum Value
  • Lower Quartile (LQ)
  • Median
  • Upper Quartile (UQ)
  • Maximum Value
136
Q

What are outliers?

A

Values that are far from the rest of the data.

137
Q

How are outliers identified?

A

Values that are more than 1.5 x IQR above UQ or below LQ.

138
Q

What is skewness?

A

Describes the shape of the distribution and how the data is spread out.

139
Q

True or False: The mean is always affected by extreme values.

140
Q

Fill in the blank: The _______ is the average of all the data values.

141
Q

Fill in the blank: The _______ is the value that appears most frequently in the data.

142
Q

Fill in the blank: The _______ is the middle value when the data is ordered.

143
Q

Fill in the blank: The difference between the largest and smallest values is called the _______.

144
Q

Fill in the blank: The middle 50% of the data is represented by the _______.

145
Q

What is the IQR used for?

A

Measure of spread

IQR stands for Interquartile Range, which measures the middle 50% of data.

146
Q

What does skewness describe?

A

The shape of the distribution and how the data is spread out.

147
Q

What indicates positive skewness?

A

Most values are at the beginning of the data set with few higher values.

148
Q

In positive skewness, what is the relationship between mean, median, and mode?

A

Mean > Median > Mode

149
Q

What indicates negative skewness?

A

Most values are at the end of the data set with few lower values.

150
Q

In negative skewness, what is the relationship between mean, median, and mode?

A

Mean < Median < Mode

151
Q

What signifies a symmetrical distribution?

A

Mean = Median = Mode

152
Q

What does a normal distribution look like on a box plot?

A

Median is halfway between LQ and UQ.

153
Q

How is skewness calculated using a formula?

A

Skewness = 3(mean - median) / standard deviation

154
Q

What does a positive skewness value indicate?

A

Positive skew.

155
Q

What does a negative skewness value indicate?

A

Negative skew.

156
Q

When comparing data sets, what measures should be used?

A

Average (mean/median/mode) and spread (range/IQR/SD) or skewness.

157
Q

What does a lower standard deviation indicate?

A

Values are closer to the mean and therefore similar.

158
Q

What are scatter diagrams used for?

A

To show if there is a relationship between two variables.

159
Q

What is the explanatory variable in a scatter diagram?

A

The independent variable plotted on the x-axis.

160
Q

What is the response variable in a scatter diagram?

A

The dependent variable plotted on the y-axis.

161
Q

What indicates a positive correlation?

A

As one variable increases, so does the other.

162
Q

What indicates a negative correlation?

A

As one variable increases, the other decreases.

163
Q

What is a causal relationship?

A

When one variable causes a change in another.

164
Q

What does the line of best fit (LOBF) represent?

A

A straight line drawn through the middle of the points on a scatter diagram.

165
Q

In the equation of LOBF, what does ‘a’ represent?

A

The gradient.

166
Q

In the equation of LOBF, what does ‘b’ represent?

A

The y-intercept.

167
Q

What is the purpose of interpolation?

A

To make predictions within the range of data given.

168
Q

What is the purpose of extrapolation?

A

To predict values outside of the range of values given.

169
Q

How is Spearman’s Rank Correlation Coefficient (SRCC) calculated?

A

SRCC = 1 - (6 * ∑d²) / (n(n² - 1))

170
Q

What does a value of SRCC near 1 indicate?

A

Strong positive correlation.

171
Q

What does Pearson’s Product Moment Correlation Coefficient (PMCC) measure?

A

The strength of linear correlation between two variables.

172
Q

What is a time series graph used for?

A

To spot trends over time.

173
Q

What is plotted on the x-axis of a time series graph?

174
Q

What is a time series?

A

A set of data collected over a period of time at equal intervals.

175
Q

What is the purpose of time series graphs?

A

To spot trends, usually going up, down, or fluctuating.

176
Q

What does a trend line show?

A

The general trend of the data.

177
Q

What are moving averages?

A

An average worked out for a given number of successive observations.

178
Q

Why are moving averages used?

A

To smooth out fluctuations and make the trend line more accurate.

179
Q

What are seasonal variations?

A

A pattern that repeats at a specific point every cycle.

180
Q

How is seasonal variation calculated?

A

Seasonal Variation = Actual Value - Trend Value.

181
Q

What is the Estimated Mean Seasonal Variation (EMSV)?

A

The average of all the seasonal variations for the same point in each cycle.

182
Q

How can future values be predicted in time series?

A

Using the trend line and estimated mean seasonal variations.

183
Q

What is simple probability?

A

A measure of how likely an event is to happen.

184
Q

How can probabilities be expressed?

A

As fractions, decimals, or percentages.

185
Q

What is an outcome in probability?

A

A possible result of an experiment or trial.

186
Q

What does P(event) represent?

A

The number of successful outcomes divided by the total number of outcomes.

187
Q

What is expected frequency?

A

The number of times you expect an event to happen.

188
Q

How is experimental probability estimated?

A

Using results of previous trials to predict future probabilities.

189
Q

What is risk in probability?

A

The likelihood of a negative event occurring.

190
Q

What are the two types of risk?

A
  • Absolute Risk
  • Relative Risk
191
Q

What does a sample space represent?

A

A list of all the possible outcomes.

192
Q

What is a sample space diagram?

A

A table used to represent the outcomes of two events.

193
Q

What is a Venn diagram?

A

Uses overlapping circles to represent all outcomes of two or three events.

194
Q

What are mutually exclusive events?

A

Events that cannot happen at the same time.

195
Q

What is the addition law in probability?

A

Used for events that are not mutually exclusive and can happen together.

196
Q

What are independent events?

A

Events where the outcome of one does not affect the outcome of the other.

197
Q

Fill in the blank: The formula for the probability of two mutually exclusive events A and B is P(A or B) = P(A) + P(B). What is the additional component for non-mutually exclusive events? _______

A

− P(A and B)

198
Q

What are independent events?

A

Unconnected events where the outcome of one does not affect the other

Example: Flipping a coin and rolling a dice.

199
Q

What is the Multiplication Law for independent events A and B?

A

P(A and B) = P(A) × P(B)

For 3 independent events A, B, and C: P(A and B and C) = P(A) × P(B) × P(C)

200
Q

How do you calculate P(at least 1)?

A

P(at least 1) = 1 - P(none)

This formula helps determine the probability of at least one occurrence.

201
Q

What do tree diagrams represent?

A

Each branch shows an outcome and probabilities on branches add up to 1

Multiply along the branches for end results and add probabilities down columns.

202
Q

What happens to the denominator in a tree diagram with replacement?

A

The denominator stays the same for the second set of branches

The question indicates if the item has been replaced.

203
Q

What is conditional probability?

A

The probability of one event affecting the chances of another

Example: Taking a white ball first changes the probability of the second draw.

204
Q

What notation is used for conditional probability?

A

P(B | A)

It represents the probability of B given that A has happened.

205
Q

What is the formula for conditional probability?

A

P(B | A) = P(A and B) / P(A)

This can also be used to test if two events are independent.

206
Q

What are simple index numbers used for?

A

To compare price changes over time

They compare the price change of an item with its base year price.

207
Q

What does an index number greater than 100 indicate?

A

The value has increased

An index number less than 100 indicates a decrease.

208
Q

What does the Retail Price Index (RPI) measure?

A

The rate of change of prices of everyday goods

RPI is calculated monthly by comparing prices to the same month of the previous year.

209
Q

What is the Consumer Price Index (CPI)?

A

Official measure of inflation used by the UK Government

It does not include mortgage payments and is weighted to reflect consumer spending.

210
Q

What does Gross Domestic Product (GDP) represent?

A

The value of goods and services produced in a country in a given time

A fall in GDP for two successive quarters indicates a recession.

211
Q

What are weighted index numbers?

A

They take into account proportions similar to the weighted mean

Weightings reflect the importance of different items.

212
Q

What do chain base index numbers compare?

A

Prices from each year with that of the previous year

They show how values change from year to year.

213
Q

What are crude rates?

A

Rates that tell how many times a particular event occurs per 1000 of the population

Examples include crude birth and death rates.

214
Q

What is the formula for calculating crude rates?

A

Crude Rate = (number of births/deaths / total population) × 1000

Crude rates can be misleading when comparing different age distributions.

215
Q

What is a standard population?

A

A hypothetical population of 1000 used to represent the whole population

It takes into account age, gender, and income distributions.

216
Q

What does a standardized rate allow you to do?

A

Compare the same age group in different populations

It uses the standard population for realistic comparisons.

217
Q

What is a probability distribution?

A

A list of all possible outcomes with their expected probabilities

Example: Flipping a fair coin results in heads or tails.

218
Q

What is a binomial distribution?

A

A type of probability distribution with only two possible outcomes

Examples include flipping a coin (heads or tails) or rolling a six (success or failure).

219
Q

What conditions must be met for a binomial distribution?

A
  • Fixed number of trials (n)
  • Each trial has 2 outcomes (success or failure)
  • Trials are independent
  • Probability of success is constant

If these conditions are met, the binomial distribution is applicable.

220
Q

How do you find probabilities using the binomial distribution?

A

Use (p + q)^n and identify the outcomes and their probabilities

Expand (p + q)^n where n is the number of trials.

221
Q

What is Pascal’s Triangle used for?

A

To find coefficients of a binomial distribution

The coefficients follow the pattern of Pascal’s triangle.

222
Q

What is the probability of landing on 6 three times?

A

10 × (X Heads) × (X Tails)

P(x) = ½ for Heads and ½ for Tails

223
Q

What pattern do the coefficients of a binomial distribution follow?

A

Pascal’s triangle

224
Q

What is the first row of Pascal’s triangle?

225
Q

How do you find the numbers in Pascal’s triangle?

A

By adding the 2 numbers directly above

226
Q

What is the expansion of (p + q)^4?

A

1p^4 + 4p^3q^1 + 6p^2q^2 + 4pq^3 + 1q^4

227
Q

What does the nCr button on a calculator represent?

A

N=number of trials and r=number of successes

228
Q

How do you calculate the coefficient for 5 trials with 3 successes using nCr?

A

Type ‘5’, ‘nCr’, ‘3’, ‘=’ to get 10

229
Q

To find a range of probabilities, what should you do?

A

Work out their individual probabilities and then add them up

230
Q

How do you calculate the probability of ‘at least 1 success’?

A

Work out the probability of 0 successes and subtract from 1

231
Q

What is the mean (or expected value) of the binomial distribution B(n, p)?

232
Q

For B(6, ½), what is the mean?

233
Q

What shape does a normal distribution curve have?

A

Bell-shaped

234
Q

What does a larger standard deviation result in for a normal distribution curve?

A

A lower curve

235
Q

What is the notation for a normal distribution?

A

N(μ, σ²)

236
Q

What do μ and σ² represent in the normal distribution notation?

A

μ = mean, σ² = variance

237
Q

What are the conditions for normal distribution?

A
  • Data is continuous
  • Distribution is symmetrical
  • Mode, median, and mean are approximately equal
238
Q

Approximately what percentage of data values lie within 1 standard deviation of the mean?

239
Q

Approximately what percentage of data values lie within 2 standard deviations of the mean?

240
Q

Approximately what percentage of data values lie within 3 standard deviations of the mean?

241
Q

For a data set with mean=30 and SD=3, what is the range for 68% of the sample?

242
Q

How do you sketch a normal distribution?

A

Draw a bell-shaped curve centered on the mean and ending at 3 SD from the mean

243
Q

What is the formula to calculate the number of SDs from the mean?

A

(value - mean) / standard deviation

244
Q

What is the purpose of standardised scores?

A

To compare how far above or below the average individual values are

245
Q

What does a positive standardised score indicate?

A

The value is above the mean

246
Q

What does a negative standardised score indicate?

A

The value is below the mean

247
Q

What does a standardised score of zero indicate?

A

The value is equal to the mean

248
Q

What does quality assurance involve?

A

Checking samples to ensure products are of the same quality and standard

249
Q

What is a control chart?

A

A time series chart used for quality assurance

250
Q

What are the lines on a control chart?

A
  • Target Value (middle line)
  • Upper and Lower Warning Lines (inner 2 lines)
  • Upper and Lower Action Limits (outer 2 lines)
251
Q

What happens if a sample average/range is above/below the warning line?

A

Another sample is taken and checked for problems

252
Q

What happens if a sample average/range is outside the action limits?

A

Production is stopped immediately and machinery is reset