Everything Flashcards

Question 1

Q

What is raw data?

Answer

A

Unprocessed data that has just been collected and needs to be ordered, grouped, rounded, or cleaned.

Question 2

Q

Define qualitative data.

Answer

A

Non-numerical, descriptive data such as eye/hair colour or gender.

Question 3

Q

What type of data is easier to analyze, qualitative or quantitative?

Answer

A

Quantitative data.

Question 4

Q

Give an example of quantitative data.

Answer

A

Height, weights, marks in an exam.

Question 5

Q

What is discrete data?

Answer

A

Data that only takes particular values, such as shoe size or number of people.

Question 6

Q

What is continuous data?

Answer

A

Data that can take any value, such as height or weight.

Question 7

Q

Define categorical data.

Answer

A

Data that can be sorted into non-overlapping categories, such as gender.

Question 8

Q

What is ordinal data?

Answer

A

Quantitative data that can be given an order or ranked on a rating scale.

Question 9

Q

What does bivariate data involve?

Answer

A

Measuring two variables, which can be qualitative or quantitative.

Question 10

Q

What is multivariate data?

Answer

A

Data made up of more than two variables.

Question 11

Q

What is the purpose of grouping data?

Answer

A

To make it easier to spot patterns and see how the data is distributed.

Question 12

Q

True or False: Discrete data can be grouped into overlapping classes.

Question 13

Q

What is a primary data source?

Answer

A

Data that you have collected yourself or someone has collected on your behalf.

Question 14

Q

Define secondary data.

Answer

A

Data that has already been collected.

Question 15

Q

What is a population in statistics?

Answer

A

Everyone or everything that could be involved in the investigation.

Question 16

Q

What is a census?

Answer

A

A survey of the entire population.

Question 17

Q

Fill in the blank: A _______ is a smaller number from the population that you actually survey.

Question 18

Q

What is a sampling frame?

Answer

A

A list of all the members of the population.

Question 19

Q

What is a biased sample?

Answer

A

A sample that does not represent the population fairly.

Question 20

Q

Define random sampling.

Answer

A

Every item/person in the population has an equal chance of being selected.

Question 21

Q

What is stratified sampling?

Answer

A

Sampling where the size of each group in the sample is in proportion to the sizes of those groups in the population.

Question 22

Q

What is systematic sampling?

Answer

A

Choosing items in the population at regular intervals.

Question 23

Q

Define cluster sampling.

Answer

A

The population is divided into natural groups, and groups are chosen at random with every member sampled.

Question 24

Q

What is quota sampling?

Answer

A

Population is grouped by characteristics and a fixed amount is sampled from every group.

Question 25

Q

Fill in the blank: Opportunity sampling uses the people/items that are _______.

Answer

A

Available at the time.

Question 26

Q

What is judgment sampling?

Answer

A

When the researcher uses their own judgment to select a sample they think will represent the population.

Question 27

Q

What does the Petersen Capture-Recapture method estimate?

Answer

A

The size of large or moving populations.

Question 28

Q

What is an explanatory variable?

Answer

A

The variable that is changed in an experiment.

Question 29

Q

Define response variable.

Answer

A

The variable that is measured in an experiment.

Question 30

Q

What is a sample size?

Answer

A

Size large enough and representative of the population.

Question 31

Q

What is an experiment?

Answer

A

Used when a researcher examines how changes in one variable affect another.

Question 32

Q

Define Explanatory (Independent) Variable.

Answer

A

The variable that is changed.

Question 33

Q

Define Response (Dependent) Variable.

Answer

A

The variable that is measured.

Question 34

Q

What are Extraneous Variables?

Answer

A

Variables not of interest but that could affect the result of your experiment.

Question 35

Q

What characterizes Laboratory Experiments?

Answer

A

Researcher has full control over variables; conducted in a lab or similar environment.

Question 36

Q

Give an example of a Laboratory Experiment.

Answer

A

Measuring reaction times of people of different ages.

Question 37

Q

What is the Explanatory variable in the laboratory example?

Question 38

Q

What is the Response variable in the laboratory example?

Answer

A

Reaction time.

Question 39

Q

List some Extraneous variables in laboratory experiments.

Answer

A

Gender
Health condition
Fitness level.

Question 40

Q

What are advantages of Laboratory Experiments?

Answer

A

Easy to replicate
Extraneous variables can be controlled.

Question 41

Q

What is a disadvantage of Laboratory Experiments?

Answer

A

People may behave differently under test conditions than in real life.

Question 42

Q

What are Field Experiments?

Answer

A

Carried out in the everyday environment with some control over variables.

Question 43

Q

Give an example of a Field Experiment.

Answer

A

Testing new methods of revision.

Question 44

Q

What is the Explanatory variable in the field experiment example?

Answer

A

Method of revision.

Question 45

Q

What is the Response variable in the field experiment example?

Answer

A

Results in exam.

Question 46

Q

List some Extraneous variables in field experiments.

Answer

A

Amount of revision pupils do
Ability of pupils.

Question 47

Q

What are the advantages of Field Experiments?

Answer

A

More accurate; reflects real life behaviour.

Question 48

Q

What is a disadvantage of Field Experiments?

Answer

A

Cannot control extraneous variables.

Question 49

Q

What are Natural Experiments?

Answer

A

Carried out in everyday environments with little control over variables.

Question 50

Q

Give an example of a Natural Experiment.

Answer

A

The effect of education on level of income.

Question 51

Q

What is the Explanatory variable in the natural experiment example?

Answer

A

Level of education.

Question 52

Q

What is the Response variable in the natural experiment example?

Question 53

Q

List some Extraneous variables in natural experiments.

Answer

A

IQ
Other skills individuals may have
Personal circumstances.

Question 54

Q

What is an advantage of Natural Experiments?

Answer

A

Reflects real life behaviour.

Question 55

Q

What are disadvantages of Natural Experiments?

Answer

A

Low validity
Difficult to replicate.

Question 56

Q

What is a Simulation?

Answer

A

A way to model random events using random numbers and previously collected data.

Question 57

Q

What are the steps in conducting a Simulation?

Answer

A

Choose a suitable method for getting random numbers
Assign numbers to the data
Generate random numbers
Match random numbers to outcomes.

Question 58

Q

What is a Questionnaire?

Answer

A

A set of questions used to obtain data from the population/sample.

Question 59

Q

What types of questions can be included in a Questionnaire?

Answer

A

Open questions
Closed questions.

Question 60

Q

What are features of a good questionnaire?

Answer

A

Easy to understand
Uses simple language
Avoids leading questions.

Question 61

Q

What is a problem with Questionnaires?

Answer

A

Non-response when people do not respond to the questionnaire.

Question 62

Q

What is the Random Response Method?

Answer

A

Uses a random event to decide how to answer a question ensuring anonymity.

Question 63

Q

What is a Pilot Study?

Answer

A

A small-scale replica of the study to test the design and methods of the questionnaire.

Question 64

Q

What is an Interview?

Answer

A

Where you question each person individually, involving specific questions or topics.

Answer 61

A

Values that do not fit in with the pattern or trend of the data.

Answer 62

A

Identifying and correcting/removing incorrect data values or outliers
Putting all data in the same format.

Answer 63

A

Used in an experiment to ensure that the treatment given is causing the experimental results.

Answer 64

A

Two groups of equally matched people used to test the effect of a particular factor.

Answer 65

A

A statement that can be tested by collecting and analysing data.

Answer 66

A

Planning
Collecting Data
Processing and Representing data
Interpreting Results.

Answer 67

A

Planning

In this stage, you choose a hypothesis, decide what data to collect (variables), and determine how to record the data (data collection tables).

Answer 68

A

Choosing data sources (primary/secondary), collection methods (questionnaire/interviews), and control factors.

This stage is crucial for ensuring accurate and relevant data is gathered for analysis.

Answer 69

A

Planning
Collecting Data
Processing and Representing Data
Interpreting Results
Evaluating Methods

Answer 70

A

Tables with a collection of data, often secondary data that is available online.

These databases usually contain real-life statistics and are essential for interpreting data.

Answer 71

A

Percentages do not add up to 100% due to rounding errors.

This is often encountered when individual percentages for columns/rows in tables have been rounded.

Answer 72

A

Bivariate data, which has information in two categories and two variables.

They are useful for analyzing relationships between two different data sets.

Answer 73

A

A representation using pictures or symbols to show a particular amount of data.

It always includes a key to indicate the amount each symbol represents.

Answer 74

A

Bars are equal width
Equal gaps between bars
Frequency on y-axis

Answer 75

A

They can compare two or more sets of data with more than one bar for each class represented by different colours.

This allows for a clearer comparison between different data categories.

Answer 76

A

Single bars split into different sections for each category, used to compare different times/days/years.

The frequency of each component is calculated by subtracting the upper frequency of that component from the lower frequency.

Answer 77

A

A method of organizing data that retains all original data while presenting it simply, showing the shape of the distribution.

Each value is split into a ‘stem’ (first digits) and ‘leaf’ (last digit).

Answer 78

A

To display data showing how something is shared or divided into categories, with each sector representing a proportion of the total data.

The angles in a pie chart must add up to 360 degrees.

Answer 79

A

Distribution of ages in a population, either in numbers or proportions/percentages.

They are used to compare two sets of data, usually genders or geographical areas.

Answer 80

A

Geographical areas split into different regions that are shaded based on frequency.

The darker the shading, the higher the frequency for that area.

Answer 81

A

A running total of frequencies.

It helps in understanding the total number of occurrences up to a certain point in a dataset.

Answer 82

A

Frequency Density = Frequency / Class Width

This reflects the concentration of values within each range of the dataset.

Answer 83

A

Divide total frequency by 2, find that value on the y-axis, draw a horizontal line to the curve, and read off the value from the x-axis.

Answer 84

A

FD = F/CW

FD stands for Frequency Density, F is Frequency, and CW is Class Width.

Answer 85

A

Calculate class widths for each class interval
Calculate frequency density for each class interval
Draw a suitable scale on y-axis labelled frequency density
Draw bars using frequency density data

Remember that the bars have no gaps in between.

Answer 86

A

It can be positive, negative, or symmetrical.

Answer 87

A

A histogram uses bars, while a frequency polygon uses mid-points of class intervals plotted and joined with straight lines.

Answer 88

A

False

They need to have the same class intervals and frequency density scales.

Answer 89

A

The value that appears the most.

Answer 90

A

Put the numbers in order from smallest to largest
Find the (n + 1)th value
If the position is a decimal, average the two middle values.

n is the total frequency.

Answer 91

A

𝑥̅ = ∑𝑥/𝑛

Where 𝑥̅ is the mean, ∑𝑥 is the sum of data values, and 𝑛 is the number of data values.

Answer 92

A

Used to combine different sets of data where one set is more important than another.

Answer 93

A

Modal Class

Answer 94

A

Shape of the diagram
Axes and scales

Examples include scales not starting at zero, missing values, or unevenly scaled axes.

Answer 95

A

Use ½ n to find the median position and calculate using cumulative frequency.

Answer 96

A

The nth root of the product of all the values.

Answer 97

A

The mean increases.

Answer 98

A

Take away the same large number from all the values.

Answer 99

A

False

The median may stay the same if the added value is equal to the median.

Answer 100

A

Not using midpoints.

Answer 101

A

Find the two values around that position and divide by 2.

Answer 102

A

Add the lower bound for the class interval to the result of multiplying the frequency for the median class.

Answer 103

A

The mean increases.

Answer 104

A

The mean increases.

Answer 105

A

The mean decreases.

Answer 106

A

The mean decreases.

Answer 107

A

The value that appears most frequently in the data.

Answer 108

A

Easy to use
Always a value in the data
Unaffected by extreme values
Can be used with quantitative and qualitative data

Answer 109

A

There may not be a mode or may be more than one mode
Cannot be used to calculate measures of spread
Not always representative of the data

Answer 110

A

The middle value when the data is ordered.

Answer 111

A

Easy to find when data is in order
Unaffected by outliers/extreme values
Best to use with skewed data

Answer 112

A

May not be a data value
Not always representative of the data

Answer 113

A

The average of all the data values.

Answer 114

A

Uses all the data
Can be used to calculate standard deviation and skew

Answer 115

A

May not be a data value
Always affected by extreme values or outliers

Answer 116

A

How spread out the data is.

Answer 117

A

Range = Largest Value - Smallest Value.

Answer 118

A

The middle 50% of the data when in order.

Answer 119

A

IQR = Upper Quartile - Lower Quartile.

Answer 120

A

The value ¼ of the way through the data.

Answer 121

A

The value ¾ of the way through the data.

Answer 122

A

LQ = ¼ (n+1)th value, UQ = ¾ (n+1)th value.

Answer 123

A

The difference between two percentiles.

Answer 124

A

Values that divide the data into 10 equal parts.

Answer 125

A

The difference between the first and ninth deciles.

Answer 126

A

How far all the values are from the mean value.

Answer 127

A

σ = √(1/n ∑(x - x̅)²) or σ = √(∑x²/n - (∑x)²/n²).

Answer 128

A

A graphical representation of data that shows its distribution.

Answer 129

A

Minimum Value
Lower Quartile (LQ)
Median
Upper Quartile (UQ)
Maximum Value

Answer 130

A

Values that are far from the rest of the data.

Answer 131

A

Values that are more than 1.5 x IQR above UQ or below LQ.

Answer 132

A

Describes the shape of the distribution and how the data is spread out.

Answer 133

A

Measure of spread

IQR stands for Interquartile Range, which measures the middle 50% of data.

Answer 134

A

The shape of the distribution and how the data is spread out.

Answer 135

A

Most values are at the beginning of the data set with few higher values.

Answer 136

A

Mean > Median > Mode

Answer 137

A

Most values are at the end of the data set with few lower values.

Answer 138

A

Mean < Median < Mode

Answer 139

A

Mean = Median = Mode

Answer 140

A

Median is halfway between LQ and UQ.

Answer 141

A

Skewness = 3(mean - median) / standard deviation

Answer 142

A

Positive skew.

Answer 143

A

Negative skew.

Answer 144

A

Average (mean/median/mode) and spread (range/IQR/SD) or skewness.

Answer 145

A

Values are closer to the mean and therefore similar.

Answer 146

A

To show if there is a relationship between two variables.

Answer 147

A

The independent variable plotted on the x-axis.

Answer 148

A

The dependent variable plotted on the y-axis.

Answer 149

A

As one variable increases, so does the other.

Answer 150

A

As one variable increases, the other decreases.

Answer 151

A

When one variable causes a change in another.

Answer 152

A

A straight line drawn through the middle of the points on a scatter diagram.

Answer 153

A

The gradient.

Answer 154

A

The y-intercept.

Answer 155

A

To make predictions within the range of data given.

Answer 156

A

To predict values outside of the range of values given.

Answer 157

A

SRCC = 1 - (6 * ∑d²) / (n(n² - 1))

Answer 158

A

Strong positive correlation.

Answer 159

A

The strength of linear correlation between two variables.

Answer 160

A

To spot trends over time.

Answer 161

A

A set of data collected over a period of time at equal intervals.

Answer 162

A

To spot trends, usually going up, down, or fluctuating.

Answer 163

A

The general trend of the data.

Answer 164

A

An average worked out for a given number of successive observations.

Answer 165

A

To smooth out fluctuations and make the trend line more accurate.

Answer 166

A

A pattern that repeats at a specific point every cycle.

Answer 167

A

Seasonal Variation = Actual Value - Trend Value.

Answer 168

A

The average of all the seasonal variations for the same point in each cycle.

Answer 169

A

Using the trend line and estimated mean seasonal variations.

Answer 170

A

A measure of how likely an event is to happen.

Answer 171

A

As fractions, decimals, or percentages.

Answer 172

A

A possible result of an experiment or trial.

Answer 173

A

The number of successful outcomes divided by the total number of outcomes.

Answer 174

A

The number of times you expect an event to happen.

Answer 175

A

Using results of previous trials to predict future probabilities.

Answer 176

A

The likelihood of a negative event occurring.

Answer 177

A

Absolute Risk
Relative Risk

Answer 178

A

A list of all the possible outcomes.

Answer 179

A

A table used to represent the outcomes of two events.

Answer 180

A

Uses overlapping circles to represent all outcomes of two or three events.

Answer 181

A

Events that cannot happen at the same time.

Answer 182

A

Used for events that are not mutually exclusive and can happen together.

Answer 183

A

Events where the outcome of one does not affect the outcome of the other.

Answer 184

A

− P(A and B)

Answer 185

A

Unconnected events where the outcome of one does not affect the other

Example: Flipping a coin and rolling a dice.

Answer 186

A

P(A and B) = P(A) × P(B)

For 3 independent events A, B, and C: P(A and B and C) = P(A) × P(B) × P(C)

Answer 187

A

P(at least 1) = 1 - P(none)

This formula helps determine the probability of at least one occurrence.

Answer 188

A

Each branch shows an outcome and probabilities on branches add up to 1

Multiply along the branches for end results and add probabilities down columns.

Answer 189

A

The denominator stays the same for the second set of branches

The question indicates if the item has been replaced.

Answer 190

A

The probability of one event affecting the chances of another

Example: Taking a white ball first changes the probability of the second draw.

Answer 191

A

P(B | A)

It represents the probability of B given that A has happened.

Answer 192

A

P(B | A) = P(A and B) / P(A)

This can also be used to test if two events are independent.

Answer 193

A

To compare price changes over time

They compare the price change of an item with its base year price.

Answer 194

A

The value has increased

An index number less than 100 indicates a decrease.

Answer 195

A

The rate of change of prices of everyday goods

RPI is calculated monthly by comparing prices to the same month of the previous year.

Answer 196

A

Official measure of inflation used by the UK Government

It does not include mortgage payments and is weighted to reflect consumer spending.

Answer 197

A

The value of goods and services produced in a country in a given time

A fall in GDP for two successive quarters indicates a recession.

Answer 198

A

They take into account proportions similar to the weighted mean

Weightings reflect the importance of different items.

Answer 199

A

Prices from each year with that of the previous year

They show how values change from year to year.

Answer 200

A

Rates that tell how many times a particular event occurs per 1000 of the population

Examples include crude birth and death rates.

Answer 201

A

Crude Rate = (number of births/deaths / total population) × 1000

Crude rates can be misleading when comparing different age distributions.

Answer 202

A

A hypothetical population of 1000 used to represent the whole population

It takes into account age, gender, and income distributions.

Answer 203

A

Compare the same age group in different populations

It uses the standard population for realistic comparisons.

Answer 204

A

A list of all possible outcomes with their expected probabilities

Example: Flipping a fair coin results in heads or tails.

Answer 205

A

A type of probability distribution with only two possible outcomes

Examples include flipping a coin (heads or tails) or rolling a six (success or failure).

Answer 206

A

Fixed number of trials (n)
Each trial has 2 outcomes (success or failure)
Trials are independent
Probability of success is constant

If these conditions are met, the binomial distribution is applicable.

Answer 207

A

Use (p + q)^n and identify the outcomes and their probabilities

Expand (p + q)^n where n is the number of trials.

Answer 208

A

To find coefficients of a binomial distribution

The coefficients follow the pattern of Pascal’s triangle.

Answer 209

A

10 × (X Heads) × (X Tails)

P(x) = ½ for Heads and ½ for Tails

Answer 210

A

Pascal’s triangle

Answer 211

A

By adding the 2 numbers directly above

Answer 212

A

1p^4 + 4p^3q^1 + 6p^2q^2 + 4pq^3 + 1q^4

Answer 213

A

N=number of trials and r=number of successes

Answer 214

A

Type ‘5’, ‘nCr’, ‘3’, ‘=’ to get 10

Answer 215

A

Work out their individual probabilities and then add them up

Answer 216

A

Work out the probability of 0 successes and subtract from 1

Answer 217

A

Bell-shaped

Answer 218

A

A lower curve

Answer 219

A

N(μ, σ²)

Answer 220

A

μ = mean, σ² = variance

Answer 221

A

Data is continuous
Distribution is symmetrical
Mode, median, and mean are approximately equal

Answer 222

A

Draw a bell-shaped curve centered on the mean and ending at 3 SD from the mean

Answer 223

A

(value - mean) / standard deviation

Answer 224

A

To compare how far above or below the average individual values are

Answer 225

A

The value is above the mean

Answer 226

A

The value is below the mean

Answer 227

A

The value is equal to the mean

Answer 228

A

Checking samples to ensure products are of the same quality and standard

Answer 229

A

A time series chart used for quality assurance

Answer 230

A

Target Value (middle line)
Upper and Lower Warning Lines (inner 2 lines)
Upper and Lower Action Limits (outer 2 lines)

Answer 231

A

Another sample is taken and checked for problems

Answer 232

A

Production is stopped immediately and machinery is reset