Chapter 1: Collection of Data Flashcards
What is raw data?
Unprocessed data that just been collected and needs to be ordered, grouped, rounded, cleaned.
Define qualitative data.
Non-numerical, descriptive data such as eye/hair colour or gender.
Define quantitative data.
Numerical data that can be measured with numbers.
What is discrete data?
Data that only takes particular values, such as shoe size or number of people.
What is continuous data?
Data that can take any value, e.g. height, weight.
Define categorical data.
Data that can be sorted into non-overlapping categories, such as gender.
What is ordinal data?
Quantitative data that can be given an order or ranked on a rating scale.
What does bivariate data involve?
Measuring 2 variables, which can be qualitative or quantitative.
What is multivariate data?
Data made up of more than 2 variables.
What is the purpose of grouping data?
Makes it easier to spot patterns and quickly see how the data is distributed.
What is a primary data source?
Data that you have collected yourself or someone has collected on your behalf.
What is a secondary data source?
Data that has already been collected.
Define population in the context of data collection.
Everyone or everything that could be involved in the investigation.
What is a census?
A survey of the entire population.
What is a sample?
A smaller number from the population that you actually survey.
What is a sampling frame?
A list of all the members of the population from which the sample is chosen.
Define biased sample.
A sample that does not represent the population fairly.
What is random sampling?
Every item/person in the population has an equal chance of being selected.
What is stratified sampling?
The size of each strata in the sample is in proportion to the sizes of strata in the population.
What is systematic sampling?
Choosing items in the population at regular intervals.
Define cluster sampling.
The population is divided into natural groups, groups are chosen at random, and every member of those groups is sampled.
What is quota sampling?
Population is grouped by characteristics and a fixed amount is sampled from every group.
What is opportunity sampling?
Using the people/items that are available at the time.
Define judgement sampling.
When the researcher uses their own judgement to select a sample they think will represent the population.
What is Petersen Capture-Recapture used for?
To estimate the size of large or moving populations where it would be impossible to count the entire population.
What are extraneous variables?
Variables you are not interested in but that could affect the result of your experiment.
What is the explanatory (independent) variable?
The variable that is changed in an experiment.
What is the response (dependent) variable?
The variable that is measured in an experiment.
What is an Explanatory (Independent) Variable?
The variable that is changed.
This variable is manipulated to observe its effect on the dependent variable.
What is a Response (Dependent) Variable?
The variable that is measured.
This variable responds to changes in the independent variable.
What are Extraneous Variables?
Variables you are not interested in but that could affect the result of your experiment.
These variables can introduce noise into the results.
What are Laboratory Experiments?
Experiments where the researcher has full control over variables, conducted in a lab or similar environment.
Example: measuring reaction times of people of different ages.
What is an example of an Explanatory Variable in a laboratory experiment measuring reaction times?
Age.
The response variable in this case would be reaction time.
What are the advantages of Laboratory Experiments?
- Easy to replicate
- Extraneous variables can be controlled
- Results are more likely to be valid
Controlled conditions enhance the reliability of results.
What are the disadvantages of Laboratory Experiments?
People may behave differently under test conditions than they would under real-life conditions.
This could affect the validity of results.
What are Field Experiments?
Experiments carried out in the everyday environment where the researcher has some control over the variables.
Example: Testing new methods of revision.
What is an example of an Explanatory Variable in a field experiment testing new methods of revision?
Method of revision.
The response variable would be results in the exam.
What are the advantages of Field Experiments?
More accurate; reflects real-life behavior.
These experiments tend to have higher ecological validity.
What are the disadvantages of Field Experiments?
- Cannot control extraneous variables
- Not as easy to replicate
This can lead to less reliable results compared to laboratory experiments.
What are Natural Experiments?
Experiments carried out in the everyday environment where the researcher has no or very little control over the variables.
Example: The effect of education on level of income.
What is an example of an Explanatory Variable in a natural experiment studying education and income?
Level of education.
The response variable would be income.
What are the advantages of Natural Experiments?
Reflects real-life behavior.
These experiments can provide insights into phenomena that are difficult to study in controlled settings.
What are the disadvantages of Natural Experiments?
- Low validity
- Difficult to replicate
Extraneous variables are not controlled, which may affect results.
What is a Simulation in research?
A way to model random events using random numbers and previously collected data.
Simulations help predict potential real-life outcomes.
What are the steps involved in conducting a simulation?
- Choose a suitable method for random numbers
- Assign numbers to data
- Generate random numbers
- Match random numbers to outcomes
This process can help simulate customer choices, for example.
What is a Questionnaire?
A set of questions used to obtain data from the population/sample.
Can be administered via post, email, phone, or face-to-face.
What are Open Questions?
Questions that allow any answer but are difficult to analyze.
They provide qualitative data but can complicate data analysis.
What are Closed Questions?
Questions with a fixed number of non-overlapping option boxes that allow for specific answers.
These questions make data easier to analyze.
What are features of a good questionnaire?
- Easy to understand
- Uses simple language
- Avoids leading questions
- Relevant to the investigation
- Includes a time frame/unit
- Non-overlapping, exhaustive option boxes
- Not offensive/personal
- Easy to analyze results
These features enhance the quality and reliability of data collected.
What is Non-response in questionnaires?
When people in the sample do not respond to the questionnaire.
This can be due to various factors, including lack of interest or understanding.
What is the Random Response Method?
Uses a random event to decide how to answer a question, ensuring anonymity.
This method helps reduce bias in sensitive questions.
What is a Pilot Study?
A small-scale replica of the study to test the design and methods of the questionnaire.
Helps identify unclear questions and estimate response rates.
What are Outliers in data collection?
Values that do not fit in with the pattern or trend of the data.
Outliers can be extreme values or incorrectly recorded data.
What is Cleaning Data?
Fixing problems with the data by identifying and correcting/removing incorrect values or outliers.
This process ensures the integrity of the dataset.
What are Control Groups?
Groups used in an experiment to ensure that the treatment is causing the experimental results.
Control groups do not receive the treatment being tested.
What are Matched Pairs in experiments?
Two groups of equally matched people used to test the effect of a particular factor.
This method helps control for unwanted variables.
What is a Hypothesis?
A statement that can be tested by collecting and analyzing data.
It serves as a foundation for an investigation.
What are the stages of an Investigation?
- Planning
- Collecting Data
- Processing and Representing Data
- Interpreting Results
- Evaluating
Each stage is crucial for conducting a thorough and valid investigation.
What is the first step in the data collection process?
Choose a hypothesis
A hypothesis is a statement that can be tested through data collection and analysis.
What do you need to determine when choosing a hypothesis?
What data to collect (variables)
Variables are the elements that can be measured or controlled in a study.
What is a key component of organizing data collection?
How you will record data (data collection tables)
Data collection tables help in systematically organizing the data collected.
What types of data sources can be chosen for data collection?
Primary and secondary sources
Primary sources are original data collected firsthand, while secondary sources are analyses or interpretations of primary data.
What are some common methods for data collection?
Questionnaire and interviews
Questionnaires are structured forms for data collection, while interviews involve direct conversation with subjects.
What should be controlled during data collection?
Control factors
Control factors are variables that are kept constant to ensure that the results are due to the variable being tested.
What is involved in processing and representing data?
Choosing diagrams and calculations
Diagrams can include charts or graphs that visually represent data, while calculations provide quantitative analysis.
What is the purpose of interpreting results?
Drawing conclusions from the results of the diagrams and conclusions
Interpretation involves analyzing the data and diagrams to understand their implications.
What is evaluated after data collection?
Evaluating methods
This involves assessing the effectiveness of data collection methods and planning.
What aspects of data collection methods should be evaluated?
Strengths and weaknesses
Understanding strengths helps reinforce effective practices, while recognizing weaknesses allows for improvement.
True or False: The choice of diagrams and calculations is irrelevant to data processing.
False
Choosing appropriate diagrams and calculations is crucial for accurate data representation and analysis.