Exploring Data (Week 1 - Section 1) Flashcards

Data And Visualization

1
Q

What are the different levels of measurement in statistics?

A

The different levels of measurement in statistics are nominal, ordinal, interval, and ratio.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Explain the difference between a categorical variable and a quantitative variable.

A

A categorical variable is a variable that represents different categories or groups, while a quantitative variable is a variable that represents numerical values or quantities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the difference between a discrete variable and a continuous variable?

A

A discrete variable is a variable that can only take on specific, separate values, while a continuous variable is a variable that can take on any value within a certain range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Give an example of a nominal variable and explain why it is considered nominal.

A

An example of a nominal variable is the color of a car. It is considered nominal because the different colors (e.g., red, blue, green) do not have any inherent order or numerical value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the difference between the ordinal level and the interval level of measurement?

A

The ordinal level of measurement has categories with an inherent order, but the intervals between the categories may not be equal. The interval level of measurement has categories with an inherent order, and the intervals between the categories are equal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain why it is important to distinguish between different levels of measurement when analyzing data.

A

It is important to distinguish between different levels of measurement because the appropriate statistical methods and analyses depend on the level of measurement. Different levels of measurement require different types of calculations and interpretations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Can an ordinal variable be treated as a quantitative variable? Why or why not?

A

In some cases, an ordinal variable can be treated as a quantitative variable if the scale has a sufficient number of categories and the intervals between the categories are considered to be equal. However, this is a matter of debate among statisticians.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Give an example of a quantitative variable and explain whether it is discrete or continuous.

A

An example of a quantitative variable is the height of individuals. It can be measured in inches or centimeters and can take on any value within a certain range. It is considered a continuous variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between a variable and a case in statistics?

A

In statistics, a variable is a characteristic or attribute that can vary among different cases. A case, on the other hand, refers to an individual or object that is being studied or analyzed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can improving your knowledge of statistics make you an expert in a particular field, such as football?

A

Improving your knowledge of statistics can make you an expert in a particular field, such as football, by enabling you to analyze and interpret data related to the sport. You can gain insights into player performance, team strategies, and make informed decisions based on statistical evidence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define Cases (in Statistics)

A

In the context of statistics, cases refer to the individuals, objects, or entities that we are studying or collecting data on. They can be people, animals, organizations, countries, or any other unit of analysis. Each case represents a unique entity that we want to gather information about. For example, in a study about football players, each player would be considered a case. Similarly, in a study about football teams, each team would be a case. Cases help us understand and analyze the characteristics and variables associated with them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define Variables (in Statistics)

A

Variables, in the context of statistics, refer to the characteristics or attributes that we measure or observe in a study. They represent the different aspects or properties of the cases or individuals we are studying. Variables can take on different values or levels, and they can be quantitative or categorical. These types of variable are then split into 2 variable classifications. Quantitative variables have discrete and continuous variables. Categorical variables have Nominal and Ordinal variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define Discrete Variables and what category of variables they belong to

A

These variables have a finite or countable number of possible values. For example, the number of goals scored by a football player or the number of students in a class.

Discrete variable belong to the Quantitative variable category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define Continuous Variables and what category of variables they belong to

A

These variables can take on any value within a certain range. They are measured on a continuous scale. Examples include height, weight, or temperature.

Continuous variables belong to the Quantitative variable category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define Nominal variables and what category of variables they belong to

A

These variables have categories or labels with no inherent order or ranking. Examples include gender, nationality, or eye color.

Nominal variables belong to the Categorical variable category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define Ordinal variables and what category of variables they belong to

A

These variables have categories with a specific order or ranking. The difference between categories may not be uniform. Examples include rating scales (e.g., Likert scale) or educational levels (e.g., high school, college, graduate).

Ordinal variables belong to the Categorical variable category

17
Q

When you are conducting a study its best to think about it as what two parts…

A

Variables and cases. Cases being the person, animal, or things you are studying. Variables being the characteristics of interest.

18
Q

Give an example of a Case: (context Football)

A

A case could be individual players of a sports team.

19
Q

Give an example of a Variable: (context Football)

A

Variables would be things such as age, weight, height, goals scored, team membership, hair color, etc.

20
Q

What is a Data Matrix?

A

A data matrix is a tabular representation of data in which cases (individuals or things being studied) are organized in rows, and variables (characteristics of interest) are organized in columns. Each cell in the data matrix contains an observation, which represents a measurement or data point for a specific case and variable combination. Data matrices are commonly used in statistical studies to organize and analyze data. Think about it like a excel spreadsheet.

21
Q

What are a few shortcomings of a data matrix?

A

Some potential shortcomings of a data matrix include:

Missing Data: Data matrices may have missing values for certain cases or variables, which can limit the completeness and accuracy of the analysis.

Limited Representation: Data matrices may not capture all relevant information or nuances of the data. They provide a simplified representation of the data, which may overlook important details or context.

Lack of Context: Data matrices do not provide contextual information about the data, such as the source, collection methods, or any underlying assumptions. This can make it challenging to interpret and analyze the data accurately.

Limited Flexibility: Once data is organized in a matrix format, it may be difficult to modify or add new variables or cases without restructuring the entire matrix. This lack of flexibility can hinder the exploration of new research questions or changes in data requirements.

Potential Bias: The way data is organized in a matrix can introduce bias or influence the analysis. For example, the order of cases or variables can impact the perception of patterns or relationships in the data.

Large Size: Data matrices can become large and complex, especially when dealing with a large number of cases or variables. This can make it challenging to manage, analyze, and present the data effectively.

22
Q

What can you do when you are presented with a large data matrix with a lot of data?

A

You can present the data in summaries in the form of tables and graphs. You would then make a frequency table.

23
Q

Define Frequency Table:

A

A frequency table is a tabular representation that summarizes the distribution of values or categories in a dataset. It displays the number of occurrences or frequency of each value or category within a variable. The table typically consists of two columns: one for the values or categories and another for the corresponding frequencies. Frequency tables are commonly used in descriptive statistics to provide a concise and organized summary of data, allowing for easy interpretation and analysis.

Essentially, a frequency table shows how the values are distributed over cases. Thru “recoding” you can make the frequency table more concise (at the risk of losing a little data)

24
Q

What does “Recoded”/”Recoding” mean?

A

When you Recode a variable, you are turning the Quantitative variable into a Ordinal variable. It is NOT possible to turn a Ordinal variable into a quantitative as ordinal variable.

25
Q
A