Unit 5 Comp Sci Flashcards

1
Q

What is Citizen Science?

A

Scientific research conducted in whole or part by distributed individuals, many of whom may not be scientists, who contribute relevant data to research using their own computing devices.

2
Q

What is Cleaning Data?

A

A process that makes the data uniform without changing its meaning (e.g., replacing all equivalent abbreviations, spellings, and capitalizations with the same word).
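As a minimal sketch of the idea, the snippet below maps several spellings of the same answer onto one canonical word. The sample values, the CANONICAL mapping, and the clean helper are hypothetical and exist only for illustration.

```python
# Hypothetical survey responses: the same answer typed several ways.
raw_states = ["CA", "ca", "California", " calif. ", "NY", "new york"]

# Assumed mapping from each known variant (lowercased) to one canonical spelling.
CANONICAL = {
    "ca": "California",
    "calif.": "California",
    "california": "California",
    "ny": "New York",
    "new york": "New York",
}

def clean(value):
    """Return the canonical form of value, or the trimmed original if unknown."""
    key = value.strip().lower()
    return CANONICAL.get(key, value.strip())

cleaned_states = [clean(v) for v in raw_states]
print(cleaned_states)
```

The meaning of each response is unchanged; only its spelling and capitalization are made uniform.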

3
Q

What is Correlation?

A

A relationship between two pieces of data, typically referring to the amount that one varies in relation to the other.
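One common way to put a number on such a relationship is the Pearson correlation coefficient, which ranges from -1 to 1. The sketch below computes it by hand; the pearson helper and the study-hours and score values are made up for illustration and are not part of the deck.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # How much the two lists vary together, and how much each varies alone.
    covariation = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariation / sqrt(spread_x * spread_y)

# Hypothetical data: hours studied and the matching test scores.
hours = [1, 2, 3, 4, 5]
scores = [52, 60, 71, 80, 88]

r = pearson(hours, scores)
print(round(r, 3))
```

A value near 1 means the scores tend to rise as the hours rise; a value near -1 would mean one tends to fall as the other rises. A correlation alone does not show that one causes the other.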

4
Q

What is Crowdsourcing?

A

The practice of obtaining input or information from a large number of people via the Internet.

5
Q

What is Information?

A

The collection of facts and patterns extracted from data.

6
Q

What is Data bias?

A

Data that does not accurately reflect the full population or phenomenon being studied.

7
Q

What is Data filtering?

A

Choosing a smaller subset of a data set to use for analysis, for example by eliminating or keeping only certain rows in a table.
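A minimal sketch of filtering, assuming a small table stored as a list of rows. The names, grades, and scores are hypothetical.

```python
# Hypothetical table: each dictionary is one row.
rows = [
    {"name": "Ava", "grade": 9, "score": 88},
    {"name": "Ben", "grade": 10, "score": 75},
    {"name": "Cy", "grade": 9, "score": 92},
    {"name": "Dee", "grade": 11, "score": 81},
]

# Keep only the rows for 9th graders; every other row is filtered out.
ninth_graders = [row for row in rows if row["grade"] == 9]

for row in ninth_graders:
    print(row["name"], row["score"])
```

The original table is left untouched; the analysis simply runs on the smaller subset.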

8
Q

What is a Bar Chart?

A

Graph of bars that shows the number of times each value in a column of data appears.
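The counting behind a bar chart can be sketched in a few lines. The column of pet names below is hypothetical, and the bars are drawn as text rather than with a charting library.

```python
from collections import Counter

# Hypothetical column of data: one favorite pet per survey response.
pets = ["dog", "cat", "dog", "fish", "cat", "dog"]

# Count how many times each value appears in the column.
counts = Counter(pets)

# Draw one text bar per value, longest bar first.
for value, count in counts.most_common():
    print(f"{value:<5} {'#' * count}")
```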

9
Q

What is a Histogram?

A

Similar to a bar chart, but all numbers within a range (bucket) are grouped together.
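A rough sketch of the bucketing step, assuming buckets that are each 10 points wide. The scores and the BUCKET_WIDTH value are hypothetical choices for illustration.

```python
from collections import Counter

# Hypothetical test scores.
scores = [52, 58, 61, 67, 69, 73, 75, 78, 84, 91]

BUCKET_WIDTH = 10  # assumed width; each bucket covers 10 points

# Map every score to the start of its bucket (e.g. 67 -> 60), then count.
buckets = Counter((s // BUCKET_WIDTH) * BUCKET_WIDTH for s in scores)

# Draw one text bar per bucket, in numeric order.
for start in sorted(buckets):
    print(f"{start}-{start + BUCKET_WIDTH - 1}: {'#' * buckets[start]}")
```

Unlike the bar chart, scores such as 61, 67, and 69 are not counted separately; they all land in the 60-69 bucket.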

10
Q

What is a Crosstab Chart?

A

Counts the number of times combinations of values appear (similar to a frequency table).
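As a sketch, counting combinations can be done by treating each pair of values as a single item. The grade levels and yes/no answers below are hypothetical.

```python
from collections import Counter

# Hypothetical survey rows: (grade level, answer to a yes/no question).
responses = [
    ("9th", "yes"), ("9th", "no"), ("10th", "yes"),
    ("9th", "yes"), ("10th", "no"), ("10th", "yes"),
]

# Count how many times each (grade, answer) combination appears.
crosstab = Counter(responses)

# Print the counts as a small table: one row per grade, one column per answer.
print(f"{'':<6}{'yes':>4}{'no':>4}")
for grade in ("9th", "10th"):
    print(f"{grade:<6}{crosstab[(grade, 'yes')]:>4}{crosstab[(grade, 'no')]:>4}")
```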

11
Q

What is a Scatterplot?

A

Graph that shows the relationship between two sets of data by plotting each pair of values as a point.

12
Q

What is Open Data?

A

Publicly available data shared by governments, organizations, and others so that anyone can analyze it.

13
Q

What is Big Data?

A

Collection of huge amounts of data gathered so that we can learn from it, often requiring cloud computing or parallel processing systems.

14
Q

What is Metadata?

A

Data about data.

15
Q

What is the primary purpose of cleaning data in the Data Analysis Process?

A

To make the data uniform without changing its meaning.

16
Q

Which visualization would be most appropriate for examining the relationship between students’ study hours and their test scores?

A

Scatterplot.

17
Q

What is ‘data bias’ as defined in the study guide?

A

Data that does not accurately reflect the full population or phenomenon being studied.

18
Q

Which of the following best describes the relationship between data and information?

A

Information is the collection of facts and patterns extracted from data.

19
Q

When working with Big Data, which computing approach is typically necessary?

A

Cloud computing or parallel processing systems.

20
Q

What is the key difference between a bar chart and a histogram?

A

Bar charts show discrete values while histograms group numbers within ranges (buckets).

21
Q

Which of the following is an example of metadata?

A

The timestamp of when a photo was taken.

22
Q

What computing practice involves obtaining input or information from a large number of people via the Internet?

A

Crowdsourcing.

23
Q

According to the AP standards covered in Unit 5, what can programs be used for in relation to data?

A

To process data, allowing users to discover information and create new knowledge.

24
Q

Which type of data analysis focuses on examining how frequently each value appears in a single column?

A

One-column analysis.