Unit 5: Data Flashcards
What is data?
I. Computer readable information
II. Information collected about the physical world
III. Programs that process images
IV. Graphs and charts
Source: CodeHS Data
A) II, IV
B) I, II
C) I, II, IV
D) I, II, III, IV
B
Notes: Right! Data is just information that is collected. Digital data must be in a computer readable form, like digital images.
Answer: B) I, II
Explanation:
Data is information that can be collected, stored, and analyzed.
I. Computer-readable information: True, as digital data must be encoded in a form that computers can process.
II. Information collected about the physical world: True, as data often comes from observations or measurements of real-world phenomena.
III. Programs that process images: False, these are not data but tools to analyze data.
IV. Graphs and charts: False, these are visual representations of data, not the data itself.
Which of the following statements is NOT a benefit of using computers to process data?
Source: CodeHS Data
A) People can use computers to find patterns in data and make predictions.
B) Computers help people visualize data so that it is easy to extract useful information.
C) Websites can spy on people and gather large amounts of personal data without the user knowing.
D) Computers are able to easily process, manipulate, and display large amounts of data in a short amount of time.
C
Websites can spy on people and gather large amounts of personal data without the user knowing. Notes: This is not exactly a benefit as this information can be used for evil purposes.
Answer: C) Websites can spy on people and gather large amounts of personal data without the user knowing.
Explanation:
While this is a factual statement, it is not a benefit of using computers for data processing.
The other options highlight positive uses of data processing, such as identifying patterns, creating visualizations, and handling large datasets efficiently.
Which of the following statements is an example of computer readable data?
Source: CodeHS Data
A) A handwritten note
B) Brain waves
C) A physical photograph
D) A digital spreadsheet filled with measurements about the air quality of different major cities
D
Answer: D) A digital spreadsheet filled with measurements about the air quality of different major cities.
Explanation:
A handwritten note and a physical photograph are analog forms of data and not computer-readable.
Brain waves require sensors and specialized software for conversion into computer-readable format.
A digital spreadsheet is already formatted for computer processing.
Which of the following statements is true about data visualizations?
Source: CodeHS Data
A) Visualizing data has only been possible since computers have become widespread.
B) Visualizations take many forms, from tables to charts to images.
C) There is always one exact visualization that should be used to show a particular aspect of a dataset.
D) The only way to extract information from data is by using a visualization.
B
Answer: B) Visualizations take many forms, from tables to charts to images.
Explanation:
Data visualizations are versatile and can represent information in various formats to highlight patterns and relationships.
The other options incorrectly limit the utility or types of visualizations.
Suppose you want to make a visualization that shows how many students bought certain quantities of candy from the vending machine during the month of September. For example, this visualization should reveal the frequency of students who bought 3 candy bars versus the frequency of students bought 10 candy bars. Of the choices below, which chart would best convey this information to the person looking at the graph?
Source: CodeHS Data
A) Pie chart
B) Histogram
C) A map where the colors represent the number of candy bars bought
D) Line chart
B
Answer: B) Histogram
Explanation:
A histogram is specifically designed to display the distribution and frequency of numerical data, making it the ideal choice for this scenario.
There are several different kinds of charts that we commonly use to visualize data. In which of the following would one of the charts described help to accomplish the accompanying task described?
Source: CodeHS Data
A) You want to track the number of times you say “hello” today.
B) Your school wants to track how many people attend the football games over time throughout the school year.
C) You are trying to figure out what happens when different colors are mixed.
D) You need instructions on how to bake a cake.
B
Answer: B) Your school wants to track how many people attend the football games over time throughout the school year.
Explanation:
Line charts are commonly used to track changes over time, such as attendance trends across a year.
A natural science museum opened a new display that lets the visitors view animations of a coral reef. The animations show how the health of the coral reef varies based on water temperature, pollution levels, and the number of fish living around the reef. The visitors are able to choose a numerical value for each of the conditions. The exhibit’s animations are determined by using a database to look up how healthy the coral reef is at the particular settings the visitor chooses and displaying a corresponding picture.
What is the biggest advantage of using an interactive exhibit like this instead of showing a poster with the same information?
Source: CodeHS Data
A) The visitors will be more entertained by the exhibit, but won’t learn any more than they would have from just looking at a poster.
B) By allowing the visitors to interact with the exhibit, the visitors will be able to understand coral reefs better.
C) The interactive display will be more visually appealing than a static poster.
D) Scientists will be able to learn more about the coral reefs by tracking the visitors’ interactions.
B
Answer: B) By allowing the visitors to interact with the exhibit, the visitors will be able to understand coral reefs better.
Explanation:
Interactive exhibits engage visitors by allowing them to manipulate variables, enhancing understanding of relationships between factors like pollution levels and coral health.
Polly and Sergei are working on a project to explain how the rise in oil prices is leading to a rise in lunch prices at their school.
Polly wants to have a chart showing the oil prices every day over the past 18 months and a different chart showing the lunch prices every day over the past 18 months. Sergei argues that having two separate charts won’t show the relationship between oil prices and lunch prices. He also thinks that the charts are showing too many data points. Instead, he wants to use a program to make a chart that shows both the oil prices and the lunch prices on the same chart. In addition, rather than plotting the prices for every day, he only wants to chart the average monthly prices for oil and lunch.
Why would Sergei’s approach make it easier for other people to analyze the data than Polly’s approach?
Source: CodeHS Data
A) Sergei’s chart would be much smaller than Polly’s chart, so people wouldn’t be overloaded with visual information.
B) It is always better to put all of the data you want to analyze on the same chart.
C) Polly’s presentation is more likely to be misunderstood because it uses two charts.
D) By transforming and summarizing the available data, Serge’s chart would more effective in showing any trends that may have occurred.
D
Answer: D) By transforming and summarizing the available data, Sergei’s chart would be more effective in showing any trends that may have occurred.
Explanation:
Combining datasets (oil prices and lunch prices) into one chart and summarizing data (using averages) simplifies analysis and makes trends easier to identify.
Which of the following statements are true about using visualizations to display a dataset?
Source: CodeHS Data
I. Visualizations are visually appealing, but don’t help the viewer understand relationships that exist in the data
II. Visualizations like graphs, charts, or visualizations with pictures are useful for conveying information, while tables just filled with text are not useful.
III. Patterns that exist in the data can be found more easily by using a visualization
A) I and II
B) II and III
C) III only
D) I, II, and III
C
Answer: C) III only
Explanation:
Patterns in data are easier to identify through visualizations like charts and graphs, but statements I and II are overly restrictive or incorrect.
Which of the following are ways that data is collected about you:
I - Websites store data that tracks how you use the website
II - Websites store cookies in your browser so that the next time you visit the website things like your profile login and recent activity are saved
III - Some apps store geolocation information from your phone to track your location
IV - Transaction data is stored by credit card companies when you purchase things with a credit card
Source: CodeHS Data
A) I only
B) II only
C) I, II, III, and IV
D) I, II, and IV
C
Answer: C) I, II, III, and IV
Explanation:
Websites track user behavior (I), store cookies (II), and some apps track geolocation (III).
Credit card companies collect transaction data (IV).
Suppose a student named Marcus wants to learn about the sleeping habits of students in his class. Marcus wants to collect data from his classmates to learn how many hours of sleep his classmates get. He then wants to process this data with a computer and visualize it in a Histogram.
Which of the following would be the best technique for Marcus to collect this data?
Source: CodeHS Data
A) Marcus should ask each of his classmates to write down on a piece of paper how many hours of sleep they get per night and hand the paper to him.
B) Marcus should have them download an app that tracks their phone geolocation and activity so he can see when their phones are in their rooms and not being used. From this data he can figure out how long each student sleeps.
C) Follow the link to view picture: https://drive.google.com/file/d/1hTjFSfPs1Gzi1bGNOxXrlK59n3GrLZa6/view?usp=drive_link
D) Follow the link to view picture:
https://drive.google.com/file/d/1Jp3GaS02cKE_z2b9lHIgYXpj_DjoG8Ho/view?usp=drive_link
C
Correct! The simplest and most effective way to collect data is with an online survey. It is better to ask for numeric data rather than text data so that he can visualize the numbers later with a Histogram.
Answer: C) Follow the link to view picture.
Explanation:
The linked option likely refers to using a survey, which is the most efficient and reliable way to collect specific numeric data for visualization.
News reporting agencies often want to find the public’s opinion on current events. One particular agency is considering two different strategies to collect this data by collecting responses to online surveys. The two strategies are outlined below.
Strategy One
1. Uses a database to store all of the survey responses
2. Stores some data as text and some data as numbers
3. Will track extra information about the survey taker that won’t be publicly visible
Strategy Two
1. Uses a single spreadsheet to store all of the survey responses
2. Stores all data as numbers
3. Will not track any information other than the survey responses
Which of the following statements is the most accurate comparison of these strategies?
Source: CodeHS Data
A) Strategy One will make it easier to sort and filter the data, while Strategy Two will make it easier to graph the data
B) Strategy One will cause problems because of the mixed data types, while Strategy Two will make it very easy to find specific data.
C) Strategy One will allow the agency to conclude more about the public’s opinion because it tracks extra metadata, while Strategy Two will make it hard to find trends and access particular pieces of the data.
D) Strategy One will require less cleaning and manipulation of the data, while Strategy Two will require a significant amount of extra computation to use the data.
C
Answer: C) Strategy One will allow the agency to conclude more about the public’s opinion because it tracks extra metadata, while Strategy Two will make it hard to find trends and access particular pieces of the data.
Explanation:
Strategy One’s use of mixed data types and metadata enhances analytical capabilities, while Strategy Two’s limited structure reduces versatility.
Which of the following statements describes how mobile devices, the use of computers in more and more everyday interactions, and the ability to connect with other devices almost anywhere are changing society?
I. People are able to use mobile devices for new applications such as finding directions or finding restaurants
II. Data can be collected from thousands of sources and can be combined to provide new services to individuals and companies
III. Buildings, cars, classrooms, and offices can now be engineered with sensors to automate tasks like adjusting the thermostat or even driving
IV. Data that is collected can be used to identify social problems
Source: CodeHS Data
A) II, IV
B) III
C) I, III
D) I, II, III, IV
D
Answer: D) I, II, III, IV
Explanation:
All listed options describe societal shifts, from new applications (I) to collecting data for services (II) and using sensors in infrastructure (III, IV).
Shown here is the google search trend data for the search term “flu vaccine”:
https://drive.google.com/file/d/1VA121rA6gosK9Z_O0-Uir6KZgsXGFsL2/view?usp=sharing
What can we reasonably conclude from this data visualization?
Source: CodeHS Data
A) Exactly 100 people had a flu vaccine in October 2009
B) The highest interest in flu vaccines occurs in October each year, and will likely continue to occur in October in future years
C) The highest number of flu infections happened in 2009
D) Taking vitamin B-12 can help reduce your risk of catching the flu
B
Answer: B) The highest interest in flu vaccines occurs in October each year, and will likely continue to occur in October in future years.
Explanation:
The data visualization shows peaks in interest during October, a recurring seasonal trend likely tied to flu season.
Shown here is a line graph showing the stock prices for Twitter, Inc.
https://drive.google.com/file/d/1UbY85DoSnrGfqjZCvRQYt-xZNrV75nkI/view?usp=drive_link
What is misleading about this visualization?
Source: CodeHS Data
A) The y-axis is upside down, so larger values are at the bottom and smaller values are at the top.
B) The y-axis is truncated making the graph seem like it is increasing a lot more than it actually is.
C) The graph is omitting data.
D) The graph makes it seem like the increase in stock prices is caused by the month that they are sold/bought.
B
Answer: B) The y-axis is truncated, making the graph seem like it is increasing a lot more than it actually is.
Explanation:
Truncated axes exaggerate visual trends, misleading viewers about the true scale of changes.