Lecture 1.2 - Introduction and Types of Data - Understanding data Flashcards
1
Q
What is Data?
A
- Data collection is necessary to learn something.
- Data are facts and figures that are collected, analyzed, and summarized for presentation and interpretation.
- Statistics relies on data, information that surrounds us.
2
Q
Why do we collect Data
A
- Interested in characteristics of people, places, things, or events.
- Example: Interested in temperatures in Chennai in a particular month.
- Example: Interested in marks obtained by students in Class 12.
- Interested in how many people like a new song, product, or video, collected through comments.
3
Q
Data Collection
A
- Data available: Published data.
- Data not available: Need to collect or generate data.
- Statistical analysis is done on available data, assuming it’s already available.
4
Q
Unstructured and structured data
A
- Context of the numbers and text in a database must be known to make it useful.
- Scattered and unstructured information is of very little use.
- Organizing data is necessary; structured data is a collection of values, such as numbers, names, and roll numbers.
5
Q
Variables and cases
A
- Case (observation): A unit from which data are collected.
- Variable: A characteristic or attribute that varies across all units.
- In a school data set, each student is a case and variables include name, marks obtained, board, etc.
- Rows represent cases: For each case, the same attributes are recorded.
- Columns represent variables: For each variable, the same type of value from each case is recorded.