From study guide posted Flashcards
What is data?
Raw facts or figures
What is statistics and what does it involve?
The study of data, involving:
Collecting data
Summarizing data
Drawing conclusions
Descriptive vs inferential statistics
Descriptive: Summarizes data.
Inferential: Makes predictions or inferences about a population from a sample.
What is population data?
Often not feasible to obtain due to size or time constraints.
Cross-sectional vs time series data
Cross-sectional: Data collected at one point in time.
Time series: Data collected over multiple time periods.
Structured vs unstructured data
Structured: Organized (e.g., in tables).
Unstructured: Disorganized (e.g., text, images).
What is a variable?
A characteristic that can take different values.
Categorical vs numerical data
Categorical: Qualitative (e.g., gender, type).
Numerical: Quantitative (e.g., height, age).
Discrete vs continuous data
Discrete: Countable (e.g., number of cars).
Continuous: Measurable (e.g., weight, height).
What are the four scales of measurement?
Nominal: Categories (e.g., gender).
Ordinal: Ordered categories (e.g., rankings).
Interval: Ordered with equal intervals but no true zero (e.g., temperature).
Ratio: Ordered with equal intervals and a true zero (e.g., weight).
How do you deal with missing data?
Omission: Remove missing values.
Imputation: Fill in missing values with estimates.
What is frequency distribution?
Summarizes data into categories or intervals.
How do you visualize categorical data?
Bar chart
Pie chart
Contingency table
Stacked column chart
How do you visualize numeric data?
Histogram: For continuous data.
Intervals: Calculate width by (Max - Min) / Number of intervals.
Skewness: Symmetric, Positive, or Negative.
What are three ways to measure central location?
Mean, median, and mode.
Mean (μ for population, x̄ for sample)
Median (especially useful for outliers)
Mode (Unimodal, Bimodal)