Les 4 - DATA PROCESSING AND STATISTICAL TREATMENT Flashcards
involves the conversion of data in either manually or digitally into quantitative and qualitative forms for use in research analysis. It involves 3 general steps.
Data Processing
1. Categorization
2. Coding
3. Tabulation of Data
Nature of statistics
A. As a body of knowledge or science (study of data, population/s, variations, distributions)
B. As a mass of data
Two Major Areas of Statistics
Descriptive and Inferential Statistics
concerned with the methods for collection, organizing, and describing a set of data so as to yield meaningful information.
Descriptive Statistics
comprises those procedures for drawing inferences or making generalizations about characteristics of a population based on partial and incomplete information obtained from a sample of the population
Inferential Statistics
Parameter vs Statistic (differences in meaning and symbols used)
A parameter is a number describing a whole population (e.g., µ, σ), while a statistic is a number describing a sample (e.g., x̄, sd)
is any trait or attribute that vary from
person to person or case to case
variable
_______________ refer to any recorded information derived from counts, measurements, observations, interviews, experiments and other techniques. The data originally measured are referred to as _______________.
Statistical Data; Raw Data
Qualitative vs Quantitative Variables
classifies objects or cases according to the type or quality (qualitative) or the degree or amount (quantitative) of their attributes
Discrete vs Continuous Variables
- Discrete variable – the observations on the
variable are countable units, expressed as
whole numbers - Continuous variable – can assume an
infinitely large number of small, fractional
values on a continuum
classifies data into distinct categories in which no ranking is implied. Weakest form of measurement.
Nominal
classifies values into distinct categories in which ranking is implied.
Ordinal
is an ordered scale in which the difference between measurements is a meaningful quantity but does not involve a true zero point; difference has the same meaning anywhere on the scale.
Interval
is an ordered scale in which the difference between the measurements involves a true zero point, as in height, weight, age, or salary measurements
Ratio
Identify the the levels of measurement of the following: Exam score, temperature in C, Age, Types of Rocks, Officer Hierarchy, Cost of Unliwings
Exam score, temperature in C, Age, Types of Rocks, Officer Hierarchy, Cost of Unliwings
Interval, Interval, Ratio, Nominal, Ordinal, Ratio
is a powerful statistical software platform that offers a user-friendly interface and a robust set of features that lets your organization quickly extract actionable insights
from your data according to International Business Machines Corporation (IBM)
SPSS (Statistical Package
for Social Sciences)
SPSS Statistical Steps
Setting up a Data File, Preparing a Code Book, Putting your ideas into work, Creating on SPSS Data File, Hypothesis Testing: P-value Approach
If null hypothesis is true but rejected.
Type I Error
If null hypothesis is wrong but failed to reject
Type II Error
Type of Error: Saying you’re pregnant to a man
Type I Error
Type of Error: Saying you’re not pregnant to a pregnant woman
Type II Error
concerned with the brief descriptive coefficients that summarize a given data set, which can be either a representation of the entire population or a sample of a population. Includes measures of central tendency and measures of variability (spread).
Descriptive Statistics
It is better to use __________ on categorical variables and ____________ for continuous variables in Descriptive Statistics.
Frequencies, Descriptives
Normality Value for Skewness, Kurtosis, Z-score, COV, Shapiro-Wilk’s W test, and Kolmogorov-Smirnov test?
Normality Value for Skewness, Kurtosis, Z-score, COV, Shapiro-Wilk’s W test, and Kolmogorov-Smirnov test?
± 2, ± 7, ± 3, <30%, >α (one-tailed) or >α/2 (two-tailed), >α (one-tailed) or >α/2 (two-tailed)
It is useful for identifying outliers
Z-score
One-tailed or Two-tailed?
A.
Ho: µ = 100
Ha: µ ≠ 100
B.
Ho: µ = 100
Ha: µ > 100
A. Two-tailed
B. One-tailed
is a measure of association between
two variables. The variables are not designated as dependent or independent
Correlation
Pearson correlation for 2 samples (r), Interpret results:
α = 0.01
r = -0.774
Sig. (2-tailed) = 0.003
The correlation is SIGNIFICANT (Sig. < α) the relationship is NEGATIVE and the degree and strength of association between two scores is VERY GOOD (0.774 or 77.4% is within 0.66-0.85)