Introduction to Statistics and Data Analysis Flashcards
a branch of mathematics that deals with the collection, organization, and analysis of numerical data, and with such problems as experiment design and decision making
statistics
three important features of statistics
data gathering
data analysis
decision making
data collected in original form
raw data
characteristic or attribute that can assume different values
variable
all subjects possessing a common characteristic that is being studied
population
a subgroup or subset of a population
sample
characteristic or measure obtained from a population
parameter
variables which assume non-numerical values
qualitative variables
variables which assume numerical values
quantitative variables
a variable which assumes a finite or countable number of possible values, usually obtained by counting
discrete variable
a variable which can assume an infinite number of possible values between any two given values, usually obtained by measurement
continuous variable
methods of sampling
random sampling
systematic sampling
stratified sampling
cluster sampling
sampling in which the data is collected using chance methods or random numbers
random sampling
sampling in which the data is collected by selecting every kth object
systematic sampling
sampling in which the population is divided into groups or strata according to some characteristic, and subjects are then selected from each stratum
stratified sampling
sampling in which the population is divided into groups (clusters), usually geographically, and whole clusters are then selected
cluster sampling
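The four sampling methods above can be sketched in Python as follows. This is a minimal illustration using only the standard library; the function names, the `stratum_of` callback, and the `clusters` dictionary are hypothetical names chosen for this example, not part of any standard API.

```python
import random

def simple_random_sample(population, n, seed=0):
    # Random sampling: every subject has the same chance of selection.
    rng = random.Random(seed)
    return rng.sample(population, n)

def systematic_sample(population, k, start=0):
    # Systematic sampling: take every kth object, beginning at index `start`.
    return population[start::k]

def stratified_sample(population, stratum_of, n_per_stratum, seed=0):
    # Stratified sampling: group subjects by a characteristic,
    # then draw a random sample from each stratum.
    rng = random.Random(seed)
    strata = {}
    for item in population:
        strata.setdefault(stratum_of(item), []).append(item)
    sample = []
    for members in strata.values():
        sample.extend(rng.sample(members, min(n_per_stratum, len(members))))
    return sample

def cluster_sample(clusters, n_clusters, seed=0):
    # Cluster sampling: pick whole clusters at random and keep all their members.
    # `clusters` maps a cluster name to the list of subjects in it.
    rng = random.Random(seed)
    chosen = rng.sample(sorted(clusters), n_clusters)
    return [item for name in chosen for item in clusters[name]]
```

For example, `systematic_sample(list(range(1, 21)), 5)` keeps the subjects at positions 0, 5, 10, and 15 of the list.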
methods of summarizing or characterizing data
tabular method
graphical method
numerical method
three tabular methods
frequency distribution
cumulative frequency
stem and leaf table
four graphical methods
frequency histogram
frequency polygon
ogive
pie chart
four numerical methods
measures of central tendency
measures of dispersion
measures of shape
measures of data location
the organization of raw data in tabular form with classes and frequencies
frequency distribution
the range of values, from a lower to an upper class limit, that separates one class in a grouped frequency distribution from another
class interval
the dividing point between adjacent classes; it has one more decimal place than the raw data and therefore does not appear in the data
class boundary
the number of times a certain value or class of values occurs
frequency
frequency divided by the total number of data values
relative frequency
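As a rough illustration of the cards above, the sketch below builds a grouped frequency distribution in Python. The data values and class limits are made up for the example, and `frequency_table` is a hypothetical helper name. It assumes whole-number data, so each class boundary lies 0.5 beyond the class limit.

```python
data = [12, 15, 17, 21, 22, 22, 25, 28, 31, 34]  # hypothetical raw data
classes = [(10, 19), (20, 29), (30, 39)]         # (lower limit, upper limit)

def frequency_table(data, classes):
    total = len(data)
    cumulative = 0
    rows = []
    for lower, upper in classes:
        # Frequency: how many values fall inside this class.
        freq = sum(1 for x in data if lower <= x <= upper)
        cumulative += freq
        rows.append({
            "limits": (lower, upper),
            # Boundaries carry one more decimal place than the raw data.
            "boundaries": (lower - 0.5, upper + 0.5),
            "class_mark": (lower + upper) / 2,
            "frequency": freq,
            "relative_frequency": freq / total,
            "cumulative_frequency": cumulative,
        })
    return rows

table = frequency_table(data, classes)
for row in table:
    print(row)
```

The relative frequencies always sum to 1, and the last cumulative frequency equals the total number of data values.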
a graph which displays the data by using vertical bars of various heights to represent frequencies
frequency histogram
a line graph of frequency plotted against class mark
frequency polygon
a line graph of cumulative (or relative cumulative) frequency plotted against upper class boundaries
ogive
a circle divided into slices, where the angle of each slice is proportional to the relative frequency of its category
pie chart
for grouped data, the sum of the products of each class mark and its corresponding frequency, divided by the total number of observations
mean
the value that divides the sample into two equal halves when the observations are arranged from lowest to highest
median
the value that occurs most frequently
mode
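The three measures of central tendency can be computed with Python's standard `statistics` module, as sketched below. The data, class marks, and frequencies are hypothetical values chosen for illustration. The grouped mean uses class marks in place of the raw values, so it only approximates the mean of the raw data.

```python
import statistics

data = [12, 15, 17, 21, 22, 22, 25, 28, 31, 34]  # hypothetical raw data

# Grouped data for the same sample: class marks and their frequencies.
class_marks = [14.5, 24.5, 34.5]
frequencies = [3, 5, 2]

# Mean of grouped data: sum of (class mark * frequency) / total frequency.
grouped_mean = sum(m * f for m, f in zip(class_marks, frequencies)) / sum(frequencies)

raw_mean = statistics.mean(data)   # arithmetic mean of the raw values
median = statistics.median(data)   # middle value of the sorted data
mode = statistics.mode(data)       # most frequent value

print(grouped_mean, raw_mean, median, mode)
```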
the difference between the highest and lowest values; the simplest measure of how spread out the sample is
range
the average of the squared deviations from the mean; measures how the sample is dispersed
variance
the positive square root of the variance
standard deviation
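A short Python sketch of the measures of dispersion, using the standard `statistics` module on made-up data. Note that `pvariance` and `pstdev` divide by n (population formulas), while `variance` and `stdev` divide by n - 1 (sample formulas); which one applies depends on whether the data are treated as a population or a sample.

```python
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data

# Range: highest value minus lowest value.
data_range = max(data) - min(data)

# Population variance and standard deviation (divide by n).
pop_variance = statistics.pvariance(data)
pop_std = statistics.pstdev(data)

# Sample variance and standard deviation (divide by n - 1).
sample_variance = statistics.variance(data)
sample_std = statistics.stdev(data)

print(data_range, pop_variance, pop_std, sample_variance, sample_std)
```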
a measure of the asymmetry of the distribution of the sample
skewness
a measure of the peakedness (and tail heaviness) of the distribution
kurtosis
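One common way to quantify the measures of shape is through central moments, sketched below in plain Python. This is only one of several conventions: the functions here use the population (divide by n) moments with no bias correction, and the kurtosis is not reduced by 3, so a normal distribution would score about 3. The function names are hypothetical.

```python
def central_moment(data, order):
    # Average of (x - mean) raised to the given power.
    mean = sum(data) / len(data)
    return sum((x - mean) ** order for x in data) / len(data)

def skewness(data):
    # Third central moment scaled by the standard deviation cubed.
    # Zero for symmetric data, positive when the right tail is longer.
    return central_moment(data, 3) / central_moment(data, 2) ** 1.5

def kurtosis(data):
    # Fourth central moment scaled by the variance squared.
    return central_moment(data, 4) / central_moment(data, 2) ** 2

data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data
print(skewness(data), kurtosis(data))
```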
the values that divide the ordered data into four equal parts, at the 25%, 50%, and 75% positions
quartiles
the values that divide the ordered data into ten equal parts, at the 10%, 20%, 30%, up to 90% positions
deciles
the values that divide the ordered data into one hundred equal parts, at the 1%, 2%, 3%, up to 99% positions
percentiles
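Quartiles, deciles, and percentiles can all be obtained from `statistics.quantiles` in Python 3.8 or later by changing the number of equal parts `n`. The data below are hypothetical. Different textbooks and libraries interpolate between data points in slightly different ways, so values computed by hand may differ a little from this function's default method.

```python
import statistics

data = [3, 7, 8, 5, 12, 14, 21, 13, 18, 1, 9]  # hypothetical data

# n equal parts are separated by n - 1 cut points.
quartiles = statistics.quantiles(data, n=4)      # 3 cut points
deciles = statistics.quantiles(data, n=10)       # 9 cut points
percentiles = statistics.quantiles(data, n=100)  # 99 cut points

print(quartiles)
print(len(deciles), len(percentiles))
```

The second quartile, the fifth decile, and the 50th percentile all coincide with the median.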