M5 Flashcards
representing counts or measurements
Numerical data
descriptions or characteristics
Categorical data
Any recording of information is called
observation
comprises those methods concerned
with collecting and describing a set of data so as to yield
meaningful information.
Descriptive statistics
comprises those methods concerned
with the analysis of a subset of data leading to predictions or
inferences about the entire set of data.
STATISTICAL INFERENCE
Infer the expected amount of rain for July next year based
on the average precipitation data for July in the past 30
years.
STATISTICAL INFERENCE
consists of the totality of the observations with
which we are concerned. May be finite or infinite
population
is a subset of a population.
sample
representative of the
population.
sample
A useful tool in choosing a randon sample from any population
Table of Random Samples
are often used to compare quantities in
different categories.
bar graphs
used to show the distribution or
proportions of parts to a whole
pie graph
show information that is connected in some
way like changes through time.
Line Graphs
is the organization of raw data in table form, using classes and frequencies
frequency distributuion
When the range of the data is large, the data must be grouped into classes that are more than one unit in width, in what is callsed a
group frequency distribution
each class is defined by its X, which are the smalles and highest data value that can be included in the class
class limits
are numbers used to separate the classes so that there are no gaps in the frequency distribution
class boundaries
are used to show how many data values are accumulated up to and including a specific class
cumulative frequencies
what is the formula for class mark
(lower limit + upper lim) /2
a bar graph that frequencies against the class boundaries
histogram
is the line graph of the frequencies against the class marks. Close the polygon at the lowest and highest class boundaries
frequency polygon
line graph of the comulative frequency with the upper boundary
ogive
These values are used to represent a set of data.
mean median mode
2 types of mean
populatn sample
is the middle number when all observations are arranged in
increasing or decreasing order.
median
that value which occurs
most often with the greatest frequency.
mode
These values are used to describe the distribution of a
set of data
- Range
- Variance
- Standard Deviation
the difference between the largest and smallest number in the set
Range
This is the value used to compare values from different sets
with different mean and standard deviation.
z-score
representative value of the elements of each class
percentile
is a chance process that leads to well-defined results called outcomes
probability experiment
is a result of a single trial of a probability experiment
outcome
is the set of all possible outcomes of a probability experiment
sample space
consists of a set of outcomes of a probability experiment
even
are events that have the same probability of occuring
equally likely events
assumes that all outcomes in the sample space are equally likely to occur
classical probability