Week 4 Flashcards by Omar Gonzalez

Data

recorded values of qualitative or quantitative observations.

Population

the collection of all subjects of interest.

Sample

a subset of the population of interest.

Parameter

a characteristic of a population.

Statistic

a characteristic of a sample.

Levels of Measurement

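qualitative [nominal (categories that cannot be put in any order) & ordinal (categories that can be ordered)] & quantitative [interval (numeric scale with equal spacing but no true zero, so values can range from -infinity to infinity) & ratio (numeric scale with a true zero, so values range from 0 to infinity)]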

Measure of Central Tendency

Mean (average of data points), Median (middle of data points) and Mode (most recurring data point)
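
The three measures can be computed with Python's standard `statistics` module. This is a minimal sketch; the `data` list is a made-up sample used only for illustration.

```python
import statistics

# Hypothetical sample, chosen only for illustration.
data = [2, 3, 3, 5, 7, 10]

mean_value = statistics.mean(data)      # sum of the values divided by their count
median_value = statistics.median(data)  # middle of the sorted values (average of the two middle ones here)
mode_value = statistics.mode(data)      # value that occurs most often

print(mean_value, median_value, mode_value)
```

With an even number of data points, as here, the median is the average of the two middle values.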

Measure of Position

Mean, Median, Mode, Min, Max.

Measure of Dispersion

Range, frequency, variance, standard deviation.

Measures of Relationship

Covariance, Correlation, Regression, Trend, Forecast.

Measures of Asymmetry

Skewness and Kurtosis.

Statistics

the science of collecting, summarizing, and drawing valid conclusions from data. It involves selecting models to validate hypotheses and test assumptions, determining the relationships between variables, assessing data trends and trajectories, identifying patterns and groupings, and detecting mistakes and outliers.

Uniform Distribution

distribution (continuous or discrete) whose values lie within a fixed range and are all equally likely (equal probability in the discrete case, constant density in the continuous case).

Binomial Distribution

discrete probability distribution, with parameters n and p, of the number of successes in a sequence of n independent experiments, each with a Boolean-valued outcome: success (with probability p) or failure (with probability q = 1-p).
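
As a rough illustration, the probability of exactly k successes is C(n, k) * p^k * q^(n-k). The helper name `binomial_pmf` and the coin-flip numbers below are assumptions made for this sketch, not part of the original card.

```python
from math import comb

def binomial_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k successes in n independent trials (hypothetical helper)."""
    q = 1 - p  # probability of failure on a single trial
    return comb(n, k) * p**k * q**(n - k)

# Example: probability of exactly 2 heads in 4 fair coin flips.
prob_two_heads = binomial_pmf(2, 4, 0.5)
print(prob_two_heads)
```

Summing `binomial_pmf(k, n, p)` over k = 0..n should give 1, which is a quick sanity check on the formula.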

Poisson distribution

discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time or space if these events occur with a known constant mean rate and independently of the time since the last event.
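
A minimal sketch of the Poisson probability mass function, P(k) = rate^k * e^(-rate) / k!. The function name `poisson_pmf` and the rate of 2 events per interval are illustrative assumptions.

```python
from math import exp, factorial

def poisson_pmf(k: int, rate: float) -> float:
    """Probability of exactly k events in one interval, given a constant mean rate (hypothetical helper)."""
    return rate**k * exp(-rate) / factorial(k)

# Example: with an average of 2 events per interval, the probability of seeing none.
prob_no_events = poisson_pmf(0, 2.0)
print(prob_no_events)
```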

Normal distribution

continuous, bell-shaped probability distribution defined by its mean and standard deviation. Its importance stems from the fact that sums and averages of many independent random variables, whatever their own distribution, are approximately normally distributed when the sample is large enough (CLT).

Central Limit Theorem

regardless of the underlying distribution of the data (provided it has finite variance), the sampling distribution of the sample mean approaches a normal distribution as the sample size n grows. The mean of the sampling distribution equals the mean of the original distribution, and its variance is n times smaller.
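
The statement about the mean and variance can be checked with a small simulation. This is a sketch under stated assumptions: the exponential distribution with rate 1 (mean 1, variance 1) stands in for a skewed, non-normal population, and the sample size, number of samples, and seed are arbitrary choices.

```python
import random
import statistics

random.seed(0)       # fixed seed so the run is repeatable
n = 50               # size of each sample
num_samples = 2000   # number of samples drawn

# Draw many samples from a skewed population and record each sample's mean.
sample_means = [
    statistics.fmean([random.expovariate(1.0) for _ in range(n)])
    for _ in range(num_samples)
]

mean_of_means = statistics.fmean(sample_means)          # should be close to the population mean, 1
variance_of_means = statistics.pvariance(sample_means)  # should be close to 1 / n

print(mean_of_means, variance_of_means, 1 / n)
```

A histogram of `sample_means` would look roughly bell-shaped even though the individual draws are strongly right-skewed.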

Hypothesis Testing

the testing of a hypothesis (an idea that can be tested; a supposition or proposed explanation made on the basis of limited evidence as a starting point for further investigation).

ANOVA (Analysis of Variance)

a collection of statistical models and their associated estimation procedures used to analyze the difference among means. Based on the law of total variance, ANOVA provides a statistical test of whether two or more population means are equal.
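
As an illustration of how the total variance is split, a one-way ANOVA F statistic can be computed by hand as the between-group mean square divided by the within-group mean square. The three groups below are invented data, and this sketch covers only the one-way case.

```python
import statistics

# Hypothetical measurements for three groups.
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]

all_values = [x for group in groups for x in group]
grand_mean = statistics.fmean(all_values)

# Between-group sum of squares: how far each group mean lies from the grand mean.
ss_between = sum(len(g) * (statistics.fmean(g) - grand_mean) ** 2 for g in groups)
# Within-group sum of squares: spread of the values around their own group mean.
ss_within = sum((x - statistics.fmean(g)) ** 2 for g in groups for x in g)

df_between = len(groups) - 1
df_within = len(all_values) - len(groups)

f_statistic = (ss_between / df_between) / (ss_within / df_within)
print(f_statistic)
```

A large F value suggests that the group means differ by more than the within-group spread would explain; the decision itself requires comparing F against the F distribution with (df_between, df_within) degrees of freedom.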

Chi-Squared Analysis

a statistical hypothesis test that is valid to perform when the test statistic is chi-squared distributed under the null hypothesis. Used to determine whether there is a statistically significant difference between the expected frequencies and the observed frequencies in one or more categories of a contingency table.
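
The test statistic compares observed and expected frequencies as the sum of (observed - expected)^2 / expected. The sketch below uses invented counts from 120 rolls of a die and assumes a fair die as the null hypothesis.

```python
# Hypothetical observed counts for faces 1..6 over 120 rolls.
observed = [18, 22, 20, 25, 15, 20]

# Under the null hypothesis of a fair die, every face is expected equally often.
expected = [sum(observed) / len(observed)] * len(observed)

chi_squared = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
degrees_of_freedom = len(observed) - 1

print(chi_squared, degrees_of_freedom)
```

The statistic is then compared with the chi-squared distribution for the given degrees of freedom; a value well below the critical value gives no evidence against the null hypothesis.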

Standardization

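rescaling a variable by subtracting its mean and dividing by its standard deviation, so that the result has mean 0 and standard deviation 1. Applied to a normally distributed variable, it yields the standard normal distribution N(0,1).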

Z score

the standard score calculated by subtracting the population mean from an individual raw score and dividing the difference by the population standard deviation.
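
In code, the definition is a single expression. The function name `z_score` and the exam-score numbers are assumptions used only for this sketch.

```python
def z_score(raw_score: float, population_mean: float, population_sd: float) -> float:
    """Number of standard deviations a raw score lies above (+) or below (-) the mean."""
    return (raw_score - population_mean) / population_sd

# Example: a score of 85 in a population with mean 70 and standard deviation 10.
z = z_score(85, 70, 10)
print(z)
```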

Arithmetic mean, Median, Mode

average of data points, center of data points and data point that appears most frequently.

Range, Average Deviation, Variance

difference between the maximum and minimum data points; mean of the absolute deviations of the data points from the mean; mean of the squared deviations from the mean (the square of the standard deviation).
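
A short sketch of the three measures, treating a made-up data set as a full population (so the variance divides by the number of points rather than by one less).

```python
import statistics

# Hypothetical data, treated as a whole population.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean_value = statistics.fmean(data)

data_range = max(data) - min(data)
# Average deviation: mean of the absolute distances from the mean.
average_deviation = statistics.fmean([abs(x - mean_value) for x in data])
# Population variance: mean of the squared distances from the mean.
variance = statistics.pvariance(data)
# Standard deviation: square root of the variance.
standard_deviation = statistics.pstdev(data)

print(data_range, average_deviation, variance, standard_deviation)
```

For a sample rather than a population, `statistics.variance` and `statistics.stdev` divide by n - 1 instead.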

Standard deviation

number that indicates how much data points typically deviate from the mean; the square root of the variance.

Covariance

a measure of the joint variability of two variables.

Correlation

a standardized measure of covariance: the covariance of two variables divided by the product of their standard deviations, giving a unitless value between -1 and 1.
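
The relationship between the two measures can be sketched as follows. The helper names and the paired data are assumptions for illustration, and the population (divide by n) form is used throughout.

```python
from math import sqrt

def population_covariance(xs, ys):
    """Mean of the products of paired deviations from each variable's mean (hypothetical helper)."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / len(xs)

def correlation(xs, ys):
    """Covariance divided by the product of the two standard deviations (hypothetical helper)."""
    sd_x = sqrt(population_covariance(xs, xs))  # covariance of a variable with itself is its variance
    sd_y = sqrt(population_covariance(ys, ys))
    return population_covariance(xs, ys) / (sd_x * sd_y)

# Hypothetical paired observations.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

cov_xy = population_covariance(xs, ys)
corr_xy = correlation(xs, ys)
print(cov_xy, corr_xy)
```

Covariance depends on the units of both variables, while the correlation does not change when either variable is rescaled by a positive constant.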

Skewness

a measure of asymmetry that indicates whether the observations in a dataset are concentrated on one side.
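
One common way to quantify this is the population skewness: the mean of the cubed standardized deviations. The sketch below assumes that definition (other variants with sample corrections exist), and both data sets are invented.

```python
import statistics

def population_skewness(data):
    """Mean of cubed deviations from the mean, divided by the cubed standard deviation (hypothetical helper)."""
    mean_value = statistics.fmean(data)
    sd = statistics.pstdev(data)
    return statistics.fmean([(x - mean_value) ** 3 for x in data]) / sd ** 3

symmetric = [1, 2, 3, 4, 5]      # balanced around the mean
right_skewed = [1, 2, 3, 4, 10]  # one large value stretches the right tail

skew_symmetric = population_skewness(symmetric)
skew_right = population_skewness(right_skewed)
print(skew_symmetric, skew_right)
```

A value near 0 indicates a symmetric data set, a positive value a longer right tail, and a negative value a longer left tail.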

Probability Sampling

each element of the population has a known, nonzero chance of being selected for the sample. Ex. Simple random, stratified, cluster, and systematic sampling.
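
Three of these methods can be sketched with the standard `random` module. The population of 100 numbered units, the two strata, and the sample sizes are all assumptions made for illustration.

```python
import random

rng = random.Random(42)           # seeded generator so the run is repeatable
population = list(range(1, 101))  # hypothetical population of 100 numbered units

# Simple random sampling: every unit is equally likely to be chosen.
simple_sample = rng.sample(population, 10)

# Systematic sampling: random start, then every 10th unit.
step = len(population) // 10
start = rng.randrange(step)
systematic_sample = population[start::step]

# Stratified sampling: split into strata, then sample randomly within each.
strata = {"low": population[:50], "high": population[50:]}
stratified_sample = [unit for group in strata.values() for unit in rng.sample(group, 5)]

print(simple_sample, systematic_sample, stratified_sample)
```

Stratified sampling guarantees that each stratum is represented, which simple random sampling does not.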

Non Probability Sampling

the practice of sampling without the assurance that every element has a known chance of being selected. Ex. Convenience, voluntary response, snowball, quota, and purposive sampling.

Bias

the risk that a subset of a population will not accurately represent the overall population.

Week 4 Flashcards

(31 cards)