intro to stats Flashcards

Question

simulation vs. classical statistics

Answer 1

theoretical - any value of x^2 is possible, probability distribution is continuous simulation based - limited number of x^2 values possible - small number of possible outcomes when counting numbers of heads/tails from 100 coin tosses - probability distribution is discrete simulation based are better, only deal with possible outcomes - but classical stats are more widespread

Answer 2

- calculate difference between each score and a single point - difference between each score and the mean - DEVIATION SCORE (xi -x bar)

Answer 3

1. ignore the signs (mean absolute deviation) 2. remove the signs by squaring all deviation scores, calculating the average, then taking the square root (standard deviation

Answer 4

- outliers will distort estimates of s more than MAD- larger deviation scores get even larger when squared - MAD is more intuitive cause s is result of squaring, adding, square-rooting - in real datasets, MAD estimates from a sample may be better estimates of the underlying population parameter than s S - s is one of the parameters used to define the normal distribution, which is centrally important in classical statistics - Fisher (1920) demonstrated that in a perfect normal distribution, sample s is a better estimate of population standard deviation compared with sample MAD as an estimate of population MAD (s estimates its corresponding parameter better than MAD) s is the dominant measure of variation used in stats

Answer 5

s and variance (s^2) are primary measures of variation divide n-1 (degrees of freedom) s= dividing the sum of squares by the degrees of freedom, then taking the square root mean deviation score is calculated by summing the squared deviation scores, then dividing number of scores that vary, then taking the square root

Answer 6

first calculate sum of squares (SS): sum(xi-x bar)^2 df (number of things that can vary): n-1 Ex = n x x-bar purpose of s= generate estimate of average variation

Answer 7

natural variables commonly approx to normal distribution errors of measurement commonly approximate to the normal distribution means calculated from multiple samples drawn from pop will approx to normal distribution is a probability distribution - common use: derive probability that a score selected at random from normal-distributed pop will have a specific value

Answer 8

distribution y axis: ignore values - for most plots: interested in value of y that corresponds to a value of x - probability distrubtions: interested in area under the cure between two variables of x, and express it as the percentage of total area under the curve x-axis: number of standard deviations from the mean - standard deviation: average deviation from the mean

Answer 9

originated from attempts to stop disputes between gamblers distrubtion is an approx of the binomial distribution with a large number of trials (games) and can be calculated simply from mu and s combo of mathematical simplicity and usefulness of the normal distribution in modelling real variables and errors resulted in it holding a central position in classical statistics if we know a population mu and s, and know that the variable is normally distributed, we can easily estimate the probability that a score will be within a specific range of values

intro to stats Flashcards

(33 cards)