Midterm 1 Flashcards

1
Q

What is descriptive stats?

A

consists of methods for organizing and summarizing information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is inferential statistics?

A

consists of methods for drawing and measuring the reliability of conclusions about a population based on information obtained from a sample of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a population?

A

The collection of all individuals or items under consideration in a statistical study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a sample?

A

That part of the population from which information is actually obtained

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is an observational study?

A
  • researchers simply observe characteristics and take measurements, as in a sample survey - Observation studies can reveal association, but not causation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a designed experiment?

A
  • researchers impose treatments and controls and then observe characteristics and take measurements - Designed experiments (done properly) reveals both association and causation.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a census?

A

A survey that includes every member of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the issues with censuses?

A
  • If the population is large, it can be very costly and difficult (perhaps impossible) to collect information from every member of the population. -Since a census is usually too costly or takes too long, most statistical information is gathered by sampling or experimentation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is sampling?

A

collecting the information from a sample rather than the entire population. - Since the of sampling is to make decisions about the corresponding population, it is important that the results obtained from sampling closely match the results that we would obtain by conducting a census. - This means sampling must be done very carefully so as to obtain a representative sample. - One method of sampling is to try to choose elements of the population so each element has an equal chance of being included in the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is simple random sampling?

A

A sampling procedure for which each possible sample of a given size is equally likely to the one obtained.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a simple random sample?

A

A sample obtained by simple random sampling. - Using an SRS (simple random sample) is a common way of obtaining a representative sample. - Samples may be selected with or without replacement. - In sampling with replacement, each time an element is chosen from the population, it is put back in the population - thus any element may be chosen more than once for a sample. - In sampling without replacement , an element of the population is removed from the population once it has been chosen - thus any element can only appear only once in the sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a study?

A

The process of sampling a population, and collecting the information of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is raw data?

A

Data recorded in the sequence in which they are collected and before they are processed or ranked

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a variable?

A

A characteristic that varies from one individual to another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is qualitative or categorical variable?

A

A non-numerically valued variable Examples of qualitative (i.e. categorical) variables include eye colour, first letter in a persons last name, type of automobile a person drives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a quantitative variable?

A

A numerically valued variable. - Examples of quantitative variables include height, weight, age, speed of traffic (the various vehicles) at a certain location and time and number of stars that can be observed in a particular part of sky. - There are two types discrete and continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a discrete variable?

A

A quantitative variable whose possible values can be listed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a continuous variable?

A

A quantitative variable whose possible values form some interval of numbers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is data?

A

Values of a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is Qualitative or Categorical data

A

Values of a qualitative or categorical variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is quantitative data?

A

Values of a quantitative variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is discrete data?

A

Values of a discrete variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is continuous data?

A

Values of a continuous variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is a data set?

A

The collection of all observations for a particular variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is an observation?

A

Each piece of individual data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is frequency distribution for qualitative (categorical) data?

A

lists all categories and the number of elements that belong to each of the categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is relative frequency?

A

a category is obtained by dividing the frequency of that category by the sum of all frequencies - the relative frequency shows what fractional part or proportion of the total frequency belongs to the corresponding category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is the relative frequency distribution?

A

lists the relative frequencies for all categories. - Relative frequency must always add up to 1.00

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

How is a percentage of a category obtained?

A

by multiplying the relative frequency of that category by 100 -

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is a percentage distribution list?

A

lists the percentages for all categories - percentage must always add up to 100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is a bar graph or bar chart?

A

A graph made of bars whose heights represent the frequencies or relative frequencies or percent frequencies of respective categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is a pie chart?

A

A circle divided into portions that represent the relative frequencies or percentages of a population or a sample belonging to different categories - we multiply 360 by the relative frequency of each category to obtain the degree measure or size of the angle for the corresponding category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is a class, category or bin?

A

For quantitative data, an that includes all the values that fall within two numbers, the lower and upper limits - Each class (category, bin) is defined by an interval - these are chosen so every measurement in the data set falls into exactly one interval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is a frequency?

A

are the number of values that belong to different classes and are denoted by f

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is frequency distribution for quantitative data?

A

lists all the classes and the number of values that belong to each class. Data presented in the form of a frequency distribution are called .group data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is single value grouping?

A
  • In single-value grouping, each distinct number in the set of measurements forms a class. - Single-value grouping is appropriate for discrete data in where there are only a small number of distinct values. - Make a table of frequencies of each distinct value, in order from smallest to largest.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is limit grouping?

A
  • This method is appropriate for data. - In this method, we define the classes by providing numbers called class limits - For each class, there is a lower limit and upper limit, selected by the person doing the analysis. - The lower limit of a class is the smallest number that could be in the class. - The upper limit of a class is the largest number that could be in the class.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is lower class limit?

A

The smallest value that could go into a class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What is upper class limit?

A

The largest value that could go into a class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is class width?

A

The difference between the lower limit of a class and the lower limit of the next-higher class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is class mark?

A

The average of the two class limits of a class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What are the guidelines for grouping?

A
  1. The number of classes should be small enough to provide an effective summary but large enough to display the relevant characteristics of the data. Generally 5 to 20 classes are used. 2. Each observation must belong to one, and only one, class. 3. Whenever feasible, all classes should have the same width
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is cutpoint grouping?

A
  • This method is similar to Limit Grouping - define classes by choosing two values. - there is an important difference in how one of these values is defined. - This modification makes this method appropriate for continuous data although it may also be used for discrete data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is lower class cutpoint?

A

The smallest value that could go in a class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is upper class cutpoint?

A

The smallest value that could go in the next-higher class (equivalent to the lower cutpoint of the next-higher class)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

What is class width in cutpoint grouping?

A

The difference between the cutpoints of a class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What is class midpoint in cutpoint grouping?

A

The average of two cutpoints of a class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What is the relative frequency of a class?

A

is the frequency of that class divided the sum of all frequencies (i.e. the number of measurements).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What is a histogram?

A

displays the classes of the quantitative data on a horizontal axis and frequencies or relative frequencies or percent frequencies of those classes on a vertical axis. The frequency (relative, percent) of each class is represented by a vertical bar whose height is equal to the frequency (relative, percent) of that class. The bars should be drawn so that consecutive classes share a common side (bars are touching)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is single-value grouping in a histogram?

A

For single-value grouping, we use the distinct values of the observations to label the bars, with each such value centered under its bar.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What are other groupings in a histogram?

A

For limit grouping or cutpoint grouping, we use the lower class limits (or equivalently, lower class cutpoints) to label the bars.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

What are the different types of shapes of a histogram?

A
  • symmetric (identical on both sides of its central point) - skewed (is asymmetric. For a skewed histogram, the tail on one side is longer than the tail on the other side can be skewed to the left or the right) - uniform or rectangular (has the same frequency for each uniform rectangular histogram class)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What can make graphs misleading?

A
  1. Changing the scale 2. Truncating the frequency axis 3.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What is a stem and leaf display?

A

For quantitative data, each value is divided into two portions-a stem and a leaf. The leaves for each stem are shown separately in a display. - A common method for stem diagrams is to use the last digit of each data value as the leaves and all the preceding digits as stems - If one or more of the stems has too many leaves, the stem-and-leaf display doesn’t look good and may be difficult to fit on a page horizontally. - The remedy for this is to use two stems of the same value - the first stem having leaves with digits 0 through 4 and the second stem having leaves with digits 5 to 9.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

What is a dotplot?

A
  • All the data values from smallest to largest are listed on a horizontal scale. - A dot is placed above each data value for each occurrence of the data value -Dotplots can help us detect outliers (also called extreme values) in a data set. - Stem and Leaf and dotplots are easy ways to graph sets of data in a comparative way. - If you graph two or more sets of data with stem and leaf or dotplot, make sure the scales line up and scaling is identical so a comparison of the resulting graphs can be made
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What is an outlier or extreme value?

A

Values that are very small or very large relative to the majority of the values in a data set are called outliers or extreme values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

What is index notation?

A

Use of a letter usually x with a subscript to represent the data values x without a subscript refers to the entire data set and x subscript i would be an unspecified member of the data set

58
Q

What greek letter is the summation symbol?

A

sigma Σ

59
Q

What does a measure of central tendency provide?

A

The center of a histogram or a frequency distribution curve

60
Q

What is the mean of a sample?

A

sum of all the values divided by the number of values in the data set

61
Q

What is the median of a sample?

A

Is the midpoint of the distributed values - represented by M half the numbers are smaller than M and half the numbers are larger than M

62
Q

What is mode of a sample?

A

Find the frequency of each value in the data set. If no value occurs more than once, the data set has no mode. Otherwise, any value that occurs with the greatest frequency is the mode value of the dataset.

63
Q

Which is most sensitive to outliers - mean, mode or median?

A

mean

64
Q

What is deviation?

A

Measures the variation of the data

65
Q

What is the range of a sample?

A

The difference of the largest value to the smallest value: Range = Largest value - Smallest value

66
Q

What is standard deviation?

A

is the square root of the variance. Is used when you use mean

67
Q

What is variance?

A
68
Q

What is Chebyshev’s Theorem?

A

For any number greater than 1,

1-1/k2

at least of the data values lie with standard deviations of the mean. is the number of standard deviations.

69
Q

What is the empirical rule?

A

For a bell-shaped distribution, approximately

  1. 68% of the observations lie within one standard deviation of the mean,
  2. 95% of the observations lie within two standard deviations of the mean,
  3. 99.7% of the observations lie within three standard deviations of the mean.
70
Q

What is the quartiles of a sample?

A

are three numbers Q1 Q2 Q,3 and which show the locations where the data set is split into four equal parts.

71
Q

What is the five number summary?

A

The five number summary of a distribution consists of the smallest observation, the first quartile, the median, the third quartile and the largest observation, written in order from smallest to largest. In symbols the five number summary is

minimum Q1 M Q3 maximum.

72
Q

What is the interquartile range?

A

The interquartile range of a set of data is the difference of the third quartile to the first quartile.

IQR = Q3 - Q1

73
Q

How is the lower limit and upper limit of a data set calculated?

A

L = Q1 - 1.5xIQR

U=Q3 + 1.5xIQR

74
Q

What is a potential outlier?

A

An observation is suspected to be a potential outlier if it falls outside the lower and upper limits.

75
Q

What is an adjacent value?

A

The smallest value from the dataset greater than the lower limit and the largest value from the dataset less than the upper limit are called adjacent values.

76
Q

How is the mean of a population calculated?

A

Suppose N is the number of measurements taken of some quantitative variable for every individual in the population of interest. Let be the values of those measurements. Then the population mean value of these measurements is denoted by and is calculated by

77
Q

How is standard deviation of a population calculated?

A

Suppose N is the number of measurements taken of some quantitative variable for every individual in the population of interest. Let x1, x 2, x3 ….. xn be the values of those measurements and let be the population mean (calculated as above). Then the population standard deviation of these measurements is denoted by and is defined by

78
Q

What is a parameter?

A

A descriptive measure for a population (egs µ include and ð ).

79
Q

What is a statistic?

A

A descriptive measure for a sample (egs x include and s).

80
Q

What is the standardized version and standardized variable?

A

For a variable taken x from a population with mean µ and standard deviation σ the variable is called the standardized version of x or the standardized variable corresponding to the variable x.

81
Q

What is the z-score?

A

For an observed value of a variable x, the corresponding value of the variable z is called the of the observation. It also called the standard score.

The z-score of an observation is the number of standard deviations the observation is away from the mean.

A negative z-score indicates an observation is less than the mean.

A positive z-score indicates an observation is greater than the mean.

Based on Chebyshev’s theorem, at least 75% of the -scores of a dataset must be between -2 and 2 (within two standard deviations of the mean) and at least 89% must be between -3 and 3 (within three standard deviations of the mean).

82
Q

Explain experiment, outcomes and sample space

A

An experiment is a process that, when performed, results in one and only one of many observations. These observations are called the outcomes of the experiment. The set of all outcomes for an experiment is called a sample space.

83
Q

What graphical representation are used for sample space and the possible outcomes in that space?

A

(a) Venn diagram is a picture, in which all the possible outcomes of the experiment can be placed as dots within the rectangle.
(b) tree diagram in which each outcome is at the end of a branch of the tree

84
Q

What is an event?

A

is a collection of outcomes for the experiment, that is, any subset of the sample space of an experiment.

An event is said to occur if and only if the outcome of the experiment is a member of the event.

85
Q

Describe the intersection of events

A

Suppose S is a sample space with events A and B taken from S. If we form the set (i.e. the event) that has all the outcomes common to both the events A and B, this new event is called the intersection of the events A and B.

86
Q

Define mutually exclusive events

A

Suppose S is a sample space with events A and B taken from The events A and B are said to be mutually exclusive if the two events can never occur together. That is, if A occurs, then B does not occur and if the B event occurs, then the event A does not occur.

87
Q

Explain union of events

A

Suppose S is a sample space with events A and B taken from S. If all the outcomes of events A and B are combined into a new set, this new set is also an event called the union of events A and B.

88
Q

Explain compliment of events

A

Suppose S is a sample space with an event C, Then the compliment of the event C is the event of all the outcomes from sample space. that are not in event C. The compliment of C is written (not C),

89
Q

What is probability?

A

is a numerical measure of the likelihood that a specific event will occur.

90
Q

What is probability mode?

A

is the mathematical description of a random phenomenon consisting of two parts: sample space S and a way of assigning probabilities to the event

91
Q

What is the formula for the special addition rule?

A

Suppose we have a sample space S with events A and B which are mutually exclusive. Then

P(A or B)= P(A) + P(B).

92
Q

What is the complementation rule?

A

Suppose E is an event in sample space S then

P(E) = 1 - P (not E)

which can also be written as

P(E) + P(not E) = 1

93
Q

What is the general addition rule?

A

Suppose we have a sample space S with events A and B then

P (A or B) = P(A) + P(B) - P(A&B)

which can also be written as

P(A or B) + P(A&B) = P(A) + P(B)

94
Q

How do you calculate the probability of events?

A
95
Q

Define equiprobable

A

Suppose S is a sample space. If all the outcomes that make up sample space S have the same probability of occurrence, we say the sample is equiprobable

96
Q

How do you calculate the probability of events in an equiprobable sample space

A

P(A) = number of outcomes in A/number of outcomes in S

97
Q

What is classical probability?

A

In the classical probability model, it is assumed that the probabilities of outcomes for an experiment are all the same.

Earlier we had called this type of model equiprobable.

The classical model is used in many situations: flipping a coin, rolling a die, rolling dice, drawing one of several identical items from a container, bingo balls, etc.

98
Q

Explain relative frequency in relation to probability

A

There are many situations where the probabilities of outcomes are not equal.

Suppose we have an experiment with some set of outcomes and we can repeat the experiment as often as we want. By repeated observations, we note the frequency of each elementary outcome.

After many such observations, we define the probability of an outcome to be the ratio of the frequency of that outcome to the total number of experiments.

That is, the probability is the relative frequency of the occurrences of outcomes to total outcomes after the experiment has been done many, many times.

P(A) = f/n

99
Q

What is the law of large numbers?

A

If an experiment is repeated again and again, the probability of an event obtained from the relative frequency approaches the actual or theoretical probability

100
Q

What is subjective probability?

A

is the probability assigned to an event based on the subjective judgment, experience, information, and belief

101
Q

What are the basic properties of probability?

A

1: The probability of an event always lies in the range between
0 and 1. That is for any event A

0 <=P(A) <=1

  1. The probability of an event that cannot occur is zero.
  2. The probability of an event that must occur is one.
102
Q

What is marginal probability?

A

is the probability of a single event without consideration of any other event. Marginal probability is also called
simple probability.

103
Q

What is joint probability?

A

The probability of the intersection of two events

104
Q

How is joint probability calculated?

A

Using, set analysis if we know all the outcomes from both events, we can
nd the intersection set.

105
Q

What is conditional probability?

A

is the probability that an event will occur given that another event has already occurred in the same experiment. If A and B are two events, then the conditional probability of B
given A is written as

P (B|A)

and read as the probability of B given that A occurred

106
Q

What are independent events?

A

Two events are said to be independent if the occurrence of one does not affect the probability of the occurrence of the other. In other words, A and B are independent events if either

P(A|B) = P(A) or P(B|A) = P(B)

107
Q

What is the general rule of multiplication as it relates to probabilities?

A
  • The General Multiplication Rule is just a rearrangement of the formula for conditional probability given earlier.
  • Notice that the General Multiplication Rule relates the joint probability, marginal probability and conditional probability in one formula
  • Suppose we have a sample space S with events
    A and B. The general multiplication rule is

P(A&B) = P(A) P(B|A)

  • NOTE: Do not use the General Multiplication Rule as a way to find joint probabilities unless the conditional probabilities are already known or if the experiment involves multiple stages.
  • In the case of multiple stages, we can figure out the conditional probability by set analysis and then use the General Multiplication Rule to find the joint probability.
108
Q

What are two-way tables?

A
  • In some probability problems, a set of individuals is considered and two different pieces of information are extracted from each individual.
  • This creates two distinct sample spaces on the same set of individuals.
  • We wish to consider how to analyze probability problems with two sample spaces.
109
Q

What is the basic counting rule (BCR)?

A
  • If there is a situation in which we have several sets of items from which we choose one item each, the basic counting rule calculates all the possible ways these items can be selected.
  • Suppose there are actions r to be performed in a definite order. Further suppose that there are m1 ways to perform
    the first action, m2 ways to perform the second action and so on, up to mr ways to perform the last action. m1 , m2, m3 . .mr , are all positive integers (they may have different values and they can possibly equal 1). Then the number of possible ways the r actions can be performed is the product m1 x m2, x m3 x . . . x mr
110
Q

What are factorials?

A

The factorial of a positive integer n is written n! (read it as “n
factorial”) and is given by

n! = n x (n-1) x (n-2) x … x3 x 2 x 1

if n >= 1

Notice 1! = 1, The factorial of 0 is also defined given by 0! = 1

  • The factorial obeys this rule if n is an integer greater than 1 (this is called a recursive formula - it is often
    used to define the factorial).
111
Q

What is the permutation rule?

A
  • The number of possible permutations r of objects from a collection of m objects is given by the formula in the image
  • Used where there is a set from which we wish to select a subset of a certain size and for which the order in the subset matters.
  • Each choice for the ordered subset is called a permutation and we wish to calculate the number of possible permutations.
112
Q

What is the combination rule?

A
  • The third counting rule is also concerned with the number of ways of selecting items from a group of m distinct objects, but this time without regard to the order the items are listed.
  • This is similar to the permutations except we do not have to count the number of ways of sorting the items: thus number of combinations is less
    than the number of permutations
  • The number of possible combinations of r objects
    from a collection of m objects is given by the formula in the image
113
Q

What is another way to write mCr in the combination rule formula?

A
114
Q
A
115
Q

What is a random variable?

A

is a quantitative variable whose value

  • in some cases a random variable may be the outcomes from a sample space
  • in other cases the random variable involves an additional calculation using the outcomes from the sample space
116
Q

What is a discrete random variable?

A

A random variable that assumes countable values

  • in this class countable values mean a finite number of values
  • the upper case letter represent the value of an outcome of a random variable
117
Q

What is discrete probability distribution of a discrete random variable?

A

is a table which lists all the possible values that the
random variable can assume and their corresponding probabilities

118
Q

What is a discrete probability histogram?

A

is a graph of the probability distribution that displays the possible values of the discrete random variable on the horizontal axis and the probabilities of those values on the vertical axis. The probability of each value is represented by a vertical bar
whose height equals the probability.

119
Q

What is the interpretation of a probability distribution?

A

In a large number of independent observations of a random variable X, the proportion of times each possible value occurs will approximate the probability distribution of X; or, equivalently,
the proportion histogram will approximate the probability histogram for X

120
Q

Using mathematical notation for discrete probability how do you express less than 8?

A

P(X<=7)

121
Q

Using mathematical notation for discrete probability how do you express no more than 7?

A

P(X<=7)

122
Q

Using mathematical notation for discrete probability how do you express at most 7?

A

P(X<=7)

123
Q

Using mathematical notation for discrete probability how do you express 7 at least?

A

P(X<=7)

124
Q

Using mathematical notation for discrete probability how do you express more than 6 or at least 7?

A

P(X=>7) =1 - P(X<=6)

125
Q

Using mathematical notation for discrete probability how do you express no less than 7 or 7 or more?

A

P(X>=7) = 1 - P(X<=6)

126
Q

Using mathematical notation for discrete probability how do you express between 5 and 9 inclusively?

A

P(5<=X<=9) = P(X<=9) - P(X<=4)

127
Q

Using mathematical notation for discrete probability how do you express more than 5 but less than 9, between 5 and 9 not inclusively?

A

P(6<=X<=8) = P(X<=8) - P(X<=5)

128
Q

What are Bernoulli trials?

A

Repeated trials of an experiment if the following three conditions are satisfied:

  • the experiment (each trial), has two possible outcomes, denoted generically s for success and f for failure
  • the trials are independent
  • the probability of success, called the success probability, and denoted by p, remains the same from trial to trial
129
Q

What is binomial distribution?

A

is the probability distribution for the number of successes in a sequence of Bernoulli trials

Every binomial distribution has three numbers

  • n which is the number of trials and must be a positive integer
  • p which is the probability of a success on a single trial P(s)
  • 1 - p which is the probability of a failure of a single trial P(f)
  • The random variable for the binomial distribution is the number of successes x out of the possible outcomes given n and p
  • x must be a nonnegative integer up to and including n
130
Q

What is a continuous variable?

A

one in which there are an infinite number of possible values

e.g. height, time to take a test, weight etc.

131
Q

What is a polygon plot?

A

Lines connecting the centre of each rectangle on a histogram plot

132
Q

What is a density curve?

A

is a curve that:

  • is always on or above the horizontal axis and has exactly area equal to one between the graph and the horizontal axis.
  • A density curve is also called a probability distribution curve or a probability density function, since is used to find probabilities in the distribution of a continuous variable.

Two Characteristics of a Density Curve

  1. The probability that assumes a value in any interval lies in the range 0 to 1.
  2. The total probability of all the (mutually exclusive) intervals within x which can assume a value is 1.0.

The probability is measured as the area under the density curve

133
Q

What is normal distribution?

A

a normal distribution is described by a normal density curve which when plotted gives a bell shaped curve:

  1. The total area under the curve is 1.0.
  2. The curve is symmetric about the mean.
  3. The standard deviation of a Normal Distribution is the distance from the center to the change-of-curvature points on either side.
  4. The two tails of the curve extend indefinitely.

The mean, µ, and the standard deviation, ð, are called the parameters of the normal distribution. Given the values of these two parameters, we can find the area under
a normal distribution curve for any interval

134
Q

What is the 68-95-99.7 rule?

A

There are many possibilities for µ and ð, but all normal density curves satisfy some common properties

In a normal distribution with mean µ and standard
deviation ð

  • Approximately 68% of the observations fall within 1ð of the mean µ
  • Approximately 95% of the observations fall within 2ð of the mean µ
  • Approximately 99.7% of the observations fall within 3ð of the mean µ
135
Q

What is a z-score?

A

a standardized value

  • tells us how many standard deviations the original observation falls away from the mean, and in which direction
  • Observations larger than the mean are positive when standardized; observations smaller than the mean are negative; the mean becomes zero as a z-score
  • Since all normal distributions have the same properties, it definitely most convenient to work with just one fixed normal distribution.
  • The result of standardizing creates that one xed normal distribution.
136
Q
A
137
Q

What is standard normal distribution?

A

N (0,1) is called the standard normal distribution if the variable x has any normal distribution N(µ, ð) with mean µ and standard deviation ð then the standard variable z = x - µ/ð has the standard normal distribution

To find the area in the standard normal distribution, a Standard
Normal Table is available

138
Q

What is a parameter?

A

is a number that describes the population. In statistical practice, the values of a parameter is not known because we cannot examine the entire population.

139
Q

What is a statistic?

A

is a number that can be computed from the sample data without making any use of any unknown parameter. In practice, we often use a statistic to estimate value of a parameter

  • Statistics come from samples.
  • Parameter come from populations.

To keep this straight we use Greek letters to represent parameters of the population and Latin letters to represent statistics of samples.

e.g. For the standard deviation, the population is represented by ð while the standard deviation for the sample is s

140
Q

What is sampling distribution?

A

is the distribution of values taken by the statistic in all possible samples of the same size taken from the same population.

  • The sampling distribution is the ideal pattern that would emerge if we looked at all possible samples of size n from out population.
141
Q

What is a sampling error?

A

is the error that results from using
a sample statistic to estimate a population parameter

142
Q

What is the shape of population distribution?

A
  • The shape depends on the population distribution
  • if the distribution of the population is normal, the distribution of the mean will also be normal