About Data 1 Flashcards
Experiment, data collection, categories
What are the rows and columns of the data table called?
Rows (observations/cases)
Columns (variables)
What are the two sub classes of numerical variables and what do they mean?
Continuous - infinite choices
Discrete - finite choices
What are the two categorical variable sub classes and what do they mean?
Ordinal - has natural order
Regular- doesn’t have natural order
Associated vs independent variables?
Asssociated - two variables have a connection
Independent- no connection
Anecdotal evidence?
Grandma says lightning cures cancer cuz it happend to her
Problems with taking a census
Some are hard to locate
Complex
Population changes while cencus is being taken
“Tasting soup” exploratory analysis, infrence and representative.
Exploritory analysis - gathering data (tasting the soup)
Inference - to generalize your claims to the whole population
Representative - does your sample represent tge whole population (it needs to!)
Sampling bias from these- non response, voulentary response, convenience sample
Non response - if only a small fraction of randomly sampled people respond; the sample may no longer be representative of the population.
Voulentary response- only people who care to respond are those with strong opinions (npt representative)
Convineince sample- people who are more easily accessable are more likly to be in the sample
Explanitory variable and response variables
Its a suggestion to which one is influencing the other (does not mean it is causal)
Observational study
Data is collected in a way that does not effect how data comes “observes”
Experiment
subjects are assigned treatments to establish causal connections between explanatory and response variables
Co-founding variable
a variable which is correlated to the explanatory and response variables
Two types of observational studys?
prospective and retrospective studys
prospective study?
collects info as events unfold
retrospective study?
collects info after events have taken place
What are the four sampling methods?
simple random sampling
stratified “”
cluster “”
multistage “”
Simple random sample
random samples
Stratified sample
divides population into groups based on similar observations. Then takes random samples from each
Cluster sample
divides population into random groups then takes whole cluster samples from some randomly chosen groups
Multistage sample
make random clusters. then randomly chose clusters to sample. simple random sample within
Principles of experimental design (4) C R R B
Control-compare treated with control group
Randomize- random samples
Replicate-do the experiment many times by collecting a large sample
Block-assign groups into subdivisions to eliminate a third variable
Scatter plot
useful for visualizing the relationship between two numerical values
Dot plots and mean
Shows the mean along with dots grouped densely up in a single line
Sample statistic and point estimate
Sample statistic- data found from the sample
Point estimate- an estimation of the population