exam 1 Flashcards

Question 1

Q

statistics

Answer

A

a field of mathematics that develops and studies methods to collect, analyze, interpret, and present empirical evidence

Question 2

Q

empirical vs anecdotal evidence

Answer

A

empirical - information received from the observation or measurements of patterns using experimentation
anecdotal - evidence collected in a casual or informal manner that relies heavily on personal testimony or conclusions (not statistical data collection)

Question 3

Q

data

Answer

A

a collection of numerical facts or information from which conclusions can be drawn

Question 4

Q

raw data

Answer

A

unformatted data (numerical measurements, instrument readings, text) that has not been processed or analyzed

Question 5

Q

replicates

Answer

A

parallel measurements of a phenomenon to estimate variability in your sample (the number of replicates = n)

Question 6

Q

sampling effort

Answer

A

how much data do we need?

Question 7

Q

precision and accuracy

Answer

A

precision - how fine the divisions on a scale of measurement are
accuracy - how close to the truth our measurement is
(accuracy is the priority)

Question 8

Q

descriptive statistics

Answer

A

quantitative description of observations sampled from a population (mathematically summarizing patterns, data centers, and variability without making conclusions about overall meaning of data)

Question 9

Q

data distribution (histogram)

Answer

A

sampled populations arranged by rank order and graphically presented

Question 10

Q

normal distribution

Answer

A

an arrangement of data in which most values cluster in the middle of the range and the rest taper off symmetrically toward either extreme

Question 11

Q

log-normal distribution

Answer

A

data are clustered at low values, but there are some much higher values (positive skew)
(can be made normal by applying a logarithm function)

Question 12

Q

central tendency

Answer

A

numeric value describing a central position in a dataset. mean, median, and mode are valid measures

Question 13

Q

skew

Answer

A

positive, negative, or normal

Question 14

Q

central limit theorem

Answer

A

if a population with finite variants is sufficiently sampled, the mean of all the samples from the population will be = approximately equal to the mean of the population, AND the means from the samples will approach a normal distribution

Question 15

Q

main steps in the scientific method

Answer

A

planning - what are you going to do? learn the system, develop ideas about how the system works (maybe do a pilot study), decide hypothesis, figure out what data you will need
recording - collect and properly accord data, can take many forms, must record extremely carefully
analysis - interrogate data to test hypothesis, analysis cannot be successful if data gathering was not designed with analysis in mind, should allow you to accept or reject null
reporting - disseminating methods and media will depend on the type of work and audience, statistical results must be reported using proper conventions, graphs must be properly labelled

Question 16

Q

types of data

Answer

A

continuous - data that can take any value (usually measured)
discrete - numerical data that can take a limited number of values (often counted)
ordinal - data in categories that can be placed in order, but magnitude of difference between categories is not fixed
categorical - data in categories that can’t be usefully ordered

Question 17

Q

null and alternative hypothesis

Answer

A

null hypothesis - no change (Ho)
alternative hypothesis - what you want to show (Ha or H1)

Question 18

Q

sampling strategies

Answer

A

random - best choice
systematic - transects (sampling on a created line)
mixed - stratified random sampling
haphazardly - when you are unable to randomly sample because of practicality

Question 19

Q

mean, median, mode

Answer

A

mean - sum of observations is divided by number of observations in the sample
median - the middle score for the sampled data that has been arranged by order of magnitude
mode - the most frequent score in a sampled dataset
(equations)

Question 20

Q

data in quartiles

Answer

A

divide data into quarters and use five number summary
steps -
rank data from smallest to largest
smallest is first number, largest is 5th
median is third
middle of first and third is second, middle of fifth and third is fourth

Question 21

Q

dividing n-1 to calculate variance

Answer

A

penalty for having a small amount of replicates

Question 22

Q

shapiro-wilk test and how to interpret

Answer

A

takes a data distribution and determines whether it is significantly different to normal
p-value of <.05 = not normal, reject Ho

Question 23

Q

standard error of the mean (SEM) (def and equation)

Answer

A

estimate of how close the sample mean is compared to the true population mean
standard deviation of resampled mean
=Sx/sqrt n

Question 24

Q

types of project

Answer

A

descriptive -
differences - is a different to b, bar charts and box and whisker plots, categorical variable and want to know if the response variable differs between categories
correlations - links between variables, usually quantitative variables are independent and quantitative variables are dependent
associations - similar to correlations but with categorical data

Question 25

Q

how to calculate mean

Answer

A

bar x = (E^n i=1 * xi)/n

Question 26

Q

how to calculate median

Answer

A

the middle value

Question 27

Q

how to calculate mode

Answer

A

most often occurring data

Question 28

Q

how to calculate range

Answer

A

rank-order observations - highest - lowest

Question 29

Q

how to calculate variance

Answer

A

=(E^n i=1(xi - bar x)^2)/n-1 OR = SS/n-1

Question 30

Q

how to calculate standard deviation

Answer

A

= sqrt(E^n i=1 (xi - bar x)^2/n-1) OR = sqrt (SS/n-1)

Question 31

Q

how to calculate standard error

Answer

A

=Sx/sqrt n

Question 32

Q

copy>paste special> paste values in excel

Answer

A

makes values the actual number rather than the equation in a cell

Question 33

Q

$ in excel

Answer

A

keeps a number the same to make a cell value unchangeable

Question 34

Q

sum in excel

Answer

A

=SUM(array)

Question 35

Q

count in excel

Answer

A

=COUNT(array)

Question 36

Q

mean in excel

Answer

A

=AVERAGE(array)

Question 37

Q

median in excel

Answer

A

=MEDIAN(array)

Question 38

Q

mode in excel

Answer

A

=MODE(array)

Question 39

Q

variance in excel

Answer

A

=VAR(array)

Question 40

Q

standard deviation in excel

Answer

A

=STDEV(array)

Question 41

Q

standard error of the mean in excel

Answer

A

=AVERAGE(array)/SQRT(count(array))

Question 42

Q

list in R

Question 43

Q

remove in R

Answer

A

rm(objectname)

Question 44

Q

quit in R

Question 45

Q

import .csv in R

Answer

A

read.csv(file.choose())

Question 46

Q

sum of values in a column in R

Answer

A

> sum(objectname$variablename)

Question 47

Q

number of values in a column in R

Answer

A

> length(objectname$variablename)

Question 48

Q

mean of values in a column in R

Answer

A

> mean(objectname$variablename)

Question 49

Q

median of values in a column in R

Answer

A

> median(objectname$variablename)

Question 50

Q

quartiles in R

Answer

A

> summary(objectname$variablename)

Question 51

Q

plot quartiles as boxplot in R

Answer

A

> boxplot(objectname$variablename)

Question 52

Q

plot histogram in R

Answer

A

> hist(objectname$variablename)

Question 53

Q

variance in R

Answer

A

> var(objectname$variablename)

Question 54

Q

standard deviation in R

Answer

A

> sd(objectname$variablename)

Question 55

Q

shapiro-wilk test for normality in R

Answer

A

> shapiro-test(objectname$variablename)