Research Methods Flashcards
What is Simpson’s Paradox
When aggregate data does shows a bias but disaggregated data doesn’t, or does but in the other direction.
What is aggregate data vs disaggregate data
aggregate = average disaggregate = all data (not averaged data)
what do you need to do with negative numbers in R
put them in parenthesis
eg. -1 ^ 2
should be
(-1)^2
what does an exclamation mark mean
not
what are the three types of rounding functions
round( )
floor( )
ceiling( )
differentiate ‘=’ and ‘==’
== is used for a comparison –> like a question
= is a command and exists for the purpose of the current function
= is comparable to
what is a numeric variable
a numbered variable. henryage
what is a character variable
a word variable. food
what is a logical variable
true/false basis. isFurry
variable titles cant have what?
spaces
what is the hierarchy of vectors
character > numerical > logical
what is a package
a collection of R variables
what are theoretical constructs and how do they relate to the two other elements
unobservable psychological entities which are then operationalised into a measure - a tool designed to obtain data. Then turned into data
what are the three types of scales
nominal –> no set data order eg. eye colour –> discrete
ordinal –> ranked data, where the difference between data points is hard to tell –> continuous
interval –> numbers eg. dates
what do you use to find/make column lists
$
how do you remove a data point
make it equal to ‘NULL’
how do you make a vector
c( )
what does the “here” package do
finds the root of your projects based on the current RProj file
basic codes for how to make tables
table( ) or for a neater option kable( )
what does echo = FALSE do
ensures that we don’t see the r code in the markdown document
what is the function to name columns
col.name = c(“x”, “y”)
what does pipes do and what is the symbol for it
makes it so you don’t have to make new variables each time.
symbol: %>%
grouping allows you to
move beyond nominal variables
what is the difference between group_by( ) and summarise ( )
group_by( ) groups variables in a dataset
summarise( ) tells us what kinds of stats to create over the groups