Data Manipulation in R Flashcards
This is a vectorized conditional function we use in R.
ifelse
Is calculated by taking the sum of the values and dividing by the number of values in a data series.
Mean
Is a measure of how far away items in a data set are from the mean.
Standard deviation
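As a quick check on the two definitions above, here is a minimal sketch that computes both by hand and compares them with R's built-in mean and sd. The vector x is hypothetical sample data. Note that sd uses the sample formula, which divides by n - 1 rather than n.

```r
# Hypothetical sample values, chosen only for illustration
x <- c(2, 4, 4, 4, 5, 5, 7, 9)

# Mean: sum of the values divided by the number of values
manual_mean <- sum(x) / length(x)

# Sample standard deviation: square root of the summed squared
# deviations from the mean, divided by n - 1 (as sd() does)
manual_sd <- sqrt(sum((x - manual_mean)^2) / (length(x) - 1))

# Both comparisons should report TRUE
all.equal(manual_mean, mean(x))
all.equal(manual_sd, sd(x))
```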
If there are missing values, what do the mean and sd functions return by default?
NA
This function is used to produce the mean, median, minimum, maximum, and 1st and 3rd quartiles.
summary
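A small illustration of summary on a numeric vector, using made-up values. The single large value shows how the mean and median can differ.

```r
# Hypothetical values with one large outlier
x <- c(1, 2, 3, 4, 100)

# Returns Min., 1st Qu., Median, Mean, 3rd Qu., Max.
s <- summary(x)
print(s)

# Individual statistics can be pulled out by name
s[["Median"]]  # 3
s[["Mean"]]    # 22
```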
This package allows you to use the forward-pipe chaining operator (%>%, originally from the magrittr package) for combining multiple operations.
dplyr
Use _____ to ______ data with the required condition.
filter, filter
This function is used to reorder rows.
arrange
This function is used to pick variables (columns).
select
This function is used to change the name of the column(s).
rename
This function is used to add new variables that are functions of existing variables.
mutate
This function is used to reduce the data to summary values, such as a mean or a count.
summarise
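The dplyr verbs on the cards above can be chained with %>%. The sketch below is one possible pipeline, assuming dplyr is installed; it uses the built-in mtcars data set, and the new column names (weight, hp_per_wt, mean_mpg) are made up for illustration. group_by is not on the cards but is commonly paired with summarise.

```r
library(dplyr)

result <- mtcars %>%
  filter(cyl == 4) %>%                 # keep rows meeting a condition
  select(mpg, wt, hp) %>%              # pick columns
  rename(weight = wt) %>%              # rename a column (new = old)
  mutate(hp_per_wt = hp / weight) %>%  # add a column derived from existing ones
  arrange(desc(mpg))                   # reorder rows, highest mpg first

# Reduce the data to one summary row per group
by_cyl <- mtcars %>%
  group_by(cyl) %>%
  summarise(mean_mpg = mean(mpg), n = n())

print(result)
print(by_cyl)
```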
These two functions are used to take random samples of rows.
sample_n and sample_frac
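A short sketch of random sampling with dplyr's sample_n and sample_frac (underscores, not dots), assuming dplyr is installed and using the built-in mtcars data set. The seed value is arbitrary and only makes the draw repeatable. Recent dplyr versions also offer slice_sample for the same purpose.

```r
library(dplyr)

set.seed(42)  # arbitrary seed so the draw is repeatable

five_rows <- sample_n(mtcars, 5)        # a fixed number of rows
quarter   <- sample_frac(mtcars, 0.25)  # a fraction of the rows (25% of 32)

nrow(five_rows)
nrow(quarter)
```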
To drop the missing values when computing a statistic such as the mean or sd, what argument do you use?
na.rm = TRUE
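To see the effect of missing values, here is a minimal example with a hypothetical vector containing one NA. Without na.rm = TRUE, both mean and sd return NA.

```r
# Hypothetical data with one missing value
x <- c(10, 20, NA, 40)

default_mean <- mean(x)                # NA, because x contains a missing value
default_sd   <- sd(x)                  # NA for the same reason

clean_mean <- mean(x, na.rm = TRUE)    # computed from 10, 20, 40 only
clean_sd   <- sd(x, na.rm = TRUE)

print(c(default_mean, clean_mean))
```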
How do we write the syntax of an if-else statement?
ifelse(<condition>, <true>, <false>)
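For example, ifelse checks the condition element by element and returns a vector of the same length. The scores and the cutoff of 60 below are made up for illustration.

```r
# Hypothetical exam scores
scores <- c(45, 72, 88, 59)

# Label each score: "pass" where the condition is TRUE, "fail" otherwise
result <- ifelse(scores >= 60, "pass", "fail")

print(result)  # "fail" "pass" "pass" "fail"
```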