R Flashcards

Question

data frame

Answer 1

collection of columns

Answer 2

dataframes in the tidyverse you can't change the type of info (number - string)

Answer 3

mutate(dataframe, column_new = column*100)

Answer 4

install.packages("tidyverse")

Answer 5

load it: library(tidyverse)

Answer 6

only pull up first 10 rows of a dataset. Never change the names of your variables, or the data types of your inputs. Part of tidyverse

Answer 7

read_csv()

Answer 8

bookings_df

Answer 9

mutate(new_df, total= 'adr'/adultsread

Answer 10

makes summarizing data really easy, lets you skim through it more quickly

Answer 11

has functions for cleaning data

Answer 12

skim_without_charts(), glimpse(), head(), str(), select()

Answer 13

skimr and janitor

Answer 14

specifies certain columns or excludes columns

Answer 15

penguins %>% | select( - species)

Answer 16

penguins %>% | rename(island_new = island)

Answer 17

rename_with(penguins, toupper) (or tolower)

Answer 18

ensures only characters, numbers and underscores in the names

Answer 19

returns remainder after division

Answer 20

returns an integer value after division (5%/%2=2)

Answer 21

arithmetic, relational, logical, assignment

Answer 22

compares only first numbers in the vectors (x

Answer 23

logical NOT

Answer 24

chooses what variable you want to sort by

Answer 25

penguins %>% | arrange( - bill_length)

Answer 26

assigning a name to something

Answer 27

group_by()

Answer 28

drop_na( )

Answer 29

penguins %>% group_by(island) %>% summarize (mean_bill_length_mm = mean (bill_length_mm)) (or replace mean with max)

Answer 30

penguins %>% group_by(species, island) %>% | summarize(max_bl=max(bill_length_mm), mean_bl = mean(bill_length_mm)

Answer 31

penguins %>% | filter (species == Adelie)

Answer 32

install.packages(tidyverse, skimr, janitor

Answer 33

bookings_df

Answer 34

trimmed_df

Answer 35

1. rename: (to rename columns) dataframe %>% rename(column_new = column) 2. unite: dataframe %>% unite (column1_2, c("column1", "column2"), sep = " ") 3. mutate: (adds a column) dataframe % mutate(guests = babies+children+adults) 4. summarize (newcolumn= mean(column), newcolumn1 = sum(column1)

Answer 36

separate( ) unite ( ) mutate ( )

Answer 37

separate( dataframe, column, into = c(newcolumn1, newcolumn2), sep = " ")

Answer 38

unite (dataframe, "newcolumn", column1, column2,

Answer 39

dataframe %>% | mutate(new_column = column/1000, new_column2 = column2/1000)

Answer 40

pivot_longer( ), pivot_wider( )

Answer 41

clean_names( )

Answer 42

SimDesign package, bias(actual, predicted)

Answer 43

arrange(hotel_bookings, desc(lead_time))

Answer 44

max(hotel_bookings$lead_time) | min (hotel_bookings$lead_time)

Answer 45

mean(hotel_bookings$lead_time)

Answer 46

new_hotel_dataframe

Answer 47

hotel_summary % group_by (hotel) %>% summarise (average_lead_time = mean(lead_time) max_lead_time = max(lead_time) min_lead_time = min (lead_time)

Answer 48

arrange( ), group_by( ), filter( )

Answer 49

rename_with(dataframe, tolower)

Answer 50

aesthetics, geoms, facets, labels, and annotations

Answer 51

install.packages("palmerpenguins") library("palmerpenguins") data(penguins) View(penguins)

Answer 52

geom_point and geom_bar

Answer 53

ggplot(data=penguins) + geom_point(mapping = aes(x=flipper_length_mm, y=body_mass_g))

Answer 54

a geometric object used to represent your data (points, bars, lines and more)

Answer 55

a visual property of an object in your plot (position, color, shape or size)

Answer 56

matching up a specific variable in your dataset with a specific aesthetic

Answer 57

1. start with ggplot function and choose a dataset 2. add a geom_ function to display your data 3. map the variables you want to plot in the arguments of the aes( ) function

Answer 58

x,y, color, shape, size, alpha (transparency)

Answer 59

geom_smooth

Answer 60

linetype = (species)

Answer 61

geom_jitter

Answer 62

only put outlines of the color around the bars, the "fill" aesthetic will fill in the color

Answer 63

ggplot(data, aes(x= , y= )) + geom_point() + geom_smooth (method = "loess")

Answer 64

ggplot(data, aes(x= , y= )+ geom_point() + geom_smooth (method = "gam", ...)

Answer 65

let you display smaller groups, or subsets, of your data

Answer 66

facet_wrap, facet_grid

Answer 67

let's us create a separate plot for each species

Answer 68

facet_grid | vertically by the first variable, and horizontally by the second variable

Answer 69

tilda symbol

Answer 70

theme(axis.text.x = element_text(angle = 45)

Answer 71

labs(title="Palmer Penguins", subtitle="3 Species", caption = "collected by Dr.")

Answer 72

annotate function

Answer 73

annotate("text", x=50, y=50, label= "The largest", fontface="bold", size=4.5, angle=25)

Answer 74

1. Explort | 2. ggsave("---.png")

Answer 75

min(hotel_bookings$arrival_date_year)

Answer 76

subtitle=paste0("Data from: ", mindate, " to ", maxdate))

Answer 77

ggsave("---.png", width=7, height=7)

Answer 78

file format for making dynamic documents with R

Answer 79

a syntax for formatting plain text files

Answer 80

lets users run your code and show tha graphs and charts that visualize the code

Answer 81

The set of markup symbols or codes used to create a webpage

R Flashcards

(110 cards)