Reshaping, joining and visualising data Flashcards

week 2

1
Q

Which function lets you summarise and filter by subgroups of the data?

A

group_by()
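
A minimal sketch of how this might look in practice, assuming the dplyr package is installed. The `scores` data frame and its column names are hypothetical, made up for illustration. Note that group_by() only marks the groups; the per-group work is done by the verb that follows it.

```r
library(dplyr)

# Hypothetical example data: scores for two subgroups.
scores <- data.frame(
  group = c("a", "a", "b", "b"),
  score = c(1, 3, 10, 20)
)

# summarise() after group_by() returns one row per group.
group_means <- scores %>%
  group_by(group) %>%
  summarise(mean_score = mean(score))

# filter() after group_by() evaluates its condition within each group:
# keep only rows that score above their own group's mean.
above_group_mean <- scores %>%
  group_by(group) %>%
  filter(score > mean(score)) %>%
  ungroup()
```

Calling ungroup() at the end is a common precaution so that later steps do not silently keep operating per group.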

1
Q

There are two main types of join

A

Mutating joins add new columns from the additional dataset.

Filtering joins keep or drop existing rows based on whether they have a match in the additional dataset. They do not add any columns.
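
The difference can be sketched as follows, assuming dplyr is installed. The `people` and `pets` tables are hypothetical. semi_join() and anti_join() are dplyr's filtering joins; they are not named on this card and are used here only as an illustration.

```r
library(dplyr)

# Hypothetical data: people, and pets keyed by their owner's id.
people <- data.frame(id = c(1, 2, 3), name = c("Ann", "Bob", "Cy"))
pets   <- data.frame(id = c(1, 3), pet = c("cat", "dog"))

# Mutating join: keeps every row of people and adds the pet column.
with_pets <- left_join(people, pets, by = "id")

# Filtering joins: same columns as people, but fewer rows.
owners     <- semi_join(people, pets, by = "id")  # rows with a match in pets
non_owners <- anti_join(people, pets, by = "id")  # rows without a match in pets
```

Here `with_pets` gains a column, while `owners` and `non_owners` keep exactly the columns of `people`.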

2
Q

There are four separate mutating joins

A

left_join(x, y) returns all rows in x, with columns from y added where a row in y matches. Rows in x with no match in y get NA in the new columns.

right_join(x, y) returns all rows in y, with columns from x added where a row in x matches. Rows in y with no match in x get NA in the new columns.

inner_join(x, y) returns only the rows that have a match in both x and y.

full_join(x, y) returns all rows in x and all rows in y. Any row in x or y that doesn't have a match is filled with NA in the columns from the other table.
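
The four joins can be compared side by side with a small sketch, assuming dplyr is installed. The tables `x` and `y` and their columns are hypothetical: key 1 appears only in x and key 4 only in y.

```r
library(dplyr)

# Hypothetical tables sharing a key column.
x <- data.frame(key = c(1, 2, 3), val_x = c("x1", "x2", "x3"))
y <- data.frame(key = c(2, 3, 4), val_y = c("y2", "y3", "y4"))

left  <- left_join(x, y, by = "key")   # keys 1, 2, 3; val_y is NA for key 1
right <- right_join(x, y, by = "key")  # keys 2, 3, 4; val_x is NA for key 4
inner <- inner_join(x, y, by = "key")  # keys 2 and 3 only
full  <- full_join(x, y, by = "key")   # keys 1, 2, 3, 4
```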

3
Q

There are three interrelated rules which make a dataset tidy

A
  1. Each variable should be a column.
  2. Each observation should be a row.
  3. Each value should have its own cell.
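
As an illustration of the three rules, here is a sketch that reshapes a hypothetical untidy table, assuming the tidyr package is installed. pivot_longer() is not named on this card and is one possible tool among others. In `untidy`, the year variable is hidden in the column names, so one row holds two observations.

```r
library(tidyr)

# Hypothetical untidy table: one column per year instead of a year variable.
untidy <- data.frame(
  country = c("A", "B"),
  y2019 = c(10, 20),
  y2020 = c(11, 21)
)

# Move the year columns into a single "year" variable with one value column,
# giving one row per country-year observation.
tidy <- pivot_longer(
  untidy,
  cols = c(y2019, y2020),
  names_to = "year",
  values_to = "cases"
)
```

After reshaping, `tidy` has the columns country, year and cases, with one row for each country in each year.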