Data Flashcards

1
Q

TIDY data

A
Filter
Transform
Aggregate
Sort
Join/Merge (Inner/Outer)
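
As a rough illustration of the five operations above, here is a minimal sketch in plain Python. The `orders` and `customers` data, the field names, and the threshold of 10 are all hypothetical and chosen only for the example; a real workflow would more likely use a dataframe library or SQL.

```python
from collections import defaultdict

# Hypothetical sample data; names and values are made up for illustration.
orders = [
    {"order_id": 1, "customer_id": "a", "amount": 20.0},
    {"order_id": 2, "customer_id": "b", "amount": 5.0},
    {"order_id": 3, "customer_id": "a", "amount": 12.5},
    {"order_id": 4, "customer_id": "c", "amount": 40.0},
]
customers = {"a": "Ana", "b": "Ben"}  # customer "c" has no matching record

# Filter: keep only orders of at least 10.
filtered = [o for o in orders if o["amount"] >= 10]

# Transform: add a derived field (amount in cents).
transformed = [{**o, "amount_cents": int(round(o["amount"] * 100))} for o in filtered]

# Aggregate: total cents per customer.
totals = defaultdict(int)
for o in transformed:
    totals[o["customer_id"]] += o["amount_cents"]

# Sort: largest total first.
ranked = sorted(totals.items(), key=lambda kv: kv[1], reverse=True)

# Join/Merge: an inner join keeps only customers found in both sources;
# a (left) outer join keeps unmatched rows and fills the gap with None.
inner = [(customers[cid], total) for cid, total in ranked if cid in customers]
left_outer = [(customers.get(cid), total) for cid, total in ranked]
```

Here customer "c" drops out of `inner` but stays in `left_outer` with a name of `None`, which is the practical difference between the two join types.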
2
Q

Common Data Problems

A
In different source systems
Messy:
Missing
Invalid values
Errors
Different levels
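
One way to tell missing, invalid, and erroneous values apart is sketched below. The records, the `age` field, and the 0 to 120 validity range are assumptions made up for this example, not rules from the card.

```python
# Hypothetical records; field names and validity rules are assumptions.
rows = [
    {"id": "1", "age": "34"},
    {"id": "2", "age": ""},     # missing: nothing was recorded
    {"id": "3", "age": "-5"},   # invalid: a number, but outside the allowed range
    {"id": "4", "age": "abc"},  # error: not a number at all
]

def classify_age(value):
    """Label a raw age string as 'ok', 'missing', 'invalid', or 'error'."""
    if value is None or value.strip() == "":
        return "missing"
    try:
        age = int(value)
    except ValueError:
        return "error"
    return "ok" if 0 <= age <= 120 else "invalid"

# Map each record id to the label for its age value.
report = {row["id"]: classify_age(row["age"]) for row in rows}
```

Keeping the three labels separate makes it easier to decide later whether a value should be filled in, corrected, or dropped.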
3
Q

Common Data Types

A

Flat File
-Not used as much now
-Each field is placed in a fixed position (e.g. first 5 bytes of the file)
CSV
-Values are separated by commas (very common, as many systems can export a CSV)
Delimited File
-Pipe (example: Airbnb)
-Tab, etc.
Proprietary (e.g. SAS, SPSS, Workday etc.)
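
The text-based formats above can be read with Python's standard library, as in the sketch below. The sample strings and the fixed-width layout (id in the first 5 characters, name in the next 10) are hypothetical; proprietary formats are left out because they generally need vendor-specific tools.

```python
import csv
import io

# Flat (fixed-width) file: each field sits at a fixed position.
# Assumed layout: id = characters 0-4, name = characters 5-14.
flat_line = "00042Ana       "
flat_record = {"id": flat_line[0:5], "name": flat_line[5:15].strip()}

# CSV: values separated by commas, first row used as the header.
csv_text = "id,name\n42,Ana\n"
csv_records = list(csv.DictReader(io.StringIO(csv_text)))

# Delimited files: same reader, different delimiter.
pipe_text = "id|name\n42|Ana\n"
pipe_records = list(csv.DictReader(io.StringIO(pipe_text), delimiter="|"))

tab_text = "id\tname\n42\tAna\n"
tab_records = list(csv.DictReader(io.StringIO(tab_text), delimiter="\t"))
```

Note that every value comes back as a string, so the fixed-width id keeps its leading zeros while the delimited ids do not have any to begin with; type conversion is a separate step.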
