Data Preparation Flashcards
order of operations with data
- gather raw data
- sort and filter
- format data
- remove duplicates
- summarize in better ways
how to remove duplicates
data- remove duplicates
fill a whole new column with just the first info before the column if the content looks like this: mirutzi suzuki, aos
write out content for 1-3 cells then, data - flash fill
how to create a column called Car ID?
write one under new column, car id, then highlight all the cells, then home- fill- series - series in: columns, type: linear, etc
import data
data - get data - From Text/CSV
choose table style that’s all white
click any cell - table design - table styles - etc
how do you highlight the duplicate values in the Product Id column in light red with dark red font.
highlight column - home - styles - cond. formatting - highlight cells rules - duplicate values
make a note of how many duplicates have been removed
count (bottom-right of excel window, or use =COUNTA(range) in an empty cell
tips to data prepare
create a lookup sheet that you can use for data prep
You have a column called Order Date that shows the date the order was placed with DataCo from 2015 to 2018. You might eventually want to add more date information to help with your analysis. A lookup sheet would be useful for this. how do you create a data lookup sheet?
create new sheet called Dates
Using the advanced Fill Series feature, how do I populate the Date column to show all dates from “1/1/2015” to “12/31/2018”, ensuring that every calendar day is included in the range.
click cell with 1/1/2015, then home - editing group - fill - series - stop value field 12/31/2018