CSE 6040 - 7.0 - Tidy Data Concepts Flashcards

1
Q

What is Tidy Data?

A variable forms a what?

Each observation forms a what?

Each type of observational unit forms a what?

A

Tidy data is a standard way of mapping the meaning of a dataset to its structure. In tidy data:

  1. Each variable forms a column.
  2. Each observation forms a row.
  3. Each type of observational unit forms a table.
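
As an illustration of the three rules above, here is a minimal sketch that reshapes an untidy table into a tidy one with pandas. The table contents and the column names (`country`, `year`, `cases`) are made up for this example; they do not come from the course material.

```python
import pandas as pd

# Hypothetical untidy table: the years are values of a variable,
# yet they appear as column headers.
untidy = pd.DataFrame({
    "country": ["A", "B"],
    "1999": [10, 20],
    "2000": [15, 25],
})

# Melt the year columns so that each variable (country, year, cases)
# forms a column and each (country, year) observation forms a row.
tidy = untidy.melt(id_vars="country", var_name="year", value_name="cases")
tidy["year"] = tidy["year"].astype(int)

print(tidy)
```

After the melt, every row is a single observation, which is the shape the definition asks for.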
2
Q

A Data Frame representation is better suited for __________.

A

A Data Frame representation is better suited for regression.
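
The card does not say why a data frame suits regression. One plausible reading is that each column is a variable, so the predictor and the response can be picked out by name. A minimal sketch under that assumption, using made-up data and NumPy's least-squares solver:

```python
import numpy as np
import pandas as pd

# Hypothetical tidy data frame: one row per observation,
# one column per variable. The data follow y = 1 + 2x exactly.
df = pd.DataFrame({
    "x": [0.0, 1.0, 2.0, 3.0],
    "y": [1.0, 3.0, 5.0, 7.0],
})

# Design matrix: a column of ones for the intercept, then the predictor.
X = np.column_stack([np.ones(len(df)), df["x"].to_numpy()])
y = df["y"].to_numpy()

# Ordinary least squares fit of y on x.
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
intercept, slope = coef
print(intercept, slope)
```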

3
Q

Wickham defines a tidy data set as one that can be organized into a 2-D table such that

each column represents a variable;

each row represents an observation;

each entry of the table represents a single value, which may come from either _______ (discrete) or ________ spaces.

A

Wickham defines a tidy data set as one that can be organized into a 2-D table such that

each column represents a variable;

each row represents an observation;

each entry of the table represents a single value, which may come from either categorical (discrete) or continuous spaces.
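
A small sketch of the categorical versus continuous distinction, assuming pandas is used to hold the table. The column names and values are hypothetical.

```python
import pandas as pd

# Hypothetical tidy table with one categorical and one continuous variable.
df = pd.DataFrame({
    "species": ["cat", "dog", "cat"],   # categorical (discrete) values
    "weight_kg": [4.2, 11.5, 3.9],      # continuous values
})

# Mark the discrete column explicitly as categorical.
df["species"] = df["species"].astype("category")

print(df.dtypes)
```

Each entry is still a single value; only the space it is drawn from differs between the two columns.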

4
Q

If a table is tidy, we will call it a ____ _____, or ______, for short.

A

If a table is tidy, we will call it a tidy table, or tibble, for short.

5
Q

Identify how a computer scientist with a machine learning outlook might refer to this picture.

A

Columns as features

rows as data points

especially when all values are numerical (ordinal or continuous)
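
To make the features and data points view concrete, here is a minimal sketch that converts an all-numeric tidy table into a matrix. The variable names (`tibble`, `height`, `width`) are illustrative assumptions, not taken from the course.

```python
import pandas as pd

# Hypothetical all-numeric tidy table.
tibble = pd.DataFrame({
    "height": [1.0, 2.0, 3.0],
    "width": [4.0, 5.0, 6.0],
})

# Rows become data points and columns become features.
X = tibble.to_numpy()
n_points, n_features = X.shape
print(n_points, n_features)
```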
