Exploratory data analysis Flashcards

1
Q

Define visual analytics:

A

The science of analytical reasoning facilitated by INTERACTIVE visual interfaces

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do you want to discover with visualizations when doing visual analytics? (4)

A
  1. Data structure
  2. Parameters
  3. Relationships among variables
  4. Properties of the data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

List the tasks of exploratory analysis: (6)

A
  1. Making sense of data
  2. Discover facts
  3. Reveal patterns, outliers and trends
  4. Find correlations
  5. Reveal rankings
  6. Perform multivariate analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is an important feature of an exploratory analysis?

A

Interaction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which type of statistics we use to make sense of data?

A

Descriptive statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In descriptive statistics which measures of central tendency do we use? (3)

A

Mean, median, mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In descriptive statistics which measures of dispersion do we use? (5)

A

range, percentile, variance, standard deviation, quantiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Which visual patterns do we look for with dot representations? (5)

A
  1. trends
  2. concentration
  3. clusters
  4. gaps
  5. outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which visual patterns we look for with line representations? (5)

A
  1. increases, decreases and stationarity
  2. curves
  3. repetitions
  4. interactions
  5. covariation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which types of visualizations are the most suitable to look for correlations?

A
  1. Scatterplot
  2. Scatterplot matrix
  3. Heatmap
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

List techniques to perform cluster analysis: (2)

A
  1. K-means

2. Density-based

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which are the best visualizations to show part-to-whole? (4)

A
  1. Stacked bar chart
  2. Stacked area chart
  3. Bar chart, with cumulative percentage line (see chapter 8 slide 38)
  4. Waffle chart
  5. Pie chart XD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What should we pay attention to when ranking?

A

Look for proportions over absolute values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Which are the best suited visualizations to perform multivariate analysis? (5)

A
  1. Composed axis
  2. Parallel coordinates
  3. Scatterplot matrices
  4. Tablelens
  5. Radar charts
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are we looking for with interaction? (3)

A
  1. Changing the type of visualization
  2. Selection, filter, focus, getting details
  3. Transforming data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly