Chapter 4 : Data Processing Flashcards

1
Q

Where does data come from?

A

Agent’s sensors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does higher sampling frequency mean?

A

Less information missing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is aliasing?

A

When sampling, signals become aliases of each other and can’t reconstruct original signal anymore

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is noise?

A

Obscures features in data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is artificat?

A

Makes it appear as a feature exists when it does not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

is DFT created on idea of in time any signal can be seen as a sum of sine functions

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is PSD (Power Spectrum Density)?

A

Shows the power at each point of frequency. Co-related to squared amplitude of DFT.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an modified Periodogram?

A

When the window size is not a vector of 1 x number of data points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is welch method?

A
  1. Divide signal into segments
  2. Take periodogram of each of the segments
  3. Average all periodograms
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Purpose of welch method?

A

Smoother PSD with reduced variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What domain are filters applied?

A

Frequency domain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is frequency response?

A

Explains how a filter or system effects signals in frequency domain in terms of amplitude response and phase response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is FIR filter?

A

Impulse Response is finite because no feedback loop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What type of filter is moving average filter?

A

Low pass filter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why normalize filter?

A

So the size of the output does not depend on size of input

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the FIR Output Filter Delay?

A

Output delay of symmetric FIR filters is

(Window size - 1) / 2

17
Q

Why do we need other filters other then moving average filter?

A
  • Moving average filter needs to be large to remove a lot of noise
  • A large filter causes more delay
  • Transition band of moving average filter can be very large
18
Q

What is a matched filter?

A

FIR filter which tries to extract features from a known spatial signal

19
Q

What are problems with matched filter?

A
  • Very sensitive to signal change

- Very fine-tuned for signals

20
Q

What type of output does median filter give?

A

Constant output (box version)

21
Q

What is the purpose of feature selection?

A
  1. Achieves faster training
  2. Select ML models with less complexities which leads to a less chance of overfit and easier to interpret
  3. Better generalization can be achieved, accuracy can be improved
22
Q

What does variance-based feature selection do?

A

Trying to extract features with most information in them

23
Q

What does correlation-based feature selection do?

A

Tries to remove features that are very similar to other features

24
Q

What does univariate-based feature selection do?

A

Evaluate each feature individually with respect to output to see which ones are most important

25
Q

What does sequential-based feature selection do?

A

Uses greedy algorithm to pick local optimal features, uses both:

  1. Forward-SFS algorithm
  2. Backward-SFS algorithm
26
Q

What is the variable deletion techniques in missing data resolution?

A

Get rid of variables that have 50-60% missing data

27
Q

What is the curse of dimensionality?

A

As number of dimensions grow, the learning/search grows exponentially

28
Q

What can proper dimensionality reduction lead to?

A
  1. Less complex processing
  2. Faster processing
  3. Easier visualization
29
Q

What are PCA algorithm steps

A
  1. Normalize the data
  2. Calculate the covariance matrix
  3. Calculate eigenvectors and eigenvalues
  4. Sort by highest-to-lowest by eigen values
  5. Pick however many dimensions needed
30
Q

What happens if Pearson-correlation is 0?

A

The signals show no relationship to eachother

31
Q

What does Independent Component Analysis (ICA) do?

A

Used in source separation, maximizes the statistical independence

32
Q

What conditions are needed for ICA to perform?

A
  1. Number of observes must be greater then or equal to number of sources
  2. The different sources of information must be independent
  3. Information must be additive
33
Q

Problems with ICA?

A
  1. Order of outputs cannot be determined
  2. Not perfect
  3. Computationally expensive (iterative algorithm)
34
Q

Why is ICA called “blind” source separation (BSS)?

A

Due to the fact that there is not much particular information known about the sources