Learning From Data Flashcards

1
Q

What is Structured Data?

A

Data that is organised in a predefined schema, for example data stored in a Relational Database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Semi-Structured Data?

A

Data that has some structure, but not in fixed rows or columns. For example: JSON, XML or NoSQL.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Unstructured Data?

A

Data without a predefined structure, which doesn’t fit neatly in tables. For example: text, images, videos or PDF’s.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Data Integration?

A

The practice of combining data from different sources into a single, coherent data store.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Common User Interface?

A

Manual, controlled data-integration, but not scalable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Middleware Data Integration?

A

Uses middleware software to bridge and facilitate communication between different systems, consistent but needs maintenance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Application-based integration?

A

Software applications locate, retrieve and integrate data by making data from different sources and systems compatible with one another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is uniform data access?

A

Provides a consistent view of data from diverse sources without moving or altering it, keeping the data in its original location. Virtual integration without moving data; lighter but may affect integrity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Common data storage?

A

It retrieves and presents data uniformly while creating and storing a duplicate copy, often in a central repository. Stores copies centrally; great for analysis but expensive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between supervised and unsupervised learning?

A

Supervised learning uses data with labelled outcomes, while unsupervised learning algorithms use data without labelled outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is supervised learning?

A

To learn a mapping function from inputs x to outputs y, where x is features, and y is a label or target.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is unsupervised learning?

A

Aim to make sense of data, and uncover patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In supervised learning, what are features?

A

The input variables (columns in the dataset).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are model parameters?

A

Values that the model learns (e.g., coefficients in linear regression).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What’s the difference between y^ and y?

A

y^ is the model’s prediction; y is the true observed value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When is Regression used in Supervised Learning?

A

When the target is a quantitive value.

15
Q

When is Classification used in Supervised Learning?

A

When the target is qualitative or a class.

16
Q

What is the equation for a (the slope) in ax+b for linear regression

A

a =
Σ((x - x̄)(y - ȳ)) /
Σ(x - x̄)(x - x̄)

16
Q

What is the equation for b in ax+b for linear regression

A

b = ȳ - ax̄