Topic 5 Flashcards

1
Q

What is an ER diagram?

A

In Pearson Edexcel International Advanced Level (IAL) IT, an ER diagram (Entity-Relationship diagram) is a type of visual representation used to model the relationships between data entities in a database system.

It consists of:
1. Entities: Represented by rectangles, which are objects or concepts (e.g., “Student” or “Course”).
2. Attributes: Represented by ovals, which provide details about an entity (e.g., “Name” or “ID”).
3. Relationships: Represented by diamonds, showing how entities are connected (e.g., “Enrolled in”).

ER diagrams help design and structure a database before implementation, ensuring data is organized logically and effectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are three features of structured data?

A

Structured Data:
1. Organized Format:
○ Stored in a predefined format like rows and columns (e.g., spreadsheets or databases).
○ Example: A customer database with names, ages, and email addresses.
2. Easily Searchable:
○ Can be quickly searched and queried using tools like SQL.
○ Example: Finding all customers older than 30 in a database.
3. Defined Data Types:
○ Each piece of data has a specific type (like numbers, dates, or text).
○ Example: A “Date of Birth” column only contains valid dates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are three features of unstructured data?

A

Unstructured Data:
1. No Fixed Format:
○ Data isn’t stored in rows or columns and doesn’t have a consistent structure.
○ Example: Photos, videos, emails, or social media posts.
2. Challenging to Search:
○ Requires advanced tools like AI or machine learning to find specific information.
○ Example: Searching for a specific phrase in thousands of video transcripts.
3. Varied Data Types:
○ Includes a mix of text, images, audio, and video.
○ Example: A YouTube video with captions and comments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define structured data:

A

Data that is organised into rows and columns in tables, making it easy to search, store, and analyse (e.g., data in a spreadsheet or database).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define unstructured data:

A

Data that does not have a clear format or structure, such as images, videos, emails, or text documents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain structured and unstructured data:

A

Structured data is highly organised and often stored in relational databases. It is easy to query using SQL.
Unstructured data lacks a predefined format, so analysing it requires advanced tools like natural language processing or AI.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define format:

A

The way data is arranged, structured, or presented (e.g., CSV, JSON, XML).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain why format is useful in IT:

A

Formats standardise how data is stored and shared, ensuring compatibility across systems and simplifying data processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define qualitative data:

A

Descriptive data that captures qualities or characteristics (e.g., colour, texture, opinions).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Explain why format is useful in IT:

A

Formats standardise how data is stored and shared, ensuring compatibility across systems and simplifying data processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Examples of structured data:

A

Customer names and phone numbers in a database
Sales records in an Excel sheet
Financial transactions stored in a SQL database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why structured data is easy to search, manipulate, and analyse:

A

It is organised into predefined fields (rows/columns).
It uses standard formats and can be queried with tools like SQL.
Relationships and patterns are clear due to its structured nature.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Examples of unstructured data:

A

Social media posts
Photos and videos
Emails and chat logs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How ML uses structured data:

A

ML uses structured data to train algorithms by analysing patterns and relationships in labelled datasets, like predicting sales or diagnosing diseases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Process of developing an ML model:

A

Collect and prepare data.
Preprocess data (cleaning, normalisation).
Select an algorithm (e.g., linear regression, decision trees).
Train the model using training data.
Validate and test the model with test data.
Deploy the model and monitor performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Examples of a model to predict house prices:

A

Linear regression (predict prices based on square footage, location, etc.).
Decision trees (categorise houses by features and price ranges).

15
Q

Define data mining:

A

The process of finding patterns, trends, and useful information in large datasets using techniques like clustering and classification.

16
Q

Define data extraction:

A

The process of retrieving specific data from a larger set, often for analysis or migration.

17
Q

Explain data mining and where it is used:

A

Data mining identifies hidden patterns in data.
It is used in marketing (e.g., targeted ads), healthcare (e.g., diagnosis), and fraud detection.

18
Q

Explain where data extraction is used:

A

Used for migrating data from legacy systems, scraping web data, or pulling information from documents.

19
Q

Examples of data explosion sources:

A

Social media platforms
IoT (Internet of Things) devices
Cloud services and online tools
Sensors in smart devices

20
Q

Advantages of using data extraction in organisations:

A

Saves time by automating manual tasks.
Improves decision-making with timely insights.
Reduces errors in data handling.

21
Q

Disadvantages of using data extraction in organisations:

A

High costs for tools and infrastructure.
Risk of data breaches if security is weak.
Time-consuming setup for complex systems.
Potential for inaccuracies if data quality is poor.

22
Q

Why databases are used to structure data:

A

Databases organise data efficiently, allowing easy retrieval, updates, and querying.

23
Q

Define the term composite (general):

A

Something made up of multiple parts or elements.

24
Q

Where composite keys come in handy:

A

When no single field can uniquely identify a record (e.g., in a student-course relationship).

25
Q

Define a composite key in IT:

A

A primary key is made up of two or more fields to uniquely identify a record.

26
Q

Define records:

A

A complete set of related data in a table (a row in a database).

27
Q

Define fields:

A

A specific category of data within a record (a column in a database).

28
Q

Define a database:

A

An organised collection of data stored for easy access and management.

29
Q

Define a table:

A

A collection of records organized into rows and columns within a database.