Data processing and info Flashcards

1
Q

What is Data?

A

Raw data which doesn’t have any meaning
Eg 112048 1209854 2293874
Data can be processed into information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Information?

A

Data but with context
Eg 229413 , 396210, 504001
Postal codes for India

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Direct Data meaning

A

Data collected for a specific purpose and only used for that purpose

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Indirect Data meaning

A

Data obtained from a third party and not used for it orginal purpose

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Examples of Direct Data sources

A

Questionnaires (survey)
Interview (talk to people)
Observation (look at people)
Data logging ( uses sensor)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Examples of Indirect Data Sources

A

Electoral Register
Business collecting personal data for use by other third parties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Adv of direct data

A

Original source known so we know reliability (how much sampling bias)
Can gather specific data (obscure data that indirect never measured)
Can gather how much or little data is needed
Up to date data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Disadv of direct data

A

Takes lot of time and money to gather data
May not be able to gather data due to time of the year ( eg summer rainfall data in winter)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Indirect data Adv

A

Immediately available
More likely to have large samples available

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Indirect data disadv

A

Irrelevant data will need to be sifted through and deleted
Original source unknown might be unreliable and biased
Might be out of date

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Factors that affect quality of info

A

Accuracy
Relevance
Age (how old the data)
Level of detail (can have too much data)
Completeness (eg u only said street not house number)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Encryption definition

A

A way of scrambling data so only authorised people can access it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Symmetric Encryption /secret key

A

Faster but same private key to encrypt and unencrypt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Asymmetric Encryption/public key

A

2 keys
public key used to encrypt data (many can people have)
private key only for receiver to decrypt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Types of validation checks

A

Presence check
Range check
Type check
Length check
Format check
Lookup check
Consistency check
Check digit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is type check

A

make sure right data type eg
integer
number
alphanumeric

17
Q

What is lookup check

A

Make sure it matches something in the list
Eg Gender: Male, Female

18
Q

what is consistency check

A

Make sure data across 2 fields are consistent

19
Q

what is validation

A

checks when data is input into computer system ensures data is sensible

20
Q

what is verification

A

process of checking whether data entered into computer matches source

21
Q

2 ways to check accuracy of data

A

verification
validation
both are needed for accuracy

22
Q

Types of data verification

A

Visual check
Double data entry
Hash total
Control total
parity check
check sum

23
Q

What is double data entry

A

Enter data twice compare second to first
like confirm password

24
Q

What is parity check

A

Checks each byte
Even parity and Odd parity
One bit of data is used as parity rest are used for data

In even parity number of 1’s must be even
1001100
there are 3 ones, so parity will be 1

25
What is check sum
Checks whole file Checks data is transmitted accurately from one device to another Has many algorithms to calculate data is correct
26
What is hash total
Calculation is performed before and after transmitting compares 2 values if same or not same Usually done on large files using data not normally used in calculation eg ID column ( can also do on alphanumeric by converting letter to number)
27
What is control total
same as hash total but only on numeric field this will be a meaningful value
28
Data processing 3 popular types
batch online real time
29
Batch processing
Data is entered together and processed in a large batch called a job done when computer resources are not being fully used usually without human interaction stored sequentially
30
Master and Transaction files
Master File contains important data that doesnt change often Transaction file contains data that changes often Transaction file is validated and then sorted into same order as master then combined
31
Data processing
Data processing turns data into information can be actions like collecting and storage Editing and updating Sorting and searching Outputting and spreading info The word transaction is used for these actions
32
Adv batch processing
automated process so less human error reduces costs fewer tasks for human operator can be scheduled to occur when less resources are used
33
dis batch processing
Delay in data being processed errors can't be corrected until processing is complete only same type of data can be processed since its automated
34
Online processing
Direct access is used ( don't have to go through every previous record to access) processing takes place almost immediately each transaction is processed before the next
35
Examples of online processing
Electronic Funds Transfer (EFT) Online store Automatic stock control Electronic data exchange Business to business buying and selling
36
Real time processing
An example of online processing but response must be immediate usually sensors microprocessors and actuators processing is contiunous eg for sensors in greenhouse temp must be controlled
37
Uses of real time processing
Greenhouse Air-conditioning Computer games
38
adv disadv real time processing
always up to date expensive complex computer system ( high specs)