Collecting, storing, analysing and using data Flashcards

1
Q

What are the types of data collection methods

A
  • Autonomous devices
  • Passive and active data collection
  • Manual data collection
  • Usage data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are types of advanced storing techniques

A
  • Redundant Array of Inexpensive Disks (RAID)
  • Network Attached Storage (NAS)
  • High availability storage
  • Storage Area Networks usage (SAN)
  • Cloud storage
  • Hosted storage
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you calculate the size of an image

A

Size of an image = row x columns x bpp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the 2 methods of calculating the size of an audio file

A

File size = sample rate × sample resolution × length

or

File size = bit rate × length of sound

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define and explain the purpose of data

A

Data can be defined as a set of recorded facts, numbers or events that has no initial meaning or structure.

  • The main purpose of data collection is to gather information in a measured and systematic manner to ensure accuracy and analyse the data
  • Data only becomes valuable once this has happened as it gives context and meaning in relation to why it was gathered.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the 5 methods to help store data

A
  • Virtualisation
  • Hosted instance
  • Hosted solution
  • Clustering
  • Blockchain storage
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some cloud computing services

A
  • Data storage
  • E-mail
  • Virtualised software
  • Remotely hosted applications
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define AI

A

Artificial intelligence (AI) is the simulation of human intelligence processes by machines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the storage methods of video and audio files

A
  • Digitally sampled sound
  • Bitmapped graphics
  • Compressed audio
  • Compressed video
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the units of data

A

Unit Symbol Value

Byte B 8 bits
Kilobyte KB 1024 bytes
Megabyte MB 1024 KB
Gigabyte GB 1024 MB
Terabyte TB 1024 GB
Petabyte PB 1024 TB
Exabyte EB 1024 PB
Zettabyte ZB 1024 EB
Yottabyte YB 1024 ZB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define Data visualisation

A

Involves presenting the data visually or graphically to detect patterns, trends and correlations that are not usually apparent from raw data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define management information system

A

A management information system (MIS) is a collection of systems and procedures that gather data from multiple sources and compile them in a readable format.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define project management software

A

Project management software (PMS) is a software tool that helps organise, manage and track projects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define data warehouse

A

A data warehouse (DW or DWH) is a system used for reporting and data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define data mining

A

Data mining is considered an interdisciplinary field that joins the techniques of computer science and statistics together.

or

the practice of analysing large databases in order to generate new information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define large data sets

A

Large data sets refer to data sets that are too large or complex to be dealt with by traditional data-processing application software

17
Q

Define neural network modelling

A

A neural network is a series of algorithms that tries to recognise underlaying relationships in a set of data through a process that mimics the way the human brain operates.

18
Q

Define data flow diagram

A

Data flow diagrams (DFD) are used to show the flow of data in a business information system. Specific rules and symbols must be used when creating these diagrams.

19
Q

Define natural language processing

A

A subset of artificial intelligence is known as natural language processing (NLP). The aim of this subset is to develop computer systems which can understand text or voice data in the same way as human beings.

20
Q

What are the social and ethical implications of AI

A
  • Is it acceptable if AI becomes more knowledgeable than humans?
  • How many jobs will be lost to AI?
  • How much data does AI gather?
  • Does AI take away people’s privacy?
  • How do we know what information AI is generating?
21
Q

What are the 4 data flow diagram symbols

A
  • Process Box
  • Data Flow
  • Data store
  • External entity
22
Q

Define Process box

A

Process box: Text goes inside this box to describe the process being formed (verb followed by a noun, e.g. calculate tax).

23
Q

Define Data flow

A

Data flow: Arrows indicate the direction of the flow. They should be labelled with either the data element or set, e.g. customer ID or customer record.

24
Q

Define data store

A

Data store/repository: Text should go inside this symbol indicating the name of the repository, e.g. customer database.

25
Q

Define External entity

A

External entity: This should be labelled with a description, e.g. it
could be a person, department or external organisation or system that provides or receives data to or from the organisation.

26
Q

Define Virtualisation

A

Virtualisation: Is the process of turning hardware into a software equivalent without sacrificing functionality.

27
Q

Define Hosted instance

A

Hosted instance: Instances are the virtual machines that run operating systems’ images such as Linux.

28
Q

Define Hosted solution

A

Hosted solution: When you rent a virtual server from a company that takes over the responsibility for maintaining and keeping your server running.

29
Q

Define Clustering

A

Clustering: A group of two or more computer systems that run in parallel together to achieve a goal.

30
Q

Define blockchain storage

A

Blockchain storage: A way of saving data in a decentralised network, which utilises the unused hard disk space of users across the world to store files.

31
Q

Define digital sampled sound

A

Digital audio is a representation of sound recorded in or converted into, digital form.

32
Q

Define bitmapped graphics

A

A bitmap (also called “raster”) graphic is created from rows of different coloured pixels that together form an image

33
Q

Define compressed audio

A

Audio data compression has the potential to reduce the transmission bandwidth and storage requirements of audio data

34
Q

Define compressed video

A

Video compression is the process of reducing the total number of bits needed to represent a given image or video sequence.

35
Q

Define Shell in expert systems

A

Expert system shells are toolkits that can be used to develop expert systems. They consist of some built expert system components with an empty knowledge base.

36
Q

Define Heuristics in expert systems

A

A heuristic system is designed to work with uncertainty and to simulate producing decisions based on experience.

37
Q

Define Fuzzy logic in expert systems

A

The concept provides possibilities which are not given by computers, but are similar to the range of possibilities generated by humans.

38
Q

Define Knowledge engineer

A

A knowledge engineer is a professional engaged in the science of building advanced logic into computer systems in order to try to simulate human decision-making and high-level cognitive tasks.