Collecting, storing, analysing and using data Flashcards
What are the types of data collection methods
- Autonomous devices
- Passive and active data collection
- Manual data collection
- Usage data
What are types of advanced storing techniques
- Redundant Array of Inexpensive Disks (RAID)
- Network Attached Storage (NAS)
- High availability storage
- Storage Area Networks usage (SAN)
- Cloud storage
- Hosted storage
How do you calculate the size of an image
Size of an image = row x columns x bpp
What are the 2 methods of calculating the size of an audio file
File size = sample rate × sample resolution × length
or
File size = bit rate × length of sound
Define and explain the purpose of data
Data can be defined as a set of recorded facts, numbers or events that has no initial meaning or structure.
- The main purpose of data collection is to gather information in a measured and systematic manner to ensure accuracy and analyse the data
- Data only becomes valuable once this has happened as it gives context and meaning in relation to why it was gathered.
What are the 5 methods to help store data
- Virtualisation
- Hosted instance
- Hosted solution
- Clustering
- Blockchain storage
What are some cloud computing services
- Data storage
- Virtualised software
- Remotely hosted applications
Define AI
Artificial intelligence (AI) is the simulation of human intelligence processes by machines.
What are the storage methods of video and audio files
- Digitally sampled sound
- Bitmapped graphics
- Compressed audio
- Compressed video
What are the units of data
Unit Symbol Value
Byte B 8 bits
Kilobyte KB 1024 bytes
Megabyte MB 1024 KB
Gigabyte GB 1024 MB
Terabyte TB 1024 GB
Petabyte PB 1024 TB
Exabyte EB 1024 PB
Zettabyte ZB 1024 EB
Yottabyte YB 1024 ZB
Define Data visualisation
Involves presenting the data visually or graphically to detect patterns, trends and correlations that are not usually apparent from raw data.
Define management information system
A management information system (MIS) is a collection of systems and procedures that gather data from multiple sources and compile them in a readable format.
Define project management software
Project management software (PMS) is a software tool that helps organise, manage and track projects.
Define data warehouse
A data warehouse (DW or DWH) is a system used for reporting and data analysis
Define data mining
Data mining is considered an interdisciplinary field that joins the techniques of computer science and statistics together.
or
the practice of analysing large databases in order to generate new information
Define large data sets
Large data sets refer to data sets that are too large or complex to be dealt with by traditional data-processing application software
Define neural network modelling
A neural network is a series of algorithms that tries to recognise underlaying relationships in a set of data through a process that mimics the way the human brain operates.
Define data flow diagram
Data flow diagrams (DFD) are used to show the flow of data in a business information system. Specific rules and symbols must be used when creating these diagrams.
Define natural language processing
A subset of artificial intelligence is known as natural language processing (NLP). The aim of this subset is to develop computer systems which can understand text or voice data in the same way as human beings.
What are the social and ethical implications of AI
- Is it acceptable if AI becomes more knowledgeable than humans?
- How many jobs will be lost to AI?
- How much data does AI gather?
- Does AI take away people’s privacy?
- How do we know what information AI is generating?
What are the 4 data flow diagram symbols
- Process Box
- Data Flow
- Data store
- External entity
Define Process box
Process box: Text goes inside this box to describe the process being formed (verb followed by a noun, e.g. calculate tax).
Define Data flow
Data flow: Arrows indicate the direction of the flow. They should be labelled with either the data element or set, e.g. customer ID or customer record.
Define data store
Data store/repository: Text should go inside this symbol indicating the name of the repository, e.g. customer database.