Data collection in IoT systems Flashcards
Data
Data is information, such as facts and figures, used to analyze something or make decisions. When data is collected, for example from a wireless sensor, it is processed several times before it reaches a backend system. This is done to customize how the data is presented so that different applications can use it, or to perform calculations that help companies get value from the data and link it to specific business needs.
Structured data
Data organized in tables (rows and columns), making it easy to analyse. Examples are database tables with rows and columns, where each column has a specific property or attribute, and each row is a data record.
Unstructured data
is a type of data that is difficult to organize in tables, such as text documents, images or audio files. This data is often referred to as qualitative data and cannot be easily processed or analyzed with standard data tools. Instead, unstructured data is used in fields such as natural language processing (NLP) and text mining, which analyze and draw insights from text and other forms of data.
Semi-structured data
is a type of data that lies between structured and unstructured data. Unlike structured data, which is always organized in rows and columns (as in a database), semi-structured data lacks a fixed form. However, it does have certain elements such as ‘tags’ or labels that help to understand and organize the information. These tags help to understand what each part of the data means. A common example of semi-structured data is XML (Extensible Markup Language), which uses tags to describe data. E.g., the tags <book>, <title>, <author>, and <year> help us understand what the information is about.</year></author></title></book>
Meta-data
data about data. It provides information that describes or summarises other data, helping to understand, organize, or manage it more effectively.
Types of data collections in IoT
using sensors to collect data and track the status of the smart devices connected to the IoT.
Equipment data: information on the status of IoT devices, often in real-time, for maintenance and optimization. Example: Predictive maintenance.
Environmental data: it includes information on the physical environment. Humidity, temperature, movement, air quality.
Submeters data: Data collected from different users of common resources, e.g. water and electricity in multi-tenant buildings, collecting data and sending it to the cloud.
Location data: it comprises information on the location and movement of people, objects or vehicles. E.g. collecting data and sending it to the cloud.
Which layers interplay to make the IoT data collection process work (IoT data collection architecture)?
Device layer, communication layer, IT edge layer, Event processing layer, client communication layer.
Device layer
layer with sensors to collect data.
Communication layer
this layer manages how data is transferred between IoT devices and other systems. Is the layer that defines a type of protocol, and other protocols that they need to send data. E.g., protocols HTTP/HTTPS, MQTT, CoAP. The communication layer is used in many embedded systems, for example in automation, security systems and household appliances (smart home devices).
IT Edge layer
the layer where data is stored close to its collection point which includes hardware, firmware and operating systems of IoT devices. It plays a crucial role in IoT data processing by performing preliminary processing and analysis of data collected from connected devices.
Event processing layer
layer where data is cleansed, metadata is added and insights are generated.
Client communication layer
a bridge between back-end databases and front-end interfaces. API. It is the layer that communicates the results of data analysis to the end user via interfaces such as a mobile application, which is the interface that the user interacts with. The layer translates raw data into data the end user can easily understand.
What principles do IoT data collection systems use to work properly?
Scalability: robust iot data collection systems must be secure enough to gather and store large volumes of data.
Security: iot-based data collection systems must provide top-notch security to prevent data breaches or unauthorized access.
Interoperability: IoT systems must be able to work together and exchange data, regardless of manufacturers or technological platforms.
Flexibility: iot data collection systems must accept different data formats and adapt to changing requirements.
Why is the right data needed when collecting and how can proper data collection from IoT devices benefit businesses?
- Improved data operational efficiency: IoT data collection automates the collection of sensor information, increasing productivity and eliminating the need for manual data collection.
- Accurate real-time insights: IoT data collection allows businesses to monitor and solve problems in real time, enabling faster response and control.
- Better decision-making: Collected IoT data provides insights into customer behavior, market trends, and business performance, facilitating strategic planning, predictive maintenance, and decisions.
- Saved costs: IoT data can identify inefficiencies in processes, allowing companies to optimize their operations, reduce costs and increase profitability.
Challenges involved in iot data collection
- Security and privacy: Strict data security.
- Compatibility: different types of data need to be integrated and compatible to work effectively.
- Large data sets: a huge amount of data, not all are useful.
- Consistency: costly to ensure the right communication between devices and systems.