Data Exploration Flashcards
define data exploration
Data Exploration is the process of exploring the data and putting it in a uniform way for a better understanding.
big data as a field?
big data is a field that treats ways to analyze, extract information and deal with data sets that are too massive/ complex to be dealth with by traditional data processing software.
big data define
big data refers to data that is so large and complex that it is an almost impossible task to be processed by traditional methods.
how do people deal with big data
people work with the cloud and networks of endless normal servers and pwerful algorithms. this allows them to process over a million pieces of data within the matter of minutes.
name and explain parameters of data
-volume( size of data)-velocity( speed of data accumulation)-variety structured/semi structured/un structured data-veracity- accuraccy and trustworthiness of data-value- rumuneration of the data
advantages of data exploration
-gain better understanding and new insights of data-identify the trends, relationships and patterns present in the data.- decide the course of action to take in the AI project cycle.-allow us to visually represent the data- helps in validating or verification of the collected data
define data visualisation
data visualisation is the process of representing the data in the form of graphics to gain better understanding of new insights of data.
advantages of visualisation
Quickly get a sense of the trends, relationships and patterns contained within the data.• Define strategy for which model to use at a later stage.• Communicate the same to others effectively.
steps of visualization
- Start with understanding the data,2. Scoping the problem and extracting authentic data.3. And present it using different graphs.
How to read the graph chart?
- DESCRIBE: This part explains how the figure/table is set up. like:1) representation on axis. 2)Units of axis. 3)sclae on each axis.4) symbol and legend on a figure. 5) Observe the patterns in the data.2. INTERPRET: It will explain how to interpret the data.As it is important to understand how to draw conclusion from the pattern described.
name tools used to represent data
- MS Excel2. Apple Numbers3.Google Charts4. OnlineChartTool.com5.Python Graph gallery
components of a chart
-data series-axes(x and y acxis)-chart area-legend
define chart
A chart is a graphical representation of information which helps in presenting relationships and patterns to reach a hypotheses.
Differentiate between chart and graph.
Chart It is the representation of data diagrammatically in the form of graphs, shapes etc. It is used to divide data into subsets.Ex : Pie chart, Bar chart etc GraphIt is a way to represent data visually which shows the arithmetic relation between the data.It is used to find patterns and trends in data.Not all graphs are charts. But all graphs are chatsEx : Line graph, Bar graph etc
all graphs are charts but not all charts are graphs.
charts is a data visualisation technique used to represent data diagramtically like shapes etc. graphs are a type of chart used to represent the arithmetic relationship between data visually. charts usually aid in dividing data into subsets while graphs are used to identify trends or paterns. so graphs fall under the catgory of the chart making all graphs charts.