Chapter 1: Data Analytics Flashcards
Data Analytics
What are the key technologies of big data?
- Data mining
- Text mining: Analyzes text-based data from websites, comment fields, books
- Data management
- In-memory analytics
- Predictive analytics
- Hadoop: an open-source framework that stores and processes large amounts of data across clusters of computers
Data Analytics
What is data mining?
- Data mining drills down into large data sets to identify patterns and filter out repetitive, redundant data
- It then flags unexpected data to surface potential issues (e.g., fraudulent transactions, correlations between seemingly unrelated data)
Data Analytics
What is in-memory analytics?
- In-memory analytics analyzes data held in system memory (RAM) instead of reading it from hard drives
- This is because memory access is much faster than disk access, so queries and calculations return results more quickly
Data Analytics
What is Diagnostic Analysis?
- Why did it happen? What caused the results to occur?
- Uses historical information to provide insight
Data Analytics
What is Descriptive Analytics?
- What happened?
- The most basic and commonly used type of analytics
- Reports on actual results from historical information
Data Analytics
What are the different types of descriptive analytics?
- Anomaly detection
- Regression analysis
- Reviewing relationships and trends in historical information to find out why the trends occur
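As a rough illustration of regression analysis on historical information, the sketch below fits a straight line to a series by ordinary least squares. The function name `fit_line` and the monthly sales figures are hypothetical, chosen only for this example.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical historical data: month number and sales for that month.
months = [1, 2, 3, 4, 5, 6]
sales = [100.0, 110.0, 120.0, 130.0, 140.0, 150.0]

slope, intercept = fit_line(months, sales)
print(f"Sales change by about {slope:.1f} per month, starting near {intercept:.1f}")
```

The slope summarizes the historical trend in a single number, which is the kind of relationship this card describes.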
Data Analytics
Why is anomaly detection considered a component of descriptive analysis?
Once all of the historical data mining, extraction, and cleaning has been done, the data is then reviewed and analyzed to determine whether there are any unusual patterns or deviations from the expected results
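One simple way to look for deviations from expected results is a z-score check, sketched below. This is only one of many possible techniques; the function name `find_anomalies`, the threshold of 2.0, and the transaction amounts are all assumptions made for the example.

```python
from statistics import mean, pstdev


def find_anomalies(values, threshold=2.0):
    """Return values whose distance from the mean exceeds `threshold` standard deviations."""
    center = mean(values)
    spread = pstdev(values)
    if spread == 0:
        # All values are identical, so nothing deviates from the norm.
        return []
    return [v for v in values if abs(v - center) / spread > threshold]


# Hypothetical cleaned historical transaction amounts.
transactions = [100, 102, 98, 101, 99, 500]
print(find_anomalies(transactions))
```

In a small sample a single extreme value inflates the standard deviation, so the threshold usually needs tuning to the data at hand.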
Data Analytics
What is Predictive Analytics?
- What will happen next?
- What is the expected outcome of the result?
- Predictive analytics focuses on future results rather than reporting on past results
- It applies assumptions to data from various technology sources to estimate the possible outcomes of future events
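To make "applying assumptions to find different outcomes" concrete, here is a minimal sketch that projects a figure forward under several assumed growth rates. The function `project`, the scenario names, the rates, and the starting revenue are hypothetical and not taken from any real data set.

```python
def project(current_value, growth_rate, periods):
    """Project a value forward, assuming a constant growth rate per period."""
    return current_value * (1 + growth_rate) ** periods


# Hypothetical assumptions about growth per period.
scenarios = {"pessimistic": -0.05, "baseline": 0.03, "optimistic": 0.10}
current_revenue = 1000.0

for name, rate in scenarios.items():
    forecast = project(current_revenue, rate, periods=3)
    print(f"{name}: {forecast:.2f}")
```

Each assumption produces a different expected outcome; real predictive models estimate such rates from data instead of fixing them by hand.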
Data Analytics
What types of technology can be used in predictive analysis?
- Data mining
- Statistical algorithms
- Machine-learning techniques
- Predictive modeling, including cluster analysis, which groups data with similar characteristics
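Cluster analysis can be sketched with a very small one-dimensional k-means loop, shown below. The function `kmeans_1d`, the starting centers, and the order values are hypothetical; production work would normally rely on an established library instead of this hand-rolled version.

```python
def kmeans_1d(values, centers, iterations=10):
    """Group numbers around the nearest center, then move each center to its group's mean."""
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for v in values:
            # Assign each value to the closest current center.
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Recompute each center; keep the old one if its cluster is empty.
        centers = [
            sum(group) / len(group) if group else centers[i]
            for i, group in enumerate(clusters)
        ]
    return centers, clusters


# Hypothetical order values that fall into two natural groups.
orders = [10, 12, 11, 50, 52, 51]
centers, clusters = kmeans_1d(orders, centers=[10, 50])
print(centers, clusters)
```

Values that end up in the same cluster share similar characteristics, here simply a similar size.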
Data Analytics
What is Prescriptive Analysis?
- What should be done in the future in order for the results to occur?
- Based on future results, not historical information
- It is the most complex type of analysis because it draws on all of the other analysis tools (diagnostic, descriptive, and predictive analytics) to improve business strategy
- Draws on the most data inputs
Data Analytics
What is the ranking of data sizes, from smallest to largest?
“Kim Met Gene To Purchase Extra Zebra Yarn”
* Kilobyte
* Megabyte
* Gigabyte
* Terabyte
* Petabyte
* Exabyte
* Zettabyte
* Yottabyte
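The ranking above can be sketched in code as a unit ladder. This example assumes decimal units, where each step is 1,000 times the previous one; some contexts use 1,024 instead. The names `UNITS` and `largest_unit` are hypothetical.

```python
# Units in ascending order; each is assumed to be 1,000 times the one before it.
UNITS = [
    "byte", "kilobyte", "megabyte", "gigabyte", "terabyte",
    "petabyte", "exabyte", "zettabyte", "yottabyte",
]


def largest_unit(num_bytes):
    """Express a byte count in the largest unit that keeps the value at 1 or more."""
    index = 0
    while index < len(UNITS) - 1 and num_bytes >= 1000 ** (index + 1):
        index += 1
    return num_bytes / 1000 ** index, UNITS[index]


# The mnemonic's initials match the unit names from kilobyte upward.
mnemonic = "Kim Met Gene To Purchase Extra Zebra Yarn"
assert [w[0] for w in mnemonic.split()] == [u[0].upper() for u in UNITS[1:]]

value, unit = largest_unit(5 * 10**12)
print(value, unit)
```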
Implementing Data Analytics
What are the five stages of data analytics?
- Define business questions to determine the goals and objectives that need to be achieved
- Obtain relevant data through information discovery
- Clean/scrub/normalize data
- Analyze data to derive values
- Communicate results including information used, conclusions and recommendations
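The five stages can be sketched as a tiny pipeline, one function per stage. Everything here is hypothetical: the business question, the hard-coded records, and the function names are placeholders for illustration only.

```python
from statistics import mean

# Stage 1: define the business question (hypothetical).
QUESTION = "What is the average order value per region?"


def obtain_data():
    """Stage 2: obtain relevant data (hard-coded hypothetical records)."""
    return [
        {"region": " East", "amount": "120.50"},
        {"region": "east", "amount": "79.50"},
        {"region": "WEST", "amount": "200"},
        {"region": "west", "amount": None},  # missing value
    ]


def clean(records):
    """Stage 3: drop records with missing amounts and normalize the rest."""
    cleaned = []
    for record in records:
        if record["amount"] is None:
            continue
        cleaned.append({
            "region": record["region"].strip().lower(),
            "amount": float(record["amount"]),
        })
    return cleaned


def analyze(records):
    """Stage 4: derive value, here the average amount per region."""
    by_region = {}
    for record in records:
        by_region.setdefault(record["region"], []).append(record["amount"])
    return {region: mean(amounts) for region, amounts in by_region.items()}


def communicate(question, results):
    """Stage 5: report the question alongside the conclusions."""
    lines = [question]
    for region, average in sorted(results.items()):
        lines.append(f"  {region}: {average:.2f}")
    return "\n".join(lines)


print(communicate(QUESTION, analyze(clean(obtain_data()))))
```

Keeping each stage in its own function makes it easy to see which step changed the data and to swap in a real data source later.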
Data Visualization
When would a pie chart be used?
Pie charts are used to show the relative proportions of the parts that make up a whole for a specific period
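The slices of a pie chart are simply each category's share of the total. A minimal sketch of that arithmetic follows, with a hypothetical `proportions` function and made-up expense figures for a single period.

```python
def proportions(amounts):
    """Return each category's share of the total, as a percentage."""
    total = sum(amounts.values())
    return {label: value * 100 / total for label, value in amounts.items()}


# Hypothetical expenses for one month.
expenses = {"rent": 500, "payroll": 300, "utilities": 200}
for label, share in proportions(expenses).items():
    print(f"{label}: {share:.1f}%")
```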
Data Visualization
When would a scatter-plot chart be used?
Scatter plots are used to show the relationship between two variables
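The relationship a scatter plot shows visually can also be summarized numerically, for example with the Pearson correlation coefficient. The sketch below is one such summary; the function name `pearson` and the advertising and sales figures are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation: +1 is a perfect rising line, -1 a perfect falling line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariance / sqrt(spread_x * spread_y)


# Hypothetical paired observations: advertising spend and units sold.
ad_spend = [1, 2, 3, 4]
units_sold = [2, 4, 6, 8]
print(round(pearson(ad_spend, units_sold), 3))
```

A coefficient near zero suggests the points on the scatter plot would show no clear linear pattern.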
Data Visualization
When would a line chart be used?
- Line charts are used to show trends, cycles, or variability over time
- They are similar to bar graphs, but values are plotted as points connected by a line instead of as bar heights