Lecture 1 Flashcards
Big Data:
Big data refers to data that would typically be too expensive to store, manage, and analyze using
traditional (relational and/or monolithic) database systems
Data analysis
examining, transforming and arranging a given data set in specific ways in order to study its individual
parts and extract useful information
Data analytics
Science or discipline that encompasses the complete management of data. Includes analysis, data collection, organisation, storage, and all the tools and techniques used.
Machine Learning
A set of computational methods that use experience to improve performance or make accurate predictions without being explicitly programmed.
Machine learning is a sub-field of what?
Artificial Intelligence (AI)
AI?
Man-made system that perceives, reacts and analyzes
ML?
System that learns by itself
Supervised ML
Where we have input variables (x) and an output variable (y), i.e. labelled data.
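As a rough illustration of learning from labelled data, the sketch below fits a straight line to (x, y) pairs by ordinary least squares and then predicts y for an unseen x. The data values and the helper name `fit_line` are made up for this example; least squares is just one simple supervised method, not the only one.

```python
# Labelled data: every input x comes with a known output y (made-up values).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares (hypothetical helper)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums around the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

slope, intercept = fit_line(xs, ys)

# Use the learned line to predict the output for an input not seen in training.
prediction = slope * 5.0 + intercept
print(slope, intercept, prediction)
```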
Unsupervised ML
Where we only have input variables (x) and no labelled output (y).
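A minimal sketch of learning without labels: a tiny one-dimensional k-means that groups inputs into two clusters using only the x values. The data, the starting centres, and the helper name `kmeans_1d` are assumptions chosen for illustration.

```python
# Unlabelled data: inputs only, no y values (made-up numbers).
xs = [1.0, 1.2, 0.8, 9.0, 9.5, 8.5]

def kmeans_1d(points, centers, iterations=10):
    """Very small k-means for 1-D points; `centers` holds the starting guesses."""
    groups = [[] for _ in centers]
    for _ in range(iterations):
        # Assignment step: put each point in the group of its nearest centre.
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            groups[nearest].append(p)
        # Update step: move each centre to the mean of its group
        # (an empty group keeps its previous centre).
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

centers, groups = kmeans_1d(xs, centers=[0.0, 5.0])
print(centers)
print(groups)
```

No labels were supplied; the two groups emerge from the structure of the inputs alone.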
Steps of Analysis?
- Acquire
- Prepare
- Analyze
- Report
- Act
Acquiring Data?
finding the right data sources
Prepare Data includes?
- Preliminary investigation
  - Correlations
  - General trends
  - Outliers
- Summary statistics
  - Mean
  - Median
  - Range
  - Standard deviation
- Visualization techniques
  - Histograms
  - Scatter plots
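The summary statistics listed above can be computed with Python's standard `statistics` module. This is a small sketch on a made-up sample; it uses the population standard deviation (`pstdev`), and the text histogram at the end is only a stand-in for a real plot.

```python
import statistics
from collections import Counter

values = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up sample

mean = statistics.mean(values)
median = statistics.median(values)
value_range = max(values) - min(values)
std_dev = statistics.pstdev(values)  # population standard deviation

print("mean:", mean)
print("median:", median)
print("range:", value_range)
print("standard deviation:", std_dev)

# Crude text histogram: one '#' per occurrence of each value.
counts = Counter(values)
for v in sorted(counts):
    print(v, "#" * counts[v])
```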
What do we need to transform raw data so that it is suitable for analysis?
- Scaling
- Transformation
- Feature selection
- Dimensionality reduction
- Data manipulation
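As one example of the scaling step, the sketch below applies min-max scaling, which maps every value into the range 0 to 1. The helper name `min_max_scale` and the height values are hypothetical; other scaling schemes (such as standardising to zero mean and unit variance) are equally common.

```python
def min_max_scale(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: no spread to scale, so map everything to 0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

heights_cm = [150.0, 160.0, 170.0, 190.0]  # made-up feature values
scaled = min_max_scale(heights_cm)
print(scaled)
```

Scaling like this keeps features measured in large units from dominating features measured in small units.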
Analysing Data
Building a model from your data
Different types of analysis techniques
- Classification
- Regression
- Clustering
- Association analysis
- Graph analysis
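To make the first technique concrete, here is a minimal classification sketch: a 1-nearest-neighbour classifier that gives a new point the label of the closest training point. The training data, the labels, and the function name `nearest_neighbour` are assumptions for illustration only.

```python
def nearest_neighbour(train, query):
    """Return the label of the training point closest to `query`.

    `train` is a list of (features, label) pairs (hypothetical format).
    """
    def sq_dist(a, b):
        # Squared Euclidean distance; the square root is not needed for ranking.
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

    _, label = min(train, key=lambda item: sq_dist(item[0], query))
    return label

# Made-up labelled examples with two features each.
train = [
    ((1.0, 1.0), "small"),
    ((1.5, 2.0), "small"),
    ((8.0, 8.0), "large"),
    ((9.0, 7.5), "large"),
]

label_a = nearest_neighbour(train, (2.0, 1.0))
label_b = nearest_neighbour(train, (7.0, 9.0))
print(label_a, label_b)
```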