Compiled Summatives - Sheet1 Flashcards
What is the primary focus of statistics?
Predictive modeling
Data mining
Application of algorithms to inform strategic decisions
Collection, analysis, interpretation, presentation, and organization of data
Collection, analysis, interpretation, presentation, and organization of data
Which of the following methods is commonly used in statistics to understand data distributions and relationships?
Algorithm application
Data mining
Hypothesis testing and regression analysis
Predictive modeling
Hypothesis testing and regression analysis
What does analytics emphasize in addition to statistical methods?
Data presentation
Data interpretation
Predictive modeling and data mining
Data collection
Predictive modeling and data mining
Which of the following best describes the scope of analytics?
Integrates statistical methods with advanced computational techniques
Focuses solely on hypothesis testing
Limited to data collection and presentation
Only involves data organization
Integrates statistical methods with advanced computational techniques
What is the first step in the data analysis process
Get actionable information
Extract patterns
Prepare data
Apply machine learning techniques
Prepare data
Which of the following is not listed as a data source from the chart?
Printed Books
Email
Social Media Posts
Audio
Printed Books
What does the second step of the process involve?
Finding patterns using algorithms
Making decisions based on information
Collecting raw information
Cleaning and transforming databases
Finding patterns using algorithms
In which step would you apply machine learning techniques according to this flowchart?
Step 2- Extract Patterns
None of the above steps explicitly mention applying machine learning techniques
Step 3 - Get Actionable Information
Step 1 - Prepare Data
Step 2- Extract Patterns
What outcome does this flowchart suggest as a result of following these steps?
Creation of new databases
Learning how to code in various programming languages
Development of new software programs
Gaining insights or making informed decisions based on analyzed data
Gaining insights or making informed decisions based on analyzed data
What does transactional data primarily consist of?
Visual representations of data
General summaries of transactions
Structured, detailed information
Unstructured and random information
Structured, detailed information
Which of the following is an example of transactional data?
Credit card payment
Social media posts
Weather forecasts
Movie reviews
Credit card payment
What type of information is included in contractual, subscription, or account data?
Social media interactions
General market trends
Information about the type of product combined with customer characteristics
Weather patterns
Information about the type of product combined with customer characteristics
Which of the following is an example of a product type mentioned in the statement?
Loan
Weather forecast
Movie review
Social media post
Loan
What is the primary aim of surveys?
To extract sociodemographic and behavioral data from a particular group of people
To organize social events for communities
To entertain a particular group of people
To provide financial assistance to people
To extract sociodemographic and behavioral data from a particular group of people
Surveys are typically in the form of:
Novels
Music albums
Questionnaires
Art exhibitions
Questionnaires
Which of the following is NOT an example of unstructured data?
Social media posts
Media files
Sensor data
Spreadsheets
Spreadsheets
What is unstructured data?
Information that resides in a traditional row-column database
Data that is always textual
Data that is always numerical
Information that does not reside in a traditional row-column database
Information that does not reside in a traditional row-column database
Which of the following is an example of a purpose for which data poolers gather data?
Marketing and credit risk assessment
Weather forecasting
Event planning
Cooking recipes
Marketing and credit risk assessment
What is the primary role of data poolers?
To provide financial advice
and sell data for specific purposes
To develop software applications
To create new databases
and sell data for specific purposes
What is the first phase in the data analytics process?
Business Understanding
Modelling
Data Preparation
Evaluation
Business Understanding
What is the primary goal of the Business Understanding phase?
Cleaning data for better quality
Evaluating the model
Evaluating the model
Applying machine learning algorithms
Evaluating the model
Which phase involves selecting related data from various databases?
Data Understanding
Deployment
Data Preparation
Modelling
Data Understanding
Which of the following is NOT a type of database mentioned in the Data Understanding phase?
Relational Databases
Temporal, Sequence or Time-Series Database
Social Media Databases
Data Warehouses
Social Media Databases
What is another term for Data Preparation?
Data Modelling
Data Preprocessing
Data Transformation
Data Cleaning
Data Preprocessing
Which of the following activities is NOT part of Data Preparation?
Aggregating data
Filling in missing values
Applying machine learning algorithms
Filtering outliers
Applying machine learning algorithms
What does Data Transformation involve?
Converting different measurements into a unified numerical scale
Evaluating the model
Selecting related data from databases
Cleaning data for better quality
Converting different measurements into a unified numerical scale
Which of the following is an example of categorical values?
Filtered data
Numerical scales
Ordinal values (less, moderate, strong)
Aggregated data
Ordinal values (less, moderate, strong)
What is the primary focus of the Modelling phase?
Applying statistical and machine learning algorithms
Identifying business tasks
Selecting related data
Cleaning data
Applying statistical and machine learning algorithms
Which phase involves evaluating the performance of the model?
Deployment
Data Preparation
Business Understanding
Evaluation
Evaluation
What is the final phase in the data analytics process?
Modelling
Deployment
Evaluation
Data Understanding
Deployment
Which activity is part of the Data Preparation phase?
Identifying relevant data for the problem description
Evaluating the model
Applying machine learning algorithms
Filtering outliers and redundancies
Filtering outliers and redundancies
What type of data can be found in a Temporal, Sequence or Time-Series Database?
Static data
Aggregated data
Time-based data
Categorical data
Time-based data
Which phase involves selecting the related data from many available databases to correctly describe a given business task?
Data Understanding
Evaluation
Data Preparation
Modelling
Data Understanding
What is the definition of Mean
The range of values in a dataset
The average value of a dataset
The middle value in a dataset
The most frequently occurring value in a dataset
The average value of a dataset
How is the Mean calculated?
By identifying the most frequent value
By summing all values and dividing by the number of values
By subtracting the smallest value from the largest value
By finding the middle value
By summing all values and dividing by the number of values
What does the Median represent?
The most frequently occurring value in a dataset
The middle value when arranged in order
The difference between the highest and lowest values
The average value of a dataset
The middle value when arranged in order
Which measure of central tendency can have multiple values?
Median
Mean
Range
Mode
Mode
What is the primary purpose of measures of central tendency?
Measuring dispersion
Solving equations
Calculating probability
Organizing, summarizing, and visualizing data
Organizing, summarizing, and visualizing data
Formula for mean of population data