intro to data science and analytics Flashcards

1
Q

new techniques to solve problems

A

data science and analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

t/f: data scientis is the sexiest job of the 21st century

A

t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

According to Glassdoor, data scientists earn a base pay of __ a year, on the average

A

$116,840

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

roles in analytics

A

collector / data steward
data engineer
business analyst
modeler / data scientist

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

data science, due to its __ nature, requires an intersection of abilities

A

interdisciplinary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

data science skillset: necessary for working with massive amounts of electronic data that must be acquired, cleaned, and manipulated

A

hacking skills

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

data science skillset: allows a data scientist to choose appropriate methods and tools in order to extract insight from data

A

math and statistics knowledge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

data science skillset: crucial for generating motivating questions and hypotheses and interpreting results

A

substantive expertise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

data science skillset: lies at the intersection of knowledge of math and statistics with substantive expertise in a scientific field

A

traditional research

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

data science skillset: stems from combining hacking skills with math and statistics knowledge, but does not require scientific motivation

A

machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

data science skillset: hacking skills combined with substantive scientific expertise without rigorous methods can beget incorrect analyses

A

danger zone

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

data science vs. data analytics:
scope - macro

A

data science

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

data science vs. data analytics:
scope - micro

A

data analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

data science vs. data analytics:
goal - to ask the right questions

A

data science

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

data science vs. data analytics:
goal - to find actionable data

A

data analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

data science vs. data analytics:
major fields - Machine learning, AI, search engine engineering, corporate analytics

A

data science

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

data science vs. data analytics:
major fields - Healthcare, gaming, travel, industries with immediate data needs

A

data analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

t/f: bot data science and data analytic uses big data

A

t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

mother of invention

A

necessity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

goal of report writing

A

automation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

year of report writing

A

1970s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

year of centralized system

A

1980s

23
Q

goal of centralized system

A

ERP (Enterprise Resource Planning) / MIS (Management Info System)

24
Q

year of business intelligence

A

1990s

25
Q

goal of business intelligence

A

apps fo everyone

26
Q

t/f: in the 1990s, applications were invented to share and to analyze

A

f (made to share but not yet to analyze)

27
Q

when was microsoft office for windows released

A

november 19, 1990

28
Q

year of internet & data mining

A

2000

29
Q

year of big data & data science

A

2010

30
Q

t/f: The needs of the industry, as demanded by the fast moving realities of the
present time, also evolve the analytics.

A

t

31
Q

the value in the data “__” is guided by your knowledge of the domain– not the tools or techniques

A

haystack

32
Q

combination of all the skillsets needed in data science

A

analytics

33
Q

evolution of analytics: describes historical data, helps understand how things are going

A

descriptive

34
Q

evolution of analytics: helps understand unique driver, segmentation, statistical, sensitivity analysis

A

diagnostic

35
Q

evolution of analytics: forecast future performance events and results

A

predictive

36
Q

evolution of analytics: analysis that suggest a prescribed action

A

prescriptive

37
Q

evolution of analytics: proactive action, learn off scale, reason with purpose interact naturally

A

cognitive

38
Q

finding useful pattern in a data

A

data mining

39
Q

process of knowledge discovery, machine learning, and predictive analytics

A

data mining

40
Q

extracting meaningful patterns

A

data mining

41
Q

building representative models

A

data mining

42
Q

combination of statistics, machine learning, and computing algorithms

A

data mining

43
Q

t/f: data mining is about descriptive statistics

A

f

44
Q

t/f: data mining is not about dimensional slicing

A

t

45
Q

types of learning models in data mining

A

supervised
unsupervised

46
Q

types of learning models: directed data mining

A

supervised

47
Q

types of learning models: undirected data mining

A

unsupervised

48
Q

types of learning models: the model generalizes the relationship between the input and output variables

A

supervised

49
Q

types of learning models: the objective is to find patterns in data based on the relationship between data points themselves

A

unsupervised

50
Q

supervised goups of learning models

A

classification models
regression models
clustering models

51
Q

unsupervised groups of learning models

A

clustering models
anomaly detection
time series forecasting
association
text and sentiment analysis

52
Q

t/f: clustering model can be both supervised and unsupervised

A

t

53
Q

data mining steps

A

business understanding
data understanding
data preparation
modeling
testing and evaluation
deployment