Data Sci Flashcards

1
Q

What is the structure of a data science project

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two goals of ETA - exploratory data analysis

A

The first is you wanna know if the data that you have is suitable for answering the question that you have. Then so this will depend on a variety of factors depending on very basic things like is there enough data, are there too many missing values, things like that. The second goal of exploratory data analysis isto start to develop a sketch of the solution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is an alternative structure of data science project

A

If this process if you used a date set to generate questions you can’t use the same data set to look for answers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

4 hallmarks of success of a data sci project includes

A

New knowledge is created.

Decisions or policies are made based on the outcome of the experiment.

A report, presentation or app with impact is created.

It is learned that the data can’t answer the question being asked of it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the five types of data analytics ?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the tools used for different types of data analytics

A

Descriptive and diagnostic analytics usually rely on analytic tools that can handle manipulation of large sets of data or that help visualize and interact with summarized information. Examples include SQL, Oracle database or Oracle DB, Hadoop/Spark, Tableau, QlikView, Microsoft Access, SAS, R, Python and various statistical packages within them.

Predictive and prescriptive analytics have traditionally relied on analytics tools that have significant mathematical modeling capabilities or scenario planning or simulation capabilities. Examples of these tools include SAS, R, SPSS, Python, and various packages associated with them. Optimization tools like Garrobi, ILOG, RiverLogic, etc. Simulation tools like Vensim, AnyLogic, STELLA. Machine learning and deep learning tools, like Scikit, TensorFlow, Caffe, Theano, etc. Natural language processing tools like NLTK or. Natural Language Tool Kit or OpenNLP.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly