FA5 + M5 - Sheet1 Flashcards

Question 1

Q

Which of the following libraries are used for mathematical and statistical operations on multi-dimensional arrays and matrices in Python?

Group of answer choices

Matplotlib

NumPy

Pandas

Question 2

Q

Which of the following libraries are used for data visualization in Python?

Group of answer choices

NumPy

Matplotlib

SciPy

Answer

A

Matplotlib

Question 3

Q

Which of the following libraries are used for deep learning in Python?

Group of answer choices

TensorFlow

Scikit-learn

Keras

Answer

A

TensorFlow

Question 4

Q

Which of the following libraries are used for natural language processing in Python?

Group of answer choices

NLTK

Scrapy

Scikit-learn

Question 5

Q

Which of the following libraries are used for creating spiders bots that scan website pages and collect structured data in Python?

Group of answer choices

Scrapy

Pandas

SciPy

Question 6

Q

Which of the following libraries are used for object identification, speech recognition, and more in Python?

Group of answer choices

PyTorch

Keras

Dist-keras

Answer

A

Tensorflow dapat pero Pytorch ung tama sa canvas

Question 7

Q

Which of the following libraries are used for reading data, selecting and filtering in data, and data manipulations in Python? There are two correct answer in the options, just choose one.

Group of answer choices

PyTorch

Pandas

NumPy

SciPy

Answer

A

Pandas
NumPy

Question 8

Q

Which of the following libraries are used for creating interactive and scalable visualizations in a browser using JavaScript widgets in Python? There are two correct ansers from the choices, just select one.

Group of answer choices

SciPy

Bokeh

NumPy

Bokeh

Plotly

NumPy

SciPy

Answer

A

Bokeh
Bokeh

Plotly
Plotly

Question 9

Q

Which Python libraries are built on NumPy? There are two correct ansers from the choices, just select one.

Group of answer choices

Pandas

Seaborn

Scikit-Learn

Matplotlib

Answer

A

Pandas
Scikit-Learn

Question 10

Q

Which Python library provides machine learning algorithms?

Group of answer choices

Pandas

Scikit-Learn

NumPy

Matplotlib

Answer

A

Scikit-Learn

Question 11

Q

Data Wrangling:

Answer

A

SciPy
NumPy
pandas

Question 12

Q

Statistic

Answer

A

StatsModels

Question 13

Q

NLP

Answer

A

Natural Language Toolkit
SpaCy
gensim

Question 14

Q

Machine Learning

Answer

A

scikitlearn
xgboost
lightgbm
catboost
eli5

Question 15

Q

Deep Learning

Answer

A

TensorFlow
Pytorch
Keras

Question 16

Q

Distributed Deep Learning

Answer

A

dist-keras
elephas
spark-deep-learning

Question 17

Q

Visualization

Answer

A

matplotlib
Bokeh
plotly
Seaborn
pydot

Question 18

Q

it is intended for processing large multidimensional arrays and matrices, and an extensive collection of high-level mathematical functions and implemented methods makes it possible to perform various operations with these objects

Answer

A

NumPy (numpy.org)

Question 19

Q

it is based on NumPy and therefore extends its capabilities. SciPy main data structure is again a multidimensional array, implemented by Numpy.

Answer

A

SciPy (scipy.org/scipylib)

Question 20

Q

The package contains tools that help with solving linear algebra, probability theory, integral calculus and many more tasks

Question 21

Q

provides high-level data structure and a vast variety of tools for analysis. The great feature of this package is the ability to translate rather complex operations with data into one or two commands.

Answer

A

Pandas (pandas.pydata.org)

Question 22

Q

contains many built-in methods for grouping, filtering, and combining data, as well as the time-series functionality

Question 23

Q

is a low-level library for creating two-dimensional diagrams and graphs.

Answer

A

Matplotlib (matplotlib.org)

Question 24

Q

With iths help, you can build diverse charts, from histograms and scatterplots to non-Cartesian coordinates graphs.

Answer

A

Matplotlib (matplotlib.org)

Question 25

Q

Moreover, many popular plotting libraries are designed to work in conjunction with ____

Answer

A

matplotlib

Question 26

Q

is essentially a higher-level API based on the matplot library.

Answer

A

Seaborn (seaborn.pydata.org)

Question 27

Q

It contains more suitable default settings for processing charts.

Question 28

Q

Also, there is a rich gallery of visualizations including some complex types like time series, jointplots, and violin diagrams

Question 29

Q

is a popular library that allows you to build sophisticated graphics easily.

Answer

A

Plotly (plot.ly/python/)

Question 30

Q

The package is adapted to work in interactive web applications.

Question 31

Q

Among its remarkable visualizations are contour graphics, ternary plots, and 3D charts

Question 32

Q

The ____library creates interactive and scalable visualizations in a browser using JavaScript widgets.

Answer

A

Bokeh (bokeh.pydata.org/en/latest/)

Question 33

Q

The library provides a versatile collection of graphs, styling possibilities, interaction abilities in the form of linking plots, adding widgets, and defining callbacks, and many more useful features.

Question 34

Q

is a popular framework for deep and machine learning, developed in Google Brain.

Answer

A

TensorFlow (tensorflow.org)

Question 35

Q

It provides abilities to work with artificial neural networks with multiple data sets.

Answer

A

TensorFlow

Question 36

Q

Among the most popular TensorFlow applications are _____ and more.

Answer

A

object identification, speech recognition,

Question 37

Q

is a large framework that allows you to perform tensor computations with GPU acceleration, create dynamic computational graphs and automatically calculate gradients.

Answer

A

PyTorch (pytorch.org)

Question 38

Q

Above this, ____ offers a rich API for solving applications related to neural networks

Question 39

Q

is a high-level library for working with neural networks, running on top of TensorFlow, Theano, and now as a result of the new releases.

Answer

A

Keras (keras.io)

Question 40

Q

It simplifies many specific tasks and greatly reduces the amount of monotonous code. However, it may not be suitable for some complicated things.

Answer

A

Keras (keras.io)

Question 41

Q

These packages allow you to train neural networks based on the Keras library directly with the help of Apache Spark

Answer

A

Dist-keras (joerihermans.com/work/distributed-keras/)

Question 42

Q

dist-keras and others are gaining popularity and developing rapidly, and it is very difficult to single out one of the libraries since they are all designed to ______

Answer

A

solve a common task.

Question 43

Q

This Python module based on NumPy and SciPy is one of the best libraries for working with data.

Answer

A

Scikit-learn (scikit-learn.org/stable)

Question 44

Q

It provides algorithms for many standard machine learning and data mining tasks such as clustering, regression, classification, dimensionality reduction, and model selection

Answer

A

Scikit-learn

Question 45

Q

is an extension module that makes several frequent item set mining implementations available as functions.

Question 46

Q

In PyFim, Currently _______ are available as functions, although the interfaces do not offer all of the options of the command line progarm

Answer

A

apriori, eclat, fpgrowth, sam, relim, carpenter, ista, accretion and apriacc

Question 47

Q

Often the results of machine learning models predictions are not entirely clear, and this is the challenge that ___ library helps to deal with.

Question 48

Q

it is a package for visualization and debugging machine learning models and tracking the work of an algorithm step by step.

Answer

A

Eli5 (eli5.readthedocs.io/en/latest/)

Question 49

Q

It provides support for scikit-learn, XGBoost, LightGBM, lightning, and sklearn-crfsuite libraries and performs the different tasks for each of them

Question 50

Q

is a set of libraries, a whole platform for natural language processing.

Answer

A

NLTK (nltk.org)

Question 51

Q

With the help of ____, you can process and analyze text in a variety of ways, tokenize and tag it, extract information, etc.

Question 52

Q

is also used for prototyping and building research systems

Question 53

Q

is a Python library for robust semantic analysis, topic modeling and vector-space modeling, and is built upon Numpy and Scipy.

Answer

A

Gensim (radimrehurek.com/gensim)

Question 54

Q

Gensim provides an implementation of popular NLP algorithms, such as _____.

Question 55

Q

Although gensim has its own models.wrappers.fasttext implementation, the ____ can also be used for efficient learning of word representations.

Answer

A

fasttext library

Question 56

Q

is a library used to create spiders bots that scan website pages and collect structured data.

Answer

A

Scrapy (scrapy.org)

Question 57

Q

In addition, Scrapy can extract data from the ___

Question 58

Q

The library happens to be very handy due to its extensibility and portability

Question 59

Q

Introduces for multi-dimensional arrays and matrices, as well as functions that allow to easily perform advanced mathematical and statistical operations on those objects

Question 60

Q

Provides vectorization of mathematical operations on array and matrices which significantly improves the performance

Question 61

Q

Many other python libraries are built on ____

Question 62

Q

adds data structures and tools designed to work with table - like data (similar to Series and Data Frames in R)

Question 63

Q

Provides tools and data manipulation: reshaping, sorting, slicing, aggregation etc.

Question 64

Q

Allow handling missing data

Answer 40

A

Scikit-Learn

Answer 41

A

Scikit-Learn

Answer 42

A

matplotlib

Answer 43

A

matplotlib

Answer 44

A

matplotlib

Answer 45

A

matplotlib

Answer 46

A

import numpy as np
import scipy as sp
impor pandas as pd
import matplotlib as mpl
import seaborn as sns

Answer 47

A

Shift+Enter

Answer 48

A

pd.read.excel(‘myfile.xlsx’, sheet_name = ‘Sheet1’, index_col = None, na_values = [‘NA’])

pd.read_stata(‘myfile.dts’)

Answer 49

A

df.head()

Answer 50

A

pd.iloc[:10]

Answer 51

A

df.tail(10)

Answer 52

A

object (string)

Answer 53

A

Int64 (Int)

Answer 54

A

Float64 (Float)

Answer 55

A

Datetime64, timedelta[ns] (N/A)

Answer 56

A

df[‘salary’].dtype

Answer 57

A

df.dtypes

Answer 58

A

parentheses.

Answer 59

A

dir() function

Answer 60

A

head( [n] ), tail( [n] )

Answer 61

A

describe()

Answer 62

A

max(), min()

Answer 63

A

mean(), median()

Answer 64

A

sample([n])

Answer 65

A

Split the data into groups based on some criteria
Calculate statistics (or apply a function) to each group
Similar to dplyr() function in R

Answer 66

A

df_rank = df.groupby([‘rank’])

Answer 67

A

Boolean indexing.

Answer 68

A

Boolean operator

Answer 69

A

one or more columns
one or more rows
a subset of rows and columns

Answer 70

A

double brackets:

Answer 71

A

cumsum() and cumprod()

Answer 72

A

Aggregation

Answer 73

A

min, max
count, sum, prod
mean, median, mode, mad
std, var

Answer 74

A

describe()

Answer 75

A

mean, median, mode

Answer 76

A

violinplot

Answer 77

A

jointplot

Answer 78

A

swarmplot

Answer 79

A

factorplot

Answer 80

A

statsmodel and scikit-learn

Answer 81

A

statsmodel

Answer 82

A

scikit-learn

Answer 83

A

inear regressions
ANOVA tests
hypothesis testings
many more

Answer 84

A

kmeans
support vector machines
random forests
many more