Models & Librairies Flashcards

1
Q

Decision tree

A

sklearn/tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Random Forest

A

sklearn.ensemble/RandomForestClassifier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Naïve Bayes

A

sklearn.naive_bayes/MultinomialNB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

K-means

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Libraries for task automation

A

• Automate - automation
• PyAutoGUI - Graphical User Interface (GUI) automation
• Selenium - web browser automation
• Schedule - job scheduling
• Fabric - streamlining the use of SSH for application deployment
• Celery — distributed task queue
• Invoke - task execution and command-line tooling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Librairies for machine learning

A

• scikit-learn - general-purpose machine learning
• TensorFlow - deep learning and neural networks
• Keras - high-level deep learning
• PyTorch - deep learning and neural networks
• XGBoost - gradient boosting framework
• LightGBM - gradient boosting framework
• CatBoost - gradient boosting framework
• statsmodels — statistical modeling
• NITK - natural language processing (NLP)
• spaCy - NLP
• Gensim - topic modeling and document similarity analysis
• fastai - deep learning
• H20-3 - general-purpose machine learning
• Prophet - time series forecasting
• Neural Structured Learning (NSL). - neural graph learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Libraries for data analysis

A

• pandas — data manipulation and analysis
• NumPy - numerical computing
• SciPy - advanced scientific computing
• math - Python’s built-in module for mathematical operations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Librairies for data visualization

A

• matplotlib - basic plotting
• seaborn — statistical data visualization
• plotly - interactive plotting/APIs
• bokeh - interactive visualization for web browsers
• Vega-Altair - declarative statistical visualization
• GeoPandas - geospatial data visualization
• HoloViews - interactive visualization
• Pygal - Scalable Vector Graphics (SVG) plots
• folium - geospatial data visualization on interactive maps
• Dash by Plotly - analytical web applications
• plotnine - for statistical visualization
• NetworkX - for network graphs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

t-test

A

from scipy.stats import ttest_ind

result = ttest_ind(data[‘group1’], data[‘group2’])

How well did you know this?
1
Not at all
2
3
4
5
Perfectly