Tools overview Flashcards
Azure cognitive services
Cognitive Services brings AI within reach of every developer—without requiring machine learning expertise. All it takes is an API call to embed the ability to see, hear, speak, search, understand, and accelerate decision-making into your apps. Enable developers of all skill levels to easily add AI capabilities to their apps.
Weka
Machine Learning Software in Java
Weka is a collection of machine learning algorithms for data mining tasks. It contains tools for data preparation, classification, regression, clustering, association rules mining, and visualization.
PyTorch
An open source machine learning framework that accelerates the path from research prototyping to production deployment.
Azure HDInsight
Run popular open-source frameworks—including Apache Hadoop, Spark, Hive, Kafka, and more—using Azure HDInsight, a customizable, enterprise-grade service for open-source analytics. Effortlessly process massive amounts of data and get all the benefits of the broad open-source project ecosystem with the global scale of Azure. Easily migrate your big data workloads and processing to the cloud.
CNTK Microsoft Cognitive Toolkit
The Microsoft Cognitive Toolkit (CNTK) is an open-source toolkit for commercial-grade distributed deep learning. It describes neural networks as a series of computational steps via a directed graph. CNTK allows the user to easily realize and combine popular model types such as feed-forward DNNs, convolutional neural networks (CNNs) and recurrent neural networks (RNNs/LSTMs). CNTK implements stochastic gradient descent (SGD, error backpropagation) learning with automatic differentiation and parallelization across multiple GPUs and servers.
DSVM
DSVMs are Azure Virtual Machine images, pre-installed, configured and tested with several popular tools that are commonly used for data analytics, machine learning and AI training.
rattle
Rattle: A Graphical User Interface for Data Mining using R
Theano
Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as aesara:
Chainer
Chainer is a powerful, flexible and intuitive deep learning framework. Chainer supports CUDA computation. It only requires a few lines of code to leverage a GPU. It also runs on multiple GPUs with little effort. (For Python)
Caffe2
Is being deprecated - Caffe2 is now a part of PyTorch.
While the APIs will continue to work, we encourage you to use the PyTorch APIs.
SmartNoise
SmartNoise is an open-source project that contains different components for building global differentially private systems. SmartNoise is made up of the following top-level components:
SmartNoise Core library
SmartNoise SDK library
Fairlearn
Fairlearn is an open-source, community-driven project to help data scientists improve fairness of AI systems.
MLFLOW
MLflow is an open source platform to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry. MLflow currently offers four components:
Azure Geo AI Data Science VM (Geo-DSVM)
The Geo AI Data Science VM extends the AI and data science toolkits in the Windows Server 2016 edition of the Data Science VM by adding ESRI’s ArcGIS Pro and interfaces in both Python and R to help data scientists leverage the spatial data, rich GIS processing, visualization and analytics in ArcGIS Pro to create better AI applications.
Vowpal Wabbit
Vowpal Wabbit provides a fast, flexible, online, and active learning solution that empowers you to solve complex interactive machine learning problems. Vowpal Wabbit provides fast, efficient, and flexible online machine learning techniques for reinforcement learning, supervised learning, and more.