Data Analytic Applications/processes Flashcards
API
Application program interface- allows applications to talk to each other
XML
Extensible markup language. Set of tags or codes to describe a text in digital document
SQL
Helps pull from databases, manages data in relational databases management
D3.js
Data driven document is a JavaScript library for manipulating documents based on data. Brings html to life
JSON
JavaScript object notation, format to transport data on networks without heavy weight. Format is also interchangeable,
MLaaS
ML as a service for cloud computing.
ETL and ETLTL
Extract, transform, load and the latter loads into a data Lake or warehouse to be analyzed at a later ping in time.
Training Data and Test/Validation
Practice data set implemented to build up a model , and test data uses to validate model
QLIK
End to end platform with data integration
Python Vs R
Python: mainly used by programmers , production ready tool that integrates all of the workflow, simple syntax is easy to debug, indentation affects meaning. Functionality is always written same way
R- used in academia, most commonly used by non programmers, statistical models only take a few lines, indents have no meaning, functionality can be written differently