G2. Data science issues Flashcards
Describe the general methodology of data science? What is its objective?
Representation of complex environments by rich data (we always assume we have enough data) The objective is to produce beliefs informed by data
What is a Web IDE? What does IDE stand for? What is a notebook?
Integrated Development Environment. (Visual)
Essential tool designated to maximize programmer productivity.
A notebook is a document that contains both code (e.g., Python, R, Julia) and rich text elements (markdowns)
Give a general description of a Data Science virtual machine
(Microsoft Azure) Data Science Virtual Machine (DSVM) is a pre-configured, ready-to-use virtual machine image tailored for data science and machine learning tasks. It comes with a set of tools, libraries, and frameworks commonly used in the data science field, providing a convenient and consistent environment for data scientists and analysts. designed to save time and effort in setting up a data science environment, allowing users to focus on their actual analysis and modeling tasks.
o Give the general functional architecture showing how does Azure Notebooks
communicates with GitHub and with the Python interpreter in the setting used for experimenting in the lab sessions?
pending