Programming as a data scientist Flashcards
Jupyter Notebook
Also called a Notebook, a cloud-based interface for writing, executing, and presenting Python code
Markdown
A way of writing text that, when read by an interpreter, formats the text that you’ve written so that it includes styling such as headers, bold, and italics.
Cascading Style Sheets CSS
(programming language) is a stylesheet language used to describe the presentation of a document written in HTML or XML. It describes how elements should be rendered on screen, on paper, in speech, or on other media.
What is the file extension for Jupyter Notebooks?
.ipynb
State of a Variable
What value is assigned to a variable
variable
a named container that can store a changeable data value
state
data that defines the condition of some object of program
Reserved word
a special category of words that you’re not allowed to use as variable names in a given programming language
Snake case
A style of writing in which each space is replaced by an underscore
String
A sequence of letters, numerals, punctuation marks, or other characters, treated as text
Integar
Also called an int, a whole number that isn’t written as a fraction or with a decimal point
Floating-point number
Also called a float, a number that includes a decimal point
Boolean
A binaroy variable with two possible values: true and false
Concatenate
link (things) together in a chain or series
List
An ordered collection of data in Python
Dictionary
A collection of key-value pairs in Python
Function
A block of code that describes a repeatable process or behavior
Argument
A value or other input that is passed into a function
Parameter
A placeholder variable listed in a function declaration
Parameter list
The parameters that a function expects to be passed inside to the main body of the function when the function is called
Determinate function
A function that will always return the same output for a given input
Side Effect
Any observable thing that happens while a function is running
Return statement
The main effect of a function, giving as a result whatever comes after the keyword return
Control flow
Also called flow of execution, the tools provided by a programming language to conditionally determine which set of instructions runs
Example: if/elif/else
and try/except
logical operators
used to combine conditional statementsand,
or,
not`
conditional statement
logical expression
Loop
A construct that allows you to repeat a set of instructions a specific number of times, or until a specific condition is true.
zero-indexed
means that the first element of a list (or any sequence) is accessed with the index 0
.
instantiate
means to create an instance of an object or data structure.
Example: When you create a list in Python, you are “—-“ ing a list object.
Dependencies
Other modules that a package depends on to run properly
Python object
A collection of attributes and methods
Method
A function that is attached to an object as one of the object’s attributes
Attribute
A set of properties that belong to an object
abstract base class (ABCs)
provide a way to define interfaces when other techniques like hasattr()
would be clumsy. They introduce virtual subclasses, which are classes that don’t in herit from a class but are still recognized in isinstance()
and issubclass()
numbers class
The root of the numeric hierarchy.
abstraction
abstract category of objects
API
Application Programing Interfaces
are web services that website owners provide for users to request certain types of data from large databases that are frequently being updated.
web scraping
basically glorified copying and pasting straight from the webpage, but you get the computer to do all of the work automatically.
SQL data bases
robots.txt
root page of a web site that states the rules for that site about scraping (add this after the main URL)
Request
a query from the user to the web
Response
What the web server returns as a result of the request
Status code
A numeric code that the server sends in response with regard to the request’s status
Rate-limiting
A policy to limit the use of an API to a certain number of API requests over a period of time
API key
An authorization code that is passed to an API request and identifies the requester
JavaScript Object
JSON, a common data format used for API requests and responses that is easy for humans to read and write and easy for machines to parse and generate.
Parameters
Arguments sent to the API that define the request and expected response
Representational state transfer
REST, a set of constraints for building web APIs.
Repository
(repo) is a storage location on platforms like GitHub where code, data, and project files are kept. It allows for version control, collaboration, and organization.
Simply put it is like a folder on a platform like GitHub where you can store your project files and track changes. It helps you and others work together on the project.
Parse
To take formatted data (such as a JSON object as a string) and convert it into a data structure that your programming language can use, like a dictionary in Python.
Web scraping
The process of using a program to follow links, find data on pages, and download that dat for processing
Web crawlers
Also called spiders, programs that have the ability to automatically follow links on web pages and parse the data on a web page
NumPy
A library in Python that forms the basis for many other scientific computation libraries
pandas
Short for Paython data analysis library, a Python library that’s built on top of NumPy
NumPyarray
A table of elements that are all of the same type
DataFrame
A pandas data structure similar to spreadsheet
Series
A single column in a pandas Data Frame