Fundamentals of Coding Flashcards
What is scientific computing?
Coding for the purpose of science
We want to use coding to:
- Manage large amounts of data
- Apply some math/algorithms
- Visualize our data
Computer Scientists
Solving complex problems with math and computations
Software Engineers
Design and develop user-friendly software
Why can’t we just use Excel
- Highly inefficient
- Greater chance of introducing errors
- No permanent record
- Not reproduceable
R
- Developed for statistical computing
- Many statistical libraries
- Used primarily by researchers
- Free to use
Python
- General purpose language
- Large user base
- variety of libraries for data
- free to use
- widespread use across industry
MatLab
- heavily used in engineering
- excellent for signal processing
- Standard toolboxes
- requires a license
Functions
Block of code that only runs when you call it
Modules
Grouping of functions for similar tasks
Packages/Libraries
Grouping of modules for similar projects
Anaconda
A distribution of the Python for scientific computing and data science, that aims to simplify package management and development
Google CoLab
- Cloud-based platform for writing and executing python code
- hosted by Google drive, requiring only a Google account
- Provides free access to computational resources
Strings
- Variables that contains numbers, letters, or other characters
- CANNOT be used in computations
Numbers
Only contain numbers and can be used in computations
Floats
Precision numbers that carry decimal places and as such are most commonly used for storing data
- can be converted to an integer using the int command
Booleans
Represent true or false evaluation of content
Lists
- Single variables that store multiple items
- Defined by square brackets []
- Can store strings, floats, booleans, integers
Finding items on a list through index
Add .index() after the name of the list with the item we are looking for in the round brackets
Dictionaries
Allows you to store “keys” to pair with values/variables
- defined by curly brackets{}
Rules for naming variables
- Cannot start with a number
- Cannot use a reserved word as a name
- Cannot use a special symbol
- Upper and lowercase characters are read differently
Best practices when naming variables
- Never use the letters I or 0 as a single letter names (easily mistaken for 1 or 0)
- Don’t make them too general, but also not so descriptive that they become wordy
- Keep them short, but long enough to be descriptive
- Variable names should be lowercase with underscores used as a separator
- A constant can be fully capitalized
Control structures
Blocks of code that analyze variables and decide the direction the program takes
- Precondition: state of the variables entering a control structure
- Control structure: the program or algorithm that runs based on the preconditions
- Post condition: stat of the variables after the control structure has run
What are the 3 basic types of control structures
- Sequence: Simply runs 1 line after another like a recipe
- Selection: Allows for decisions and branching to occur within the block of code
- Iteration: Used for repeating lines of code
How do you combine strings
can use +
Coding Iterations
- can code to print directly - use placeholder for each variable in a list
- Can code to save in a variable (code empty variable to add to, use temp interation variable and then save the new string into empty variable using empty.append(temp variable))
abs()
Returns the absolute value of a number
len()
Returns the length of an object
max()
Returns the largest item in an object
min()
Returns the smallest item in an object
Round()
Rounds a number
sum()
sums the items of a list
NumPy
Allows you to create array objects that are easier to manipulate
Pandas
- Pandas allows us to combine numbers, strings, booleans etc. all in the same array if we want
- Can add headers
- Cleaner application for working with a variety of data
concat
Concatenate, meaning joining together in series
How to upload data file
pd.read_csv()
Creating plots
- import matplotlib.pyplot
- define x and y
-plt.whatever_type of graph(x,y)
-plt.xlabel(), plt.ylabel()
-plt.show()
What is the difference between a function and library
Function is a block of reusable code that performs a specific task whereas as a library is a collection of modules and functions that provide prewritten code for various tasks
Indexing an array
- Print(Array_name[x,y]
- For all in a column or row [:,:]
- multiple numbers in a row or column [[x,x,x],y]
Indexing in Pandas
Grabs a row
df.loc[x]
in a range: df.loc[x:x]
Adding a row to pandas
pd.dataframe([data], columns =[column_names])
pd.concat([old_df,additional_row], axis=0, ignore_index=true
Adding a column in pandas
pd.dataframe([new data], columns= column_name
new_df = pd.concat([new_df, new_column], axis=1)