Libraries - pandas Flashcards
Import the pandas library
import pandas as pd
Read the data from a CSV file called “stats.csv” (saved in the same directory as the script) and store it in the variable “df”
“df” stands for DataFrame
df = pd.read_csv("stats.csv")
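A minimal sketch of the same call that runs without a file on disk. The CSV contents and column names here are made up for illustration, and io.StringIO stands in for “stats.csv”.

```python
import io
import pandas as pd

# Hypothetical contents standing in for "stats.csv" (not from a real file)
csv_text = "player,points\nAna,12\nBen,7\n"

# read_csv accepts a path such as "stats.csv" or any file-like object
df = pd.read_csv(io.StringIO(csv_text))
print(df)
```

With a real file, pass the path instead: pd.read_csv("stats.csv").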
pd.read_csv("example.csv")
Explain the data structure pandas uses to store the data from “example.csv”
Conceptually, pandas uses something like a dictionary: the column headers are the keys, and the value for each key is a list of the data from the “rows” under that header. (Strictly, each column is a pandas Series rather than a plain Python list.)
However, pandas also assigns an index number to each item in each list of data.
Essentially you just need to remember that the “keys” are like the column letters in Excel, and the index numbers assigned to the values are like the row numbers in Excel (though the index starts from 0 by default).
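The dictionary analogy can be sketched by building a DataFrame straight from a dictionary. The names and values below are hypothetical.

```python
import pandas as pd

# Hypothetical data: keys become column headers, lists become column values
data = {"name": ["Ana", "Ben", "Cy"], "score": [12, 7, 9]}
df = pd.DataFrame(data)

print(list(df.columns))      # the "keys" (column headers)
print(list(df.index))        # the row index numbers, starting at 0
print(df.loc[1, "score"])    # look up one cell by index number and column key
```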
A pandas DataFrame is stored in variable “df”
Preview the first 5 “rows” in the terminal
print(df.head())
This prints only the “top” 5 rows
A pandas DataFrame is stored in variable “df”
Preview the last 5 “rows” in the terminal
print(df.tail())
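A small sketch of head() and tail() on a hypothetical 10-row DataFrame. Both return 5 rows by default and accept an optional row count.

```python
import pandas as pd

df = pd.DataFrame({"n": range(10)})  # hypothetical data: rows 0 to 9

print(df.head())     # first 5 rows
print(df.tail())     # last 5 rows
print(df.head(3))    # optional argument: first 3 rows only
```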
A pandas DataFrame in variable “df” has a date column with the key-header “date”
Convert this date column to the datetime format
df['date'] = pd.to_datetime(df['date'])
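A minimal sketch of the conversion, assuming the dates were read in as text in year-month-day form. The sample dates are made up.

```python
import pandas as pd

# Hypothetical date strings, as they might look right after read_csv
df = pd.DataFrame({"date": ["2023-01-15", "2023-02-20", "2023-03-05"]})
print(df["date"].dtype)   # not yet a datetime type

df["date"] = pd.to_datetime(df["date"])
print(df["date"].dtype)   # now a datetime type
```

Once the column is a datetime type, the .dt accessor becomes available on it.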
A pandas DataFrame in variable “df” has a date column with the key-header “date”
The date column has been converted to datetime
Print all the month numbers to the terminal
print(df['date'].dt.month)
A pandas DataFrame in variable “df” has a date column with the key-header “date”
The date column has been converted to datetime
Create a new column called “month” showing the month name only
df['month'] = df['date'].dt.month_name()
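A short sketch showing month numbers and month names side by side. The dates and the “month_number” column name are hypothetical, added only for comparison.

```python
import pandas as pd

# Hypothetical dates, already converted to datetime
df = pd.DataFrame({"date": pd.to_datetime(["2023-01-15", "2023-02-20", "2023-12-05"])})

df["month_number"] = df["date"].dt.month      # integers 1 to 12
df["month"] = df["date"].dt.month_name()      # full English names by default
print(df)
```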