Quantopian Flashcards
You can print lines you want with the word print
True
In python, what is pandas?
A library
If I gave numpy the alias “np”, and I imported some data and named it “sample”, how would I calculate the mean and standard deviation of the data using numpy?
np.mean(sample), np.std(sample)
In Quantopian’s IPython interface, what is one of the functions that can be used to pull some real price data?
get_pricing()
What is one of the functions that can be used to calculate returns?
pct_change()
If I wanted to calculate the 100 day moving average using daily data, I can use pandas built-in tools to do so. The data is loaded in and named “price_data”. Assuming I load in pandas with the alias “pd”, then what would be the correct way to write the function to calculate the 100 day moving average using pandas?
pd.rolling_mean(price_data, window=100)
If “”” This text is enclosed between the triple quotes “”” then it is referred to as
a string
13.198 is what type of variable in Python?
Float
What happens to the number 13.198 when I run the following code,
int(13.198)
Truncate any digits after the decimal point
What will running the following code in Python create?
[1, ‘This’, 3, ‘that’]
List
What can the colon symbol be used to slice in Python?
List
If I wanted to grab the last item from a list in Python, which line of code would I need to run? Assume the name of the list is defined as X.
X[-1]
You can change any part of a tuple because it is immutable.
False
If I have two different variables, X and Y, and I wanted to know if they are the same, I would run the following code
print (X == Y)
True
What are the two main components that make up a dictionary in Python?
Key and Value
Numpy arrays can be created with multiple dimensions
True
Assume an array is defined as “tmp”. Which line of code would we need to run to get the dimensions of the array?
tmp.shape
A slice of an array will select a group of elements starting from the first element specified up to, and including, the last element indicated.
False
If I wanted to select the first row of a 2-d array, I would execute which of the following code? Assume the array is defined as “tmp”.
tmp[0,:]
Assume you have a set of returns and weights such that the weights are between 0 and 1, and sum to 1. Which line of code would return the expected return of your portfolio? Assume you load the numpy library in as np, define average returns as “mean_returns”, and weights as “weights”.
np.dot(weights, mean_returns)
Assume you define weights as “weights”, average returns as “mean_returns”, the covariance of returns as “cov”, and you load the numpy library in as np. How would I use this information to calculate the volatility of my portfolio?
np.dot(np.dot(weights, cov), weights.T)
A pandas Series is a 1-dimensional array with labels that can contain any data type.
True
Every Series has a name.
True
If you have a sample of daily data and the index is in DatetimeIndex format, which line of code would allow me to convert the daily data to monthly data? Assume the daily data is defined as “x”.
x.resample(‘M’)
Assume I have return data that has some missing values (‘NaN’). Which line of code will replace the missing values with the average returns? Assume you define the returns data as “x”.
x.fillna(x.mean())
The .dropna() function will automatically replace NaN values with the average
False
If you wanted to print the summary statistics for your returns data, defined as “x”, how would you do so?
x.describe()
If you are given a set of price data, defined as “x”, how would you calculate the multiplicative returns using pandas built-in methods simultaneously dropping the first set of missing values?
x.pct_change()[1:]
Assume you have a set of monthly pricing data defined as “x” and you load in the pandas library as pd. Which pandas built-in function allows you to calculate the 12 month rolling average price?
pd.rolling_mean(x, 12)
If you have monthly pricing data defined as “x” and you load in the pandas library as pd. Which line of code would calculate the 12 month rolling volatility?
pd.rolling_std(x, 365)
If you have a pandas DataFrame defined as “df” and you want the average of each column, which line of code would you run?
df.mean(axis=0)
Assume you have a pandas DataFrame defined as “df”. If you wanted to access the names of the columns, you would run which line of code?
df.columns
To get the standard deviation from a pandas DataFrame defined as “df”, I could run which of the following lines of code?
df.std()