Numpy and Pandas Fundamentals Flashcards

Question 1

Q

pd.Series

Answer

A

Question 2

Q

pd.DataFrame

Answer

A

Retrieve column by dict-like or attribute syntax
Get rows by frame.ix[‘row_index’]
accepts rows of dicts, dicts of dicts, and dicts of format ‘key’:list() where lists are of equal length (or np.arrays)
del dataframe[‘colname’] works as expected
columns and row indices can have names just like Series
frame.values returns a 2d array (rows, columns)
Accepts Numpy masked arrays. ‘masked’ elements are NA/missing in the result
list of lists or list of tuples defaults to passing in row-wise

Question 3

Q

index objects

Answer

A

methods

Question 4

Q

reindex

Answer

A

creates a new object that conforms to the new index
interpolation methods like ffill or bfill
with dataframe, rows is the default, but columns can be specified by keyword. new columns will be NaN, columns left out will be omitted.
limit option sets max of ffill or bfill
fill_value can fill elements that don’t exist

Question 5

Q

index selection and filtering

Answer

A

Series and DataFrames

DataFrames

Question 6

Q

indexing options with DataFrame

Answer

A

obj[val] select single col or sequence of columns, boolean, slice, boolean dataframe
obj.ix[val] select single row or subset of rows
obj.ix[: , val] select single column of subset of columns
obj.is[val1, val2] select both rows and columns
reindex - conform one or more axes to new index
xs method = select single row or column as a series by label
icol, irow methods : select column or row, respectively, as series by integer location
get_value, set_value: select single value by row and column lable

Question 7

Q

Arithmetic methods on DataFrames

Answer

A

each is a method on DataFrame with an optional fill_value argument for elements that do not have a match

So adding elements that don’t have a match will produce NaN, but fill_value=0 will produce identity.

Question 8

Q

Arithmatic between Series and DataFrames

Answer

A

df - series: will broadcast the series row down all the rows of the df
if an index wasn’t found in the series, that column will be added as NaN to the df
DataFrame arithmetic methods are used for column-wise math

ex. dframe.sub(series, axis = 0)

Question 9

Q

Function application and mapping

Answer

A

numpy ufuncs (element-wise array methods) work with dataframes and series
dataframe has an apply method, like R’s apply. however, axis = 0 applies ACROSS rows (colsum in R), axis = 1 applies ACROSS columns (rowsum in R)
Function to apply need not return a scalar - can also be a Series object.
applymap performs element wise operations ex. format = lambda x: “%.2f’ % x
5.

Fundamentals of Numpy and Pandas from the Pandas book