pandas Flashcards
pd.ExcelFile( )
class for parsing tabular Excel sheets into DataFrame objects; it opens the workbook once so several sheets can be read from it
data type of the object returned by pd.ExcelFile( )
pandas.io.excel._base.ExcelFile
xls.sheet_names
return a list of sheet names in the xls
xls.parse( )
parse the specified sheet(s) in xls into a DataFrame, equivalent to pd.read_excel(xls, ...)
df.duplicated( )
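return a boolean Series marking duplicate rows
- subset = [column label or sequence of labels]
only consider certain columns for identifying duplicates; by default use all of the columns
- keep = {'first', 'last', False}
which occurrence stays unmarked: 'first' (default) marks every duplicate except the first occurrence, 'last' marks every duplicate except the last, False marks all duplicates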
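A minimal sketch of how subset and keep change the result. The frame and its column names (name, city) are made up for illustration.

```python
import pandas as pd

# Hypothetical example data: rows 0 and 2 are identical.
df = pd.DataFrame({
    "name": ["Ann", "Bob", "Ann", "Ann"],
    "city": ["Oslo", "Rome", "Oslo", "Paris"],
})

# Default: compare all columns, leave the first occurrence unmarked.
dup_all = df.duplicated()

# Compare only "name", leave the last occurrence unmarked.
dup_name_last = df.duplicated(subset=["name"], keep="last")

# Compare only "name", mark every row that has a duplicate.
dup_name_any = df.duplicated(subset=["name"], keep=False)

print(dup_all.tolist())
print(dup_name_last.tolist())
print(dup_name_any.tolist())
```

The boolean Series can be used as a filter, e.g. df[~df.duplicated()] keeps only the non-duplicate rows.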
pd.pivot_table( )
create a spreadsheet-style pivot table as a DataFrame
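- data: data frame providing the info
- values: column(s) to aggregate
- index: column(s) whose values become the row labels
- columns: column(s) whose values become the column labels
- aggfunc: aggregation function(s) applied to values, default 'mean'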
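A small sketch of the parameters above working together. The sales frame and its column names (region, product, amount) are assumptions for illustration only.

```python
import pandas as pd

# Hypothetical example data.
sales = pd.DataFrame({
    "region": ["East", "East", "West", "West", "East"],
    "product": ["A", "B", "A", "B", "A"],
    "amount": [10, 20, 30, 40, 5],
})

# Rows = region, columns = product, cells = summed amount.
table = pd.pivot_table(
    sales,
    values="amount",
    index="region",
    columns="product",
    aggfunc="sum",
)

print(table)
```

Here the two East/A rows (10 and 5) collapse into a single cell, which is the summarizing step a pivot table performs.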
pivot table
a table of statistics that summarizes data of a more extensive table (such as from a database, spreadsheet, or business intelligence program)
df.isnull( )
detect missing values; return a boolean DataFrame of the same shape, True where the element is missing (NaN or None)
Parameters:
- None
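A short sketch of df.isnull( ) on a made-up frame; the column names a and b are illustrative.

```python
import numpy as np
import pandas as pd

# Hypothetical example data with one missing value per column.
df = pd.DataFrame({"a": [1.0, np.nan], "b": ["x", None]})

# Same shape as df, True where the value is missing.
mask = df.isnull()

# Summing the booleans counts missing values per column.
missing_per_column = mask.sum()

print(mask)
print(missing_per_column)
```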
se.map( )
Map values of Series according to input correspondence.
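A minimal sketch of se.map( ) with two kinds of correspondence, a dict and a function. The Series values are made up.

```python
import pandas as pd

# Hypothetical example data.
se = pd.Series(["cat", "dog", "bird"])

# Dict correspondence: values without a matching key become NaN.
legs = se.map({"cat": 4, "dog": 4})

# Function correspondence: applied to each element.
upper = se.map(str.upper)

print(legs)
print(upper)
```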
pd.concat( )
Concatenate pandas objects along a particular axis
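A brief sketch of both axes, using made-up frames: axis=0 stacks rows, axis=1 places objects side by side.

```python
import pandas as pd

# Hypothetical example data.
top = pd.DataFrame({"x": [1, 2]})
bottom = pd.DataFrame({"x": [3]})

# axis=0 (default): stack rows; ignore_index renumbers them 0..n-1.
stacked = pd.concat([top, bottom], axis=0, ignore_index=True)

left = pd.DataFrame({"x": [1, 2]})
right = pd.DataFrame({"y": ["a", "b"]})

# axis=1: align on the index and put the columns next to each other.
side_by_side = pd.concat([left, right], axis=1)

print(stacked)
print(side_by_side)
```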
df.se.str.cat( )
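concatenate the strings in a Series with an optional separator (sep); passing another column as the first argument joins two string columns of a data frame element-wise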
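A small sketch of joining two string columns with str.cat( ). The column names first, last and full are assumptions; bracket access is used because df.first would collide with an existing DataFrame attribute.

```python
import pandas as pd

# Hypothetical example data.
df = pd.DataFrame({
    "first": ["Ada", "Alan"],
    "last": ["Lovelace", "Turing"],
})

# Join the two columns element-wise, separated by a space.
df["full"] = df["first"].str.cat(df["last"], sep=" ")

print(df["full"].tolist())
```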