Python Pandas Flashcards

1
Q

What are pandas in Python?

A

A data analysis and manipulation library.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do you import pandas in Python?

A

import pandas as pd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a DataFrame in pandas?

A

A 2D labeled data structure with columns of potentially different types.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Series in pandas?

A

A 1D labeled array capable of holding any data type.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do you create a DataFrame from a dictionary?

A

pd.DataFrame({‘col1’: [1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you read a CSV file with pandas?

A

pd.read_csv(‘file.csv’)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do you write a DataFrame to a CSV file?

A

df.to_csv(‘file.csv’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do you display the first 5 rows of a DataFrame?

A

df.head()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you display the last 5 rows of a DataFrame?

A

df.tail()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you get DataFrame column names?

A

df.columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you get DataFrame index values?

A

df.index

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do you get a quick summary of a DataFrame?

A

df.info()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do you get statistical summary of numeric columns?

A

df.describe()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you select a single column from a DataFrame?

A

df[‘column_name’]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you select multiple columns from a DataFrame?

A

df[[‘col1’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do you select rows by index?

A

df.loc[0] or df.iloc[0]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How do you filter rows by condition?

A

df[df[‘col’] > 10]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How do you add a new column to a DataFrame?

A

df[‘new_col’] = values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How do you drop a column from a DataFrame?

A

df.drop(‘col’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

How do you drop a row from a DataFrame?

A

df.drop(index

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How do you rename columns in a DataFrame?

A

df.rename(columns={‘old’: ‘new’}

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How do you check for missing values?

A

df.isnull()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How do you fill missing values with 0?

A

df.fillna(0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

How do you drop rows with missing values?

A

df.dropna()

25
Q

How do you sort a DataFrame by column?

A

df.sort_values(‘col’)

26
Q

How do you reset the index of a DataFrame?

A

df.reset_index(drop=True)

27
Q

How do you set a column as the index?

A

df.set_index(‘col’

28
Q

How do you group data by a column?

A

df.groupby(‘col’)

29
Q

How do you aggregate grouped data?

A

df.groupby(‘col’).agg({‘val’: ‘sum’})

30
Q

How do you merge two DataFrames?

A

pd.merge(df1

31
Q

How do you concatenate two DataFrames vertically?

A

pd.concat([df1

32
Q

How do you concatenate two DataFrames horizontally?

A

pd.concat([df1

33
Q

How do you pivot a DataFrame?

A

df.pivot(index=’a’

34
Q

How do you melt a DataFrame?

A

pd.melt(df

35
Q

How do you check for duplicates?

A

df.duplicated()

36
Q

How do you drop duplicates?

A

df.drop_duplicates()

37
Q

How do you convert column to datetime?

A

pd.to_datetime(df[‘col’])

38
Q

How do you extract year from datetime column?

A

df[‘col’].dt.year

39
Q

How do you apply a function to a column?

A

df[‘col’].apply(func)

40
Q

How do you map values in a column?

A

df[‘col’].map({‘old’: ‘new’})

41
Q

How do you convert a column to numeric?

A

pd.to_numeric(df[‘col’]

42
Q

How do you create a DataFrame from a list?

A

pd.DataFrame(data

43
Q

How do you export a DataFrame to Excel?

A

df.to_excel(‘file.xlsx’

44
Q

How do you read an Excel file?

A

pd.read_excel(‘file.xlsx’)

45
Q

How do you check data types of columns?

46
Q

How do you change data type of a column?

A

df[‘col’] = df[‘col’].astype(‘int’)

47
Q

How do you get the number of rows and columns?

48
Q

How do you count unique values in a column?

A

df[‘col’].nunique()

49
Q

How do you get value counts in a column?

A

df[‘col’].value_counts()

50
Q

How do you sample random rows from a DataFrame?

A

df.sample(n=5)

51
Q

How do you get the memory usage of a DataFrame?

A

df.memory_usage()

52
Q

How to read a CSV file?

A

employee_df = pd.read_csv(‘employee_information.csv’)

53
Q

How to import the pandas lib to Python?

A

Import pandas as pd

54
Q

What is the key structure of pandas?

A

df (data frame)

55
Q

How to create a data frame?

A

x = pd.df({Employe ID:[1,2,3,4]’})

56
Q

How do I get a statistical summary?

A

x.describe() (you will get the mean,std,min,25%, 50%, 75%,max)

57
Q

How to Normalize the data?

A

df[‘normalized’] = (df[‘col’] - df[‘col’].min()) / (df[‘col’].max() - df[‘col’].min())

58
Q

What is MatLibplot?

59
Q

What is sklearn?