DA (Python) Flashcards

Question 1

Q

Mean (Pandas)
All Means
Mean for a column

Answer

A

df.mean()
df[‘a’].mean()

Question 2

Q

Mode (Pandas)
all Modes
Mode for a column

Answer

A

df.mode()
df[‘a’].mode()

Question 3

Q

Median (Pandas)
all Medians
Median for a column

Answer

A

df.median()
df[‘a’].median()

Question 4

Q

Standard Deviation (Pandas)
all Standard Deviations
Standard Deviation for a column

Answer

A

df.std()
df[‘a’].std()

Question 5

Q

Load data

Answer

A

import pandas as pd

df = pd.read_csv(‘SeaLevels.csv’)

Question 6

Q

Read data

Answer

A

df.head()

First 5 rows

Question 7

Q

dealing with duplicates

Answer

A

z = [1,2,3,1,4,5,1]
seen = set()
cz = [x for x in z if not (x in seen or seen.add(x))]
print (cz)

[1,2,3,4,5]

Question 8

Q

Slices

Answer

A

Ranges from a list

myList = [1,2,3,4,5]
print(myList[:3]) - up to third
print(myList[1:]) - from 1 onwards
print(myList[2:4]) - from 3 to 4

[1, 2, 3]
[2, 3, 4, 5]
[3, 4]

Question 9

Q

Creating a dataframe

Answer

A

N = [‘Jack’, ‘Jill’, ‘John’]
H = [180, 170, 200]
S = [9, 5, 8]

df = pd.DataFrame({‘N’: N, ‘H’: H, ‘S’: S})

print(df)

Question 10

Q

loop code

Answer

A

myList = [1,2,3,4,5,6]
sum = 0.0
for item in myList:
sum = sum + item
print(sum)

Question 11

Q

built in functions

Answer

A

print(np.max(z))
print(np.min(z))
print(np.sum(z))
print(np.mean(z))
print(np.median(z))

Question 12

Q

conditional selection

Answer

A

e = df[(df[‘N’] != ‘Jack’ ) & (df[‘H’] > 170)]

print(e)

Question 13

Q

function

Answer

A

def MULT_of_3(num):
return num % 3 == 0

mO3 = [num for num in multList if MULT_of_3(num)]

print(“Multiples of 3 in the list:”, mO3)

Question 14

Q

Drop nulls

Answer

A

completeRows = df.dropna()

Question 15

Q

Select the 2nd and 3rd shoe size
Select 1st and 2nd Name
Find the mean height
Find the max height
Find the min shoe size
Find the median shoe size

Answer

A

multList = [4,3,6,7,43,56,453,67,544,322,37,87,77,79,36,25,320]

print(multList[1:3])
print(multList[0:2])
print(np.mean(multList))
print(np.max(multList))
print(np.min(multList))
print(np.median(multList))

Question 16

Q

Drop duplicates

Answer

Study These Flashcards

A

cleaned = completeRows.drop_duplicates(subset=[‘N’,’H’,’A’,’B’,’S’])
print(cleaned)

Question 17

Q

groupby

Answer

Study These Flashcards

A

mean = df.groupby(‘a’)[‘Shoe Size’].mean()

Question 18

Q

Standard Deviation (Code)

Answer

Study These Flashcards

A

df.std()
std = df.[‘Shoe Size’].std()

Question 19

Q

Correlation (Python)

Answer

Study These Flashcards

A

correlation = df.corr()
correlation = df.[[‘column A’, ‘Column B’]].corr()

DA (Python) Flashcards

(19 cards)