First Flashcards

Question

Numpy coefficient

Answer 1

Are two things related np.corrcoef(x, y) also subset with np.corrcoef(df[:, 0], df[:,1])

Answer 2

np.std(x) also subset with np.std(df[:, 0])

Answer 3

np.sum(x) also subset with np.sum(df[:, 0])

Answer 4

np.columnstack((df_x, df_y))

Answer 5

plt. plot(x, y) | plt. show()

Answer 6

plt. scatter(x, y) | plt. show()

Answer 7

plt. hist(x, bins = #) | plt. show()

Answer 8

plt. xlabel('x') plt. ylabel('y') plt. title('title') plt. yticks([0,1,2,3,4]) plt. xticks([0,1,2,3], ['0', '1B', '2B', '3B']) # Reassign numbers on y -axis and change the name of y-axis ticks)

Answer 9

dict = {'k':v, 'k1':v1, 'k2',v2....} world = {'Nepal': 30.5, 'India': 1000, 'Bhutan' : 0.5}

Answer 10

dict.keys() > world = {'Nepal': 30.5, 'India': 1000, 'Bhutan' : 0.5} > print(world.keys()) Nepal, India, Bhutan

Answer 11

dict['k'] = v > world = {'Nepal': 30.5, 'India': 1000, 'Bhutan' : 0.5} > world['China'] = 1050 > print(world) {'Nepal': 30.5, 'India': 1000, 'Bhutan' : 0.5, 'China' : 1050}

Answer 12

del(dict['k']) > world = {'Nepal': 30.5, 'India': 1000, 'Bhutan' : 0.5} > del(world['Bhutan']) > world world = {'Nepal': 30.5, 'India': 1000}

Answer 13

pd.DataFrame(dict) > world = {'Nepal': 30.5, 'India': 1000, 'Bhutan' : 0.5} > df = pd.DataFrame(world)

Answer 14

pd.read_csv('path/to/dataframe.csv', index_col = 0) index_col = 0 means that the pd will not index the df

Answer 15

df['colname']

Answer 16

df[['colname']]

Answer 17

df[['col1', 'col2']]

Answer 18

df.loc[['k']] >df.loc[['RU']] Country Capital Area RU Russsia Moscow 17.1

Answer 19

df.loc[['k1', 'k2', 'k3']] >df.loc[['RU', 'IN']] Country Capital Area RU Russsia Moscow 17.1 IN India Delhi 3.2

Answer 20

df.loc[['k1', 'k2', 'k3'], ['col1', 'col2'] >df.loc[['RU', 'IN'], ['Country', 'Capital']] Country Capital RU Russsia Moscow IN India Delhi

Answer 21

df.iloc[[#]] >df.iloc[[1]] Country Capital Area RU Russsia Moscow 17.1

Answer 22

df.iloc[[#, #, #]] >df.loc[[1, 2]] Country Capital Area RU Russsia Moscow 17.1 IN India Delhi 3.2

Answer 23

df.iloc[[#, #, #], [#, #] >df.loc[[1, 2], [0, 1]] Country Capital RU Russsia Moscow IN India Delhi

Answer 24

[ ] is a pd. series where as [[ ]] is a pd. dataframe

Answer 25

both booleans need to be true > False and False True > x = 12 > x > 7 and x < 15 True > False and True False

Answer 26

at least one boolean needs to be true >True or False True > x = 5 > x < 7 or x > 13

Answer 27

logical_and() logical_or() logical_not() > y = [[5, 7, 9]] > np.logical_and(y > 5, y <9) [[False, True, False]]

Answer 28

Filter > df2 = df[‘col’] > # or Subset > df2 = df[df[‘col’] > #]

Answer 29

Filter > np.logical_and(df['col'] > #, df['col'] < #) or subset > df[np.logical_and(df['col'] > #, df['col'] < #)]

Answer 30

> fam = [1.5, 1.6, 1.7] ``` > for index, height in enumerate(fam): > print(str(index) + ' : ' + str(height)) 1 : 1.5 2 : 1.6 3 : 1.7 ```

Answer 31

First always key and then value > world = {'Nepal': 30.5, 'India': 1000, 'Bhutan' : 0.5} > for k, v in world.items(): > print(k + ' : ' + str(v)) Nepal : 30.5 India : 1000 Bhutan : 0.5

Answer 32

iterrows() not very efficient because on every iteration you are creating a new pandas series > for lab, row in brics.iterrows(): > print(lab + ' : ' + row['captial'] BR : Brasilia RU : Moscow

Answer 33

apply() > brics['name_length'] = brics['country'].apply(len) name_length BR Brazil Brasilia 6

Answer 34

np.random.rand()

Answer 35

np.random.seed(#) sets the random seed, so that your results are reproducible between simulations. As an argument, it takes an integer of your choosing. If you call the function, no output will be generated. np.random.rand() if you don't specify any arguments, it generates a random float between zero and one. > np.random.seed(123) > coin np.random.rand(0, 2) #Randomly generate 1 or 0

Answer 36

np.transpose(df)

First Flashcards

(61 cards)