2 Introduction to Python (II) Flashcards

Question 1

Q

1 What is Matplotlib?

Answer

A

A plotting library for the Python programming language and its numerical mathematics extension NumPy. It provides an object-oriented API for embedding plots into applications using general-purpose GUI

Question 2

Q

2 Complete code:

import matplot… as …

Answer

A

import matplotlib.pyplot as plt

Question 3

Q

3 Make a line plot (year x-axis, pop y-axis)

year=[‘1975’,’1976’,’1977’]
pop=[2340,2405,2890]

Answer

A

import matplotlib.pyplot as plt

plt. plot(year,pop)
plt. show()

Question 4

Q

4 How to display a matplotlib plot?

Answer

A

plt.show()

Question 5

Q

5 Print the last item of the list year:

year=[‘1975’,’1976’,’1977’]

Answer

A

print(year[-1])

print(year[2])

Question 6

Q

6 What is a scatter plot?

Answer

A

A type of plot or mathematical diagram using Cartesian coordinates to display values for typically two variables for a set of data

Question 7

Q

7 Complete code (scatter plot):

x = [1,3,5]
y= [2,6,7]

’'’import mat….
…
plt.show()’’’

Answer

A

import matplotlib.pyplot as plt

plt. scatter(x,y)
plt. show()

Question 8

Q

8 Change the line plot below to a scatter plot

year=[‘1975’,’1976’,’1977’]
pop=[2340,2405,2890]

import matplotlib.pyplot as plt

plt. plot(year,pop)
plt. show()

Answer

A

plt. scatter(year,pop)

plt. show()

Question 9

Q

9 Put the x-axis on a logarithmic scale

day=[‘1’,’2’,’3’]
virus=[18,55,320]

import matplotlib.pyplot as plt

plt. scatter(day,virus)
plt. show()

Answer

A

plt. scatter(day,virus)
plt. xscale(‘log’)
plt. show()

Question 10

Q

10 What is a correlation coefficient?

Answer

A

A value that indicates the strength of the relationship between variables. The coefficient can take any values from -1 to 1.

Question 11

Q

11 What is a histogram?

Answer

A

An approximate representation of the distribution of numerical or categorical data

Question 12

Q

12 Create histogram

years = [1975,1976,1978,1975]

Answer

A

import matplotlib.pyplot as plt

plt. hist(years)
plt. show()

Question 13

Q

13 Create histogram with 5 bins using data (list)

data = [random.randint(1, 5) for _ in range(100)]

Answer

A

plt.hist(data,bins=5)

Question 14

Q

14 What is the use of plt.clf() ?

Answer

A

Cleans a plot up again so you can start afresh

Question 15

Q

15 You want to visually assess if the grades on your exam follow a particular distribution. Which plot do you use?

Answer

A

Histogram

Question 16

Q

16 You want to visually assess if longer answers on exam questions lead to higher grades. Which plot do you use?

Answer

A

Scatter plot

Question 17

Q

17 Add labels

year =list(range(1975,2000))
scores = list(range(1,26))

plt.scatter(year,scores)
…

Answer

A

plt. xlabel(‘year’)
plt. ylabel(‘scores’)
plt. show()

Question 18

Q

18 Add ‘scores’ as a title

data = [int(random.randint(1, 5)) for _ in range(100)]
plt.hist(data,bins=5)
…
plt.plot()

Answer

A

plt.title(‘years’)

Question 19

Q

19 Add log scale

year =list(range(1975,2000))
scores= [2**n for n in range(25)]

plt.scatter(year,scores)
…

Answer

A

plt. yscale(‘log’)

plt. show()

Question 20

Q

20 What are ticks in matplotlib?

Answer

A

Ticks are the values used to show specific points on the coordinate axis. It can be a number or a string.

Question 21

Q

21 What is a legend in matplotlib?

Answer

A

The legend of a graph reflects the data displayed in the graph’s Y-axis

Question 22

Q

22 Change the ticks in the x-axis to strings

x=[1, 3, 5]
y=[1, 5, 9]

import matplotlib.pyplot as plt
plt.scatter(x,y)

Answer

A

plt. xticks(x, [“one”,”three”,”five”])

plt. show()

Question 23

Q

23 Write a scatter plot with gdp as independent variable and population size as the size argument

gdp=[100, 200, 300]
life_exp=[50, 70, 82]
pop_size=[30,20,40]

Answer

A

import matplotlib.pyplot as plt

plt. scatter(gdp, life_exp, s =pop_size)
plt. show()

Question 24

Q

24 What is a dependent variable?

Answer

A

A variable (often denoted by y ) whose value depends on that of another.

Question 25

Q

25 What is an independent variable?

Answer

A

A variable (often denoted by x ) whose variation does not depend on that of another.

Question 26

Q

26 Code: Scatter plot with text ‘A’ pointing at the second element

gdp=[100, 200, 300]
life_exp=[50, 70, 82]

Answer

A

import matplotlib.pyplot as plt

plt. scatter(gdp, life_exp)
plt. text(195,65,’A’)
plt. show()

Question 27

Q

27 Add a grid to a matplot figure

Answer

A

plt.grid(True)

Question 28

Q

28 Get the position of germany

countries = [‘spain’, ‘france’, ‘germany’, ‘norway’]

Answer

A

countries.index(‘germany’)

Question 29

Q

29 What is the difference between list and dictionary in Python?

Answer

A

A list is an ordered sequence of objects, whereas dictionaries are unordered sets. But the main difference is that items in dictionaries are accessed via keys and not via their position.

Question 30

Q

30 Get the keys

europe = {‘spain’:’madrid’, ‘france’:’paris’, ‘germany’:’berlin’, ‘norway’:’oslo’ }

Outcome:
dict_keys([‘spain’, ‘france’, ‘germany’, ‘norway’])

Answer

A

print(europe.keys())

Question 31

Q

31 Get the capital of norway

europe = {‘spain’:’madrid’, ‘france’:’paris’, ‘germany’:’berlin’, ‘norway’:’oslo’ }

Outcome: oslo

Answer

A

print(europe[‘norway’])

Question 32

Q

32 Add italy and rome to the dictionary

europe = {‘spain’:’madrid’, ‘france’:’paris’,
‘germany’:’berlin’ }

Answer

A

europe[‘italy’]=’rome’

Question 33

Q

33 Check whether the dictionary has spain

europe = {‘spain’:’madrid’, ‘france’:’paris’,
‘germany’:’berlin’ }

Answer

A

print(‘spain’ in europe)

Question 34

Q

34 Outcome of:

europe = {‘spain’:’madrid’, ‘france’:’paris’, ‘germany’:’berlin’, ‘norway’:’oslo’ }

print(‘madrid’ in europe)

Question 35

Q

35 Delete spain

europe = {‘spain’:’madrid’, ‘france’:’paris’,
‘norway’:’oslo’}

Answer

A

del(europe[‘spain’])

Question 36

Q

36 Update the capital of spain with madrid

europe = {‘spain’:’Barcelona’, ‘france’:’paris’,
‘norway’:’oslo’}

Answer

A

europe[‘spain’]=’madrid’

Question 37

Q

37 Get the capital of france

europe = { ‘spain’:
{ ‘capital’:’madrid’, ‘population’:46.77 },
‘france’: { ‘capital’:’paris’, ‘population’:66.03 }}

Answer

A

print(europe[‘france’][‘capital’])

Question 38

Q

38 Complete Code

dr =[False, False, True]
names = ['Spain','France','UK']
...
...
#Outcome:
 country drives_right
0 Spain False
1 France False
2 UK True

Answer

A

import pandas as pd

my_dict={‘country’:names, ‘drives_right’:dr}

print(pd.DataFrame(my_dict))

Question 39

Q

39 Use row_labels as index of the dataframe

ages = [i for i in range(3)]
df_ages = pd.DataFrame(ages, columns = ['Ages'])
names = ['Jon','Jorge','Ana']

Ages
Jon 0
Jorge 1
Ana 2

Answer

A

df_ages.index = names

print(df_ages)

Question 40

Q

40 Transform the csv to a dataframe called cars

cars.csv

Answer

A

import pandas as pd

cars = pd.read_csv(‘cars.csv’)

Question 41

Q

41 Set the first column as row labels

import pandas as pd
cars = pd.read_csv(‘cars.csv’,..(code)..)

Answer

A

cars = pd.read_csv(‘cars.csv’, index_col = 0)

Question 42

Q

42 What is a panda series?

Answer

A

A one-dimensional labeled array capable of holding data of any type (integer, string, float, python objects, etc.). Pandas Series is nothing but a column in an excel sheet.

Question 43

Q

43 Print the column country of df as Panda Series

countries = [‘Spain’,’France’,’UK’]
df =pd.DataFrame(countries, columns = [‘country’])

0 Spain
1 France
2 UK
Name: country, dtype: object

Answer

A

print(df[[‘country’]])

Question 44

Q

44 Print the column country of df as dataframe

countries = [‘Spain’,’France’,’UK’]
df =pd.DataFrame(countries, columns = [‘country’])

#Outcome:
 country
0 Spain
1 France
2 UK

Answer

A

print(df[[‘country’]])

Question 45

Q

45 Print out columns a, b from df

Answer

A

print(df[[‘a’,’b’]])

Question 46

Q

46 Print out first 2 observations (2 methods)

import pandas as pd
n = [i for i in range(3)]
df =pd.DataFrame(n, columns = [‘number’])

Answer

A

Outcome

print(df[:2])
print(df.head(2))

number
0 0
1 1

Question 47

Q

47 Print out the fourth, fifth and sixth observation

import pandas as pd
n = [i for i in range(0,20,2)]
df =pd.DataFrame(n, columns = [‘number’])

Answer

A

print(df.iloc[3:6])

Question 48

Q

48 What is loc in python?

Answer

A

A method that takes only index labels and returns row or dataframe if the index label exists in the caller data frame

Question 49

Q

49 What is a DataFrame in Python?

Answer

A

is two-dimensional size-mutable, potentially heterogeneous tabular data structure with labeled axes (rows and columns)

Question 50

Q

50 Use iloc to get jon’s row as dataframe

name age
0 nick 15
1 jon 18

Answer

A

Outcome:

df.iloc[1,]

name jon
age 18
Name: 1, dtype: object

Question 51

Q

51 Use iloc to get nick value

name age
0 nick 15
1 jon 18

#Outcome:
nick

Answer

A

print(df.iloc[0,0])

Question 52

Q

52 Use loc to get nick’s row as dataframe

name age
rank_1 nick 15
rank_2 jon 18

Answer

A

Outcome:

print(df.loc[[‘rank_1’]])

name age
rank_1 nick 15

Question 53

Q

53 Output of:

dict ={'name': ['nick','jon'],
 'age':[15,18]}
index_rows = ['rank_1','rank_2']
df = pd.DataFrame(dict)
df.index = index_rows

df.loc[‘rank_2’]

Answer

A

name jon
age 18
Name: rank_2, dtype: object

Question 54

Q

54 Use loc to get jon’s age:

name age
rank_1 nick 15
rank_2 jon 18

Answer

A

df.loc[‘rank_2’,’age’]

Question 55

Q

55 Use iloc to get age column as a dataframe

name age
rank_1 nick 15
rank_2 jon 18

Answer

A

df.iloc[:,[1]]

Question 56

Q

56 Outcome of:

print(True == False)

Question 57

Q

57 Outcome of:

print(- 1!= 75)

Question 58

Q

58 Outcome of:

print(True == 1)

Question 59

Q

59 Outcome of:

print(True == 0)

Question 60

Q

60 Outcome of:

x = -3 * 6
print(x>=-10)

Question 61

Q

61 Complete code:

import numpy as np
my_house = np.array([18.0, 20.0, 10.75])
…

#Outcome: 
[ True True False]

Answer

A

There are many possible answer

#Answer:
print(my_house>11)

Question 62

Q

62 List out and name comparison operators

Answer

A

Equal: 2 == 2 True
Not equal: 2 != 2 False
Greater than: 2 > 3 False
Less than: 2 < 3 True
Greater than or equal to: 2 >= 3 True
Less than or equal to: 2 <= 3 True

Question 63

Q

63 Outcome of:

a,b =[2,3]
a > b and a < b

Question 64

Q

64 Outcome of:

a,b =[2,3]
a > b or a < b

Answer 57

A

np. logical_and()
np. logical_or()
np. logical_not()

Answer 58

A

print(np.logical_and(my_house>18, my_house<21))

Answer 59

A

Order in which the program’s code executes. The control flow of a Python program is regulated by conditional statements, loops, and function calls.

Answer 60

A

small
small
medium
large

Answer 61

A

house=[2,4,6]
for i in house:
 if(i <4) :
 print("small")
 elif(i ==4 ) :
 print("medium")
 else :
 print("large")

Answer 62

A

filter_ = df[‘name’] == ‘nick’
selection =df[filter_]
print(selection)

Answer 63

A

df[df[‘Country’]==’Spain’]

Answer 64

A

age = df[‘Age’]
between = np.logical_and(age>10,age<15)
df[between]

Answer 65

A

x = 1
while x < 4 :
print(x)
x = x + 1

Answer 66

A

correcting...
3
correcting...
2
correcting...
1
correcting...
0

Answer 67

A

for area in areas :

print(area)

Answer 68

A

for index, area in enumerate(areas,1) :

print( str(index)+ “-“ + str(area))

Answer 69

A

for x in house :

print( str(x[0]) + “-“ + str(x[1]) )

Answer 70

A

for key, value in world.items() :

print(key + “ – “ + str(value))

Answer 71

A

1
9
25
49

Answer 72

A

for ind,col in df.iterrows():
print(ind)
print(col[1])

Answer 73

A

import pandas as pd

data = {'Name':['Tom', 'Jack'],'Country':['Spain','USA']}
df = pd.DataFrame(data, index =['rank1', 'rank2'])

Answer 74

A

for lab, row in df.iterrows() :
df.loc[lab, “name_length”] = len(row[“Name”])

Outcome:
Name Country name_length
0 Tom Spain 3.0
1 Jack USA 4.0

Answer 75

A

Seeding a pseudo-random number generator gives it its first “previous” value. Each seed value will correspond to a sequence of generated values for a given random number generator.

Answer 76

A

import numpy as np
np.random.seed(123) #any number
print(np.random.rand())

np.random.seed(123)
print(np.random.rand())

Answer 77

A

print(np.random.randint(1,7))

Answer 78

A

import numpy as np
np.random.seed(124)
step = 0
dice=np.random.randint(1,7)
if dice <= 2 :
 step = step - 1
elif dice>4 :
 step=step+1
else:
 step = step

print(‘dice:’,dice)
print(‘step:’,step)

Answer 79

A

np.random.seed(124)

random_walk=[0]
step = 0
for i in range(10):
dice=np.random.randint(1,7)
if dice <= 2 :
 step = step - 1
elif dice>4 :
 step=step+1
else:
 step = step
random_walk.append(random_walk[-1]+step)

meters_forward = random_walk[-1]
steps_walked = len(random_walk)-1 #First step is 0

print(‘steps_walked:’, steps_walked)
print(‘meters_forward:’, meters_forward)

Answer 80

A

max_value=max([i for i in range(10)])

Answer 81

A

They are used for creating new lists from other iterables.

Answer 82

A

random_walk =[0,1,2,3,2,3,4,5,6] 0=starting position

#Get the amount the meters advance in this random_walk.
#Get the number of steps given
#Use matplotlib line plot to display the walk

import matplotlib.pyplot as plt

random_walk =[0,1,1,0,-1,0,1,2,3]
steps_walked = len(random_walk) -1
meters_forward = random_walk [-1]
print('steps_walked:',steps_walked)
print('meters_forward:',meters_forward)

plt. plot(random_walk)
plt. show()

Brainscape's Knowledge GenomeTM

2 Introduction to Python (II) Flashcards

Matplotlib, dictionaries, dataframes... https://colab.research.google.com/drive/1fKMFrRbIJQE8Tpa06us0qQPnamBn957z?usp=sharing

Brainscape's Knowledge Genome^TM