Pandas Strings Flashcards

Question 1

Q

names.str.capitalize()

Answer

A

a single method that will capitalize all the entries, while skipping over any missing values

Question 2

Q

Using tab completion on this str attribute

Answer

A

will list all the vectorized string methods available to Pandas.

Question 3

Q

len() lower() translate() islower()

ljust() upper() startswith() isupper()

Answer

A

Methods similar to Python string methods

Question 4

Q

rjust() find() endswith() isnumeric()

center() rfind() isalnum() isdecimal()

Answer

A

Methods similar to Python string methods

Question 5

Q

zfill() index() isalpha() split()

strip() rindex() isdigit() rsplit()

Answer

A

Methods similar to Python string methods

Question 6

Q

rstrip() capitalize() isspace() partition()

lstrip() swapcase() istitle() rpartition()

Answer

A

Methods similar to Python string methods

Question 7

Q

match()

Answer

A

Call re.match() on each element, returning a boolean.

Question 8

Q

extract()

Answer

A

Call re.match() on each element, returning matched groups as strings.

Question 9

Q

findall()

Answer

A

Call re.findall() on each element

Question 10

Q

replace()

Answer

A

Replace occurrences of pattern with some other string

Question 11

Q

contains()

Answer

A

Call re.search() on each element, returning a boolean

Question 12

Q

count()

Answer

A

Count occurrences of pattern

Question 13

Q

split()

Answer

A

Count occurrences of pattern

Question 14

Q

rsplit()

Answer

A

Equivalent to str.rsplit(), but accepts regexps

Question 15

Q

get()

Answer

A

Index each element

Question 16

Q

slice()

Answer

A

Slice each element

Question 17

Q

slice_replace()

Answer

A

Replace slice in each element with passed value

Question 18

Q

cat()

Answer

A

Concatenate strings

Question 19

Q

repeat()

Answer

A

Repeat values

Question 20

Q

normalize()

Answer

A

Return Unicode form of string

Question 21

Q

pad()

Answer

A

Add whitespace to left, right, or both sides of strings

Question 22

Q

wrap()

Answer

A

Split long strings into lines with length less than a given width

Question 23

Q

join()

Answer

A

Join strings in each element of the Series with passed separator

Question 24

Q

get_dummies()

Answer

A

extract dummy variables as a dataframe

Question 25

Q

df.str.slice(0, 3) is equivalent to

Answer

A

df.str[0:3]

Question 26

Q

monte.str.split().str.get(-1)

Answer

A

to extract the last name of each entry, we can combine split() and get():

Question 27

Q

recipes.ingredients.str.len().describe()

Answer

A

sample info getting

Question 28

Q

recipes.name[np.argmax(recipes.ingredients.str.len())]

Answer

A

sample info getting

which recipe has the longest ingredient list

Question 29

Q

recipes.description.str.contains(‘[Bb]reakfast’).sum()

Answer

A

how many of the recipes are for breakfast food

Question 30

Q

recipes.ingredients.str.contains(‘[Cc]innamon’).sum()

Answer

A

cinnamon as an ingredient

Question 31

Q

spice_list = [‘salt’, ‘pepper’, ‘oregano’, ‘sage’, ‘parsley’,
‘rosemary’, ‘tarragon’, ‘thyme’, ‘paprika’, ‘cumin’]

import re
spice_df = pd.DataFrame(dict((spice, recipes.ingredients.str.contains(spice, re.IGNORECASE))
for spice in spice_list))
spice_df.head()

Answer

A

search to see whether they are in each recipe’s ingredient list

return boolean data frame