Pandas Index & Slice Flashcards

Question 1

Q

data[‘b’]

returns ‘b’

Answer

A

Like a dictionary, the Series object provides a mapping from a collection of keys to a collection of values:

Question 2

Q

‘a’ in data
data.keys()
list(data.items())

Answer

A

We can also use dictionary-like Python expressions and methods to examine the keys/indices and values:

Question 3

Q

data[‘e’] = 1.25

Answer

A

Series objects can even be modified with a dictionary-like syntax. Just as you can extend a dictionary by assigning to a new key, you can extend a Series by assigning to a new index value:

Question 4

Q

# slicing by explicit index
data['a':'c']

# slicing by implicit integer index
data[0:2]

# masking
data[(data > 0.3) &amp; (data < 0.8)]

# fancy indexing
data[['a', 'e']]

Answer

A

A Series builds on this dictionary-like interface and provides array-style item selection via the same basic mechanisms as NumPy arrays – that is, slices, masking, and fancy indexing. Examples of these are as follows:

Question 5

Q

data. loc[1]

data. loc[1:3]

Answer

A

Pandas provides some special indexer attributes that explicitly expose certain indexing schemes. These are not functional methods, but attributes that expose a particular slicing interface to the data in the Series.

First, the loc attribute allows indexing and slicing that always references the explicit index:

Question 6

Q

data. iloc[1]

data. iloc[1:3]

Answer

A

The iloc attribute allows indexing and slicing that always references the implicit Python-style index:

Question 7

Q

data[‘area’]

data.area

Answer

A

The individual Series that make up the columns of the DataFrame can be accessed via dictionary-style indexing of the column name:

Question 8

Q

data[‘density’] = data[‘pop’] / data[‘area’]

data

Answer

A

Like with the Series objects discussed earlier, this dictionary-style syntax can also be used to modify the object, in this case adding a new column:

Question 9

Q

data.T

Answer

A

many familiar array-like observations can be done on the DataFrame itself. For example, we can transpose the full DataFrame to swap rows and columns:

Question 10

Q

data.iloc[:3, :2]

Answer

A

Here Pandas again uses the loc, iloc, and ix indexers mentioned earlier. Using the iloc indexer, we can index the underlying array as if it is a simple NumPy array (using the implicit Python-style index), but the DataFrame index and column labels are maintained in the result:

Question 11

Q

data.loc[:’Illinois’, :’pop’]

Answer

A

Similarly, using the loc indexer we can index the underlying data in an array-like style but using the explicit index and column names:

Question 12

Q

data.loc[data.density > 100, [‘pop’, ‘density’]]

Answer

A

Any of the familiar NumPy-style data access patterns can be used within these indexers. For example, in the loc indexer we can combine masking and fancy indexing as in the following:

Question 13

Q

data[‘Florida’:’Illinois’]
data[1:3]
data[data.density > 100]

Answer

A

First, while indexing refers to columns, slicing refers to rows:

direct masking operations are also interpreted row-wise rather than column-wise: