Subset Observations (Rows) and Subset Variables (Columns) Flashcards by pas alsan

Extract rows that meet logical criteria.

df[df.Length > 7]
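
As a minimal sketch of the filter above, assuming a small made-up frame (the data and the `Width` column are hypothetical, added only for illustration):

```python
import pandas as pd

# Hypothetical sample data; the values and the Width column are assumptions.
df = pd.DataFrame({
    "Length": [5, 8, 12, 7],
    "Width": [2, 3, 4, 1],
})

# A boolean mask keeps only the rows where the condition is True.
long_rows = df[df.Length > 7]

# Combine conditions with & (and) or | (or); each condition needs parentheses.
long_and_wide = df[(df.Length > 7) & (df.Width > 3)]
```

The comparison is strict, so the row with `Length == 7` is not kept.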

Remove duplicate rows (by default all columns are compared; pass subset to consider only certain columns).

df.drop_duplicates()
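
A short sketch of how the `subset` and `keep` parameters change the result, using made-up data (the column names are assumptions):

```python
import pandas as pd

# Hypothetical sample data for illustration only.
df = pd.DataFrame({
    "species": ["cat", "cat", "dog", "cat"],
    "weight": [4, 4, 9, 5],
})

# Default: a row is a duplicate only if every column matches an earlier row.
unique_rows = df.drop_duplicates()

# subset limits the comparison to the listed columns; keep="last" retains
# the final occurrence of each duplicate group instead of the first.
one_per_species = df.drop_duplicates(subset=["species"], keep="last")
```

Both calls return a new DataFrame and leave `df` unchanged.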

Select first n rows.

df.head(n)

Select last n rows.

df.tail(n)
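
A small sketch of `head` and `tail` side by side, assuming a ten-row frame invented for the example:

```python
import pandas as pd

# Hypothetical ten-row frame; the column name is an assumption.
df = pd.DataFrame({"value": range(10)})

first_three = df.head(3)   # rows at positions 0, 1, 2
last_three = df.tail(3)    # rows at positions 7, 8, 9

# A negative n is also accepted: head(-3) returns all rows except the last 3.
all_but_last_three = df.head(-3)
```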

Randomly select fraction of rows.

df.sample(frac=0.5)

Randomly select n rows.

df.sample(n=10)
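
Both forms of `sample` (by fraction and by count) can be sketched as follows. The data is made up, and `random_state` is added here only so the draw is repeatable:

```python
import pandas as pd

# Hypothetical 100-row frame for illustration.
df = pd.DataFrame({"value": range(100)})

# frac selects a proportion of the rows; n selects an exact count.
half = df.sample(frac=0.5, random_state=0)
ten = df.sample(n=10, random_state=0)

# The same seed reproduces the same selection.
ten_again = df.sample(n=10, random_state=0)
```

Sampling is without replacement by default, so no row appears twice unless `replace=True` is passed.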

Select rows by position.

df.iloc[10:20]
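
A brief sketch showing that `iloc` counts positions, not index labels. The frame below is hypothetical and deliberately uses labels that differ from positions:

```python
import pandas as pd

# Hypothetical frame whose index labels (100..129) differ from positions (0..29).
df = pd.DataFrame({"value": range(30)}, index=range(100, 130))

# Positions 10 through 19; like a Python slice, the end position is excluded.
rows = df.iloc[10:20]
```

Here `rows` holds ten rows whose labels run from 110 to 119.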

Get the rows of a DataFrame sorted by the n largest values of columns.

DataFrame.nlargest(n, columns, keep='first')

Get the rows of a DataFrame sorted by the n smallest values of columns.

DataFrame.nsmallest(n, columns, keep='first')
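
The two methods mirror each other: `nlargest` returns the rows with the largest values and `nsmallest` the rows with the smallest. A minimal sketch with made-up data that includes a tie:

```python
import pandas as pd

# Hypothetical sample data; two rows share the largest value.
df = pd.DataFrame({
    "name": ["a", "b", "c", "d"],
    "value": [3, 9, 1, 9],
})

top_two = df.nlargest(2, "value")      # largest first
bottom_two = df.nsmallest(2, "value")  # smallest first

# keep="all" returns every tied row, even if that exceeds n.
all_ties = df.nlargest(1, "value", keep="all")
```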

Select multiple columns with specific names.

df[['width', 'length', 'species']]

Select single column with specific name.

df['width'] or df.width
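
Single brackets return a Series while a list of names returns a DataFrame. A short sketch with hypothetical data:

```python
import pandas as pd

# Hypothetical sample data for illustration.
df = pd.DataFrame({
    "width": [1.0, 2.0],
    "length": [3.0, 4.0],
    "species": ["x", "y"],
})

single = df["width"]            # a Series
also_single = df.width          # the same Series via attribute access
one_col_frame = df[["width"]]   # a one-column DataFrame
several = df[["width", "length", "species"]]
```

Attribute access works only when the column name is a valid Python identifier and does not clash with an existing DataFrame attribute or method, so the bracket form is the safer default.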

Select columns whose name matches regular expression regex.

df.filter(regex='regex')
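
A minimal sketch of regex-based column selection. The column names are made up, and the patterns are only examples:

```python
import pandas as pd

# Hypothetical column names chosen to show what each pattern matches.
df = pd.DataFrame({"x1": [1], "x2": [2], "y1": [3], "total_x": [4]})

# The pattern is searched anywhere in the name, so anchor it when needed.
starts_with_x = df.filter(regex=r"^x")      # x1, x2
ends_with_digit = df.filter(regex=r"\d$")   # x1, x2, y1
```

Without the `^` anchor, the pattern `x` would also match `total_x`.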

Select all columns between x2 and x4 (inclusive).

df.loc[:, 'x2':'x4']
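
Unlike positional slices, a label slice with `loc` includes both endpoints. A short sketch with a hypothetical five-column frame:

```python
import pandas as pd

# Hypothetical frame with columns x1..x5.
df = pd.DataFrame({"x1": [1], "x2": [2], "x3": [3], "x4": [4], "x5": [5]})

# The label slice follows column order and keeps both 'x2' and 'x4'.
middle = df.loc[:, "x2":"x4"]
```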

Select columns in positions 1, 2 and 5 (first column is 0).

df.iloc[:,[1,2,5]]
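
A quick sketch of positional column selection, assuming a made-up six-column frame:

```python
import pandas as pd

# Hypothetical frame with six columns named a..f (positions 0..5).
df = pd.DataFrame([[0, 1, 2, 3, 4, 5]], columns=list("abcdef"))

# The colon keeps every row; the list picks columns at positions 1, 2 and 5.
picked = df.iloc[:, [1, 2, 5]]
```

The result keeps the columns in the order given in the list.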

Select rows meeting a logical condition, and only the specified columns.

df.loc[df['a'] > 10, ['a', 'c']]
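
Row filtering and column selection can be done in a single `loc` call. A minimal sketch with hypothetical data; the final assignment is an extra illustration, not part of the card:

```python
import pandas as pd

# Hypothetical sample data for illustration.
df = pd.DataFrame({
    "a": [5, 15, 25],
    "b": [1, 2, 3],
    "c": ["x", "y", "z"],
})

# The first argument is the row mask, the second the list of columns to keep.
subset = df.loc[df["a"] > 10, ["a", "c"]]

# The same form can assign to the selected cells in place.
df.loc[df["a"] > 10, "b"] = 0
```

Doing both selections in a single `loc` call avoids the chained-indexing pitfalls of `df[mask][cols]` when assigning.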

(15 cards)