summarice data Flashcards

1
Q

Count number of rows with each unique value of variable

A

df[‘w’].value_counts()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

number of rows in DataFrame.

A

len(df)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

number of distinct values in a column.

A

df[‘w’].nunique()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Basic descriptive statistics for each column (or GroupBy)

A

df.describe()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Return the sum of the values for the requested axis

A

DataFrame.sum(axis=None, skipna=None, level=None, numeric_only=None, min_count=0, **kwargs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

DataFrame.count(axis=0, level=None, numeric_only=False)

A

Count non-NA cells for each column or row.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Return the median of the values for the requested axis

A

DataFrame.median(axis=None, skipna=None, level=None, numeric_only=None, **kwargs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Return values at the given quantile over requested axis, a la numpy.percentile.

A

DataFrame.quantile(q=0.5, axis=0, numeric_only=True, interpolation=’linear’)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

This method returns the minimum of the values in the object.

A

DataFrame.min(axis=None, skipna=None, level=None, numeric_only=None, **kwargs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

This method returns the maximum of the values in the object.

A

DataFrame.max(axis=None, skipna=None, level=None, numeric_only=None, **kwargs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Return the mean of the values for the requested axis

A

DataFrame.mean(axis=None, skipna=None, level=None, numeric_only=None, **kwargs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Return unbiased variance over requested axis.

A

DataFrame.var(axis=None, skipna=None, level=None, ddof=1, numeric_only=None, **kwargs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Return sample standard deviation over requested axis.

A

DataFrame.std(axis=None, skipna=None, level=None, ddof=1, numeric_only=None, **kwargs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly