Lesson 17 Exploratory analysis Flashcards

Question 1

Q

Import the follow csv from this location, ensuring to specify that data is separated by delimiter ;

r’C:\Users\User\Documents\CFG_DATA\data\winequality-red.csv’

Answer

A

df = pd.read_csv(r’C:\Users\User\Documents\CFG_DATA\data\winequality-red.csv’, sep=’;’)

Question 2

Q

check labels for each column

Answer

A

df.columns.values
or
df.keys()

Question 3

Q

What is the number of rows?
Columns?

Answer

A

df.shape[0]
df.shape[1]

Question 4

Q

Check information for each column

Answer

A

df.info()

Question 5

Q

Return the unique values from a column called quality.

Answer

A

df.quality.unique()

Question 6

Q

Calculate the frequency of each unique value in the “quality” column of the DataFrame df (return a Series with the unique values as the index and their respective counts as the values.)

Answer

A

df.quality.value_counts()

Question 7

Q

Check for missing values using a heatmap

Answer

A

cbar is the colorbar

sns.heatmap(df.isnull(),cbar=False,yticklabels=False)

Question 8

Q

calculate attributes correlation

Answer

A

df.corr()

Question 9

Q

Build correlation heatmap

Answer

A

plt.figure(figsize=(6,4))
sns.heatmap(df.corr(),annot=False)

Question 10

Q

Increase the size of the heatmap.

Answer

A

plt.figure(figsize=(16, 6))

Question 11

Q

k = 12

Answer

A

specify the number of variables for the heatmap

Question 12

Q

Question I need to figure out why it is necessary to create this new heatmap of the correlation matrix

Answer

A

Quality correlation matrix

Increase the size of the heatmap.
plt.figure(figsize=(16, 6))

k = 12 # number of variables for heatmap
cols = df.corr().nlargest(k, ‘quality’)[‘quality’].index
cm = df[cols].corr()

sns.heatmap(cm, annot=True)

Question 13

Q

Create a boxplot

Answer

A

plt.boxplot(df_happy_gdp[‘Happiness_score’])

Set the title and labels
plt.title(‘Box Plot of Happiness Score’)
plt.xlabel(‘Happiness Score’)
plt.ylabel(‘Value’)
plt.show()

Lesson 17 Exploratory analysis Flashcards

(13 cards)