Data Analyst Interview Prep Flashcards

Question 1

Q

How would you find duplicate records in a table?

Answer

A

Use a SQL query that groups by the key fields and counts the occurrences. For example:
SELECT column_name, COUNT() FROM table_name GROUP BY column_name HAVING COUNT() > 1;

Question 2

Q

Write a query to join two tables and filter results based on a specific condition.

Imagine you are tasked with combining the customer lists from two different branches of an alarm system company. How would you write a query to merge these lists and filter the results to only include customers with a premium alarm package?

Answer

A

SELECT a.column1, b.column2 FROM table_a a INNER JOIN table_b b ON a.id = b.id WHERE a.condition = ‘specific_condition’;

To merge the customer lists from two branches and filter for those with a premium alarm package, you could use a SQL query like this:

SELECT b1.customer_name, b2.branch_location
FROM branch1 b1
INNER JOIN branch2 b2 ON b1.customer_id = b2.customer_id
WHERE b1.package_type = ‘Premium’;

Question 3

Q

Explain the difference between INNER JOIN, LEFT JOIN, RIGHT JOIN, and how you would use these in SQL queries.

Answer

A

INNER JOIN returns only matching rows; LEFT JOIN returns all from left table and matched from right; RIGHT JOIN returns all from right and matched from left.

Question 4

Q

How do you optimize a slow-running query?

Answer

A

Analyze the execution plan, create indexes, avoid SELECT *, and limit dataset with WHERE conditions.

Question 5

Q

What steps would you take to clean a dataset with missing values?

Answer

A

Identify missing values, decide on a strategy (imputation or removal), and document the process.

Question 6

Q

How do you handle outliers in your data?

Answer

A

Use statistical methods to identify outliers and decide to remove, transform, or investigate.

Question 7

Q

Explain the process of normalizing and standardizing data.

Answer

A

Normalization rescales values to a range, while standardization rescales data to have mean 0 and std dev 1.

Question 8

Q

How do you use pivot tables for data analysis?

Answer

A

Pivot tables summarize data dynamically, allowing users to rearrange fields for quick comparisons.

Question 9

Q

How do you create and use calculated fields to derive new insights from existing data?

Answer

A

Create calculated fields in Microstrategy to derive metrics using existing fields.

Question 10

Q

Describe how you would create interactive dashboards in Power BI/Tableau.

Answer

A

Connect the data source, create visualizations, and use filters and slicers for interaction.

Question 11

Q

Explain the difference between correlation and causation.

Answer

A

Correlation indicates a relationship; causation implies one variable influences another.

Question 12

Q

How would you test a hypothesis using data?

Answer

A

Formulate null and alternative hypotheses, select a statistical test, and analyze the data.

Question 13

Q

What is the Central Limit Theorem, and why is it important?

Answer

A

The Central Limit Theorem states sample means approach a normal distribution as sample size increases, crucial for inferential statistics.

Question 14

Q

How would you approach analyzing sales data to identify trends and patterns?

Answer

A

Clean and prepare data, use visualizations to spot trends, and apply time series analysis for forecasting.

Question 15

Q

Imagine a scenario where a company’s profits are declining. What data would you look at, and how would you analyze it to find the root cause?

Answer

A

Analyze sales, customer feedback, and market trends; use regression analysis to find significant factors.

Question 16

Q

What is the difference between a Security filter and a View filter?

Answer

Study These Flashcards

A

A Security filter restricts data access; a View filter restricts data displayed without affecting security.

Question 17

Q

How can we migrate MicroStrategy Reports and Dashboards from on-premise to cloud (AWS, GCP)?

Answer

Study These Flashcards

A

Use migration tools to export and import reports into the cloud environment, ensuring compatibility.

Question 18

Q

Define Attribute role with a use case.

Answer

Study These Flashcards

A

Attribute roles define how attributes relate to facts (e.g., a customer may have roles in sales and marketing).

Question 19

Q

How do you change the data type in MicroStrategy?

Answer

Study These Flashcards

A

In MicroStrategy Desktop, right-click on the attribute or metric and change the data type in properties.

Question 20

Q

What are the types of Transformation, and where can we use them? (Provide a case scenario.)

Answer

Study These Flashcards

A

Types include Aggregation, Derivation, and Filtering. Use case: Transforming sales data to show the top 10 products.

Question 21

Q

How is one fact table shown in SQL Query in MicroStrategy?

Answer

Study These Flashcards

A

A fact table appears as a joined table in SQL queries, aggregating data over dimensions.