Terms_and_Definitions Flashcards

Question 1

Q

A/B Test

Answer

A

A method of comparing two versions of a webpage, feature, or app against each other to determine which performs better.

Question 2

Q

Null Hypothesis (H₀)

Answer

A

States that there is no real difference between the control and test groups; any observed difference is attributed to chance.

Question 3

Q

Alternative Hypothesis (H₁)

Answer

A

States that there is a real difference between the control and test groups.

Question 4

Q

P-Value

Answer

A

The probability of observing results at least as extreme as those measured, assuming the null hypothesis is true.
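As a rough illustration of this definition, the sketch below computes a two-sided p-value for a difference in conversion rates using a pooled two-proportion z-test. The function name and the conversion counts are hypothetical, chosen only for the example.

```python
from math import sqrt
from statistics import NormalDist

def two_proportion_p_value(conv_a, n_a, conv_b, n_b):
    """Two-sided p-value for H0: both groups share one conversion rate."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    # Under H0 the best estimate of the shared rate pools both groups.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Probability of a |z| at least this large under the standard normal.
    return 2 * (1 - NormalDist().cdf(abs(z)))

# Hypothetical counts: 200/2000 conversions in control, 250/2000 in test.
p_value = two_proportion_p_value(conv_a=200, n_a=2000, conv_b=250, n_b=2000)
print(f"p-value = {p_value:.4f}")
```

A p-value below the chosen significance level would lead to rejecting the null hypothesis.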

Question 5

Q

Significance Level (α)

Answer

A

The threshold for rejecting the null hypothesis (commonly set at 0.05).

Question 6

Q

Confidence Interval (CI)

Answer

A

A range of values, computed from the sample, that is expected to contain the true effect size or metric in a given share of repeated experiments (e.g., 95%).
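One common way to build such an interval for a difference in conversion rates is the Wald (normal-approximation) interval. The sketch below assumes that method; the function name and counts are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def diff_ci(conv_a, n_a, conv_b, n_b, confidence=0.95):
    """Wald interval for the difference in conversion rates (test - control)."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    # Unpooled standard error of the difference between two proportions.
    se = sqrt(p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b)
    # Critical value, e.g. about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    diff = p_b - p_a
    return diff - z * se, diff + z * se

# Hypothetical counts: 200/2000 conversions in control, 250/2000 in test.
low, high = diff_ci(200, 2000, 250, 2000)
print(f"95% CI for the lift: [{low:.4f}, {high:.4f}]")
```

If the interval excludes zero, the result is consistent with rejecting the null hypothesis at the matching significance level.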

Question 7

Q

Control Group

Answer

A

The group that does not receive the treatment or variant being tested.

Question 8

Q

Test Group

Answer

A

The group that receives the treatment or variant being tested.

Question 9

Q

Randomization

Answer

A

Assigning participants to groups in a way that each participant has an equal chance of being in any group.
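A minimal sketch of random assignment, assuming two groups and a fixed seed so the split is reproducible. The function name and user IDs are hypothetical.

```python
import random

def assign_groups(user_ids, seed=42):
    """Assign each user to 'control' or 'test' with equal probability."""
    rng = random.Random(seed)  # seeded so the assignment can be reproduced
    return {uid: rng.choice(["control", "test"]) for uid in user_ids}

assignment = assign_groups(range(1000))
n_test = sum(1 for group in assignment.values() if group == "test")
print(f"{n_test} of {len(assignment)} users assigned to the test group")
```

Production systems often hash the user ID instead, so that a returning user always lands in the same group; that variation is not shown here.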

Question 10

Q

Power Analysis

Answer

A

A calculation to determine the minimum sample size required to detect a given effect size with sufficient power.
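For a conversion-rate test, one widely used approximation for the per-group sample size is the normal-approximation formula for two proportions. The sketch below assumes that formula; the function name, baseline rate, and target rate are hypothetical.

```python
from math import ceil
from statistics import NormalDist

def sample_size_per_group(p_base, p_target, alpha=0.05, power=0.80):
    """Approximate users needed per group to detect p_base -> p_target."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided test
    z_power = NormalDist().inv_cdf(power)
    variance = p_base * (1 - p_base) + p_target * (1 - p_target)
    return ceil((z_alpha + z_power) ** 2 * variance / (p_target - p_base) ** 2)

# Hypothetical goal: detect a lift from 10% to 12% conversion.
n = sample_size_per_group(0.10, 0.12)
print(f"about {n} users per group")
```

Halving the detectable lift roughly quadruples the required sample size, because the effect size enters the formula squared.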

Question 11

Q

Effect Size

Answer

A

The magnitude of the difference between groups (e.g., a 5% increase in conversion rate).

Question 12

Q

Type I Error

Answer

A

Rejecting the null hypothesis when it is actually true (false positive).

Question 13

Q

Type II Error

Answer

A

Failing to reject the null hypothesis when it is false (false negative).

Question 14

Q

Bonferroni Correction

Answer

A

A method that divides the significance level by the number of comparisons, to limit false positives when multiple comparisons are being made.
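The correction itself is a single division: each comparison is tested against alpha divided by the number of comparisons. A small sketch, with a hypothetical function name and hypothetical p-values:

```python
def bonferroni(p_values, alpha=0.05):
    """Return the adjusted threshold and which hypotheses are rejected."""
    adjusted_alpha = alpha / len(p_values)
    return adjusted_alpha, [p <= adjusted_alpha for p in p_values]

# Hypothetical p-values from four metrics tested in the same experiment.
adjusted_alpha, rejected = bonferroni([0.004, 0.020, 0.030, 0.400])
print(adjusted_alpha, rejected)
```

Here only the first p-value passes the adjusted threshold, although three of the four would pass an unadjusted 0.05.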

Question 15

Q

Simpson’s Paradox

Answer

A

A trend appears in different groups of data but disappears or reverses when the groups are combined.
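The reversal is easiest to see with numbers. In the illustrative counts below, variant A has the higher success rate within each segment, yet B has the higher rate overall, because the two variants have very different segment sizes. The variant and segment names are hypothetical.

```python
# Illustrative (successes, trials) counts per variant and segment.
data = {
    "A": {"small": (81, 87), "large": (192, 263)},
    "B": {"small": (234, 270), "large": (55, 80)},
}

def rate(successes, trials):
    return successes / trials

def overall(variant):
    """Success rate for a variant with all segments pooled together."""
    successes = sum(s for s, _ in data[variant].values())
    trials = sum(t for _, t in data[variant].values())
    return successes / trials

for segment in ("small", "large"):
    a, b = rate(*data["A"][segment]), rate(*data["B"][segment])
    print(f"{segment}: A={a:.3f} B={b:.3f}")
print(f"overall: A={overall('A'):.3f} B={overall('B'):.3f}")
```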

Question 16

Q

Descriptive Statistics

Answer

A

Summarizing and describing the features of a dataset (e.g., mean, median, mode).

Question 17

Q

Inferential Statistics

Answer

A

Using a sample to make generalizations about a population (e.g., hypothesis testing, confidence intervals).

Question 18

Q

Mean

Answer

A

The average value of a dataset.

Question 19

Q

Median

Answer

A

The middle value in a dataset when ordered.

Question 20

Q

Mode

Answer

A

The most frequently occurring value in a dataset.

Question 21

Q

Variance

Answer

A

A measure of how much values in a dataset vary from the mean, calculated as the average squared deviation from the mean.

Question 22

Q

Standard Deviation

Answer

A

The square root of the variance, representing data dispersion.
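The five measures above (mean, median, mode, variance, standard deviation) can be computed with Python's standard `statistics` module. The sketch below uses the population versions of variance and standard deviation; the sample versions (`statistics.variance`, `statistics.stdev`) divide by n - 1 instead of n. The data values are made up for illustration.

```python
import statistics

values = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data

mean = statistics.mean(values)
median = statistics.median(values)       # average of the two middle values
mode = statistics.mode(values)           # most frequent value
variance = statistics.pvariance(values)  # population variance
std_dev = statistics.pstdev(values)      # square root of the variance

print(mean, median, mode, variance, std_dev)
```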

Question 23

Q

Z-Test

Answer

A

A hypothesis test for comparing means when the population variance is known.

Question 24

Q

T-Test

Answer

A

A hypothesis test for comparing means when the population variance is unknown.

Question 25

Q

ANOVA (Analysis of Variance)

Answer

A

A test to compare the means of three or more groups.

Question 26

Q

Chi-Square Test

Answer

A

A test for relationships between categorical variables.

Question 27

Q

Linear Regression

Answer

A

A method to model the relationship between a dependent variable and one or more independent variables.
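For the single-variable case, the best-fit line can be found with ordinary least squares. The sketch below assumes that method and uses made-up points that lie exactly on y = 2x + 1; the function name is hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums around the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

slope, intercept = fit_line([1, 2, 3, 4, 5], [3, 5, 7, 9, 11])
print(slope, intercept)
```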

Question 28

Q

Logistic Regression

Answer

A

A regression model used when the dependent variable is categorical (most often binary).

Question 29

Q

Bayesian Statistics

Answer

A

An approach to statistics that incorporates prior beliefs or evidence.

Question 30

Q

Frequentist Statistics

Answer

A

A traditional approach to statistics that interprets probability as the long-run frequency of an outcome over repeated sampling.

Question 31

Q

SELECT

Answer

A

A SQL command used to retrieve data from a database.

Question 32

Q

FROM

Answer

A

Specifies the table to retrieve data from.

Question 33

Q

WHERE

Answer

A

Filters rows based on conditions.

Question 34

Q

GROUP BY

Answer

A

Groups rows sharing a property for aggregation.

Question 35

Q

HAVING

Answer

A

Filters grouped rows based on aggregated values.

Question 36

Q

JOIN

Answer

A

Combines rows from two or more tables based on a related column.

Question 37

Q

INNER JOIN

Answer

A

Returns rows with matching values in both tables.

Question 38

Q

LEFT JOIN

Answer

A

Returns all rows from the left table and matching rows from the right table, with NULLs where no match exists.

Question 39

Q

RIGHT JOIN

Answer

A

Returns all rows from the right table and matching rows from the left table, with NULLs where no match exists.

Question 40

Q

FULL OUTER JOIN

Answer

A

Returns all rows from both tables, with nulls where no match exists.
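The difference between join types shows up in which unmatched rows survive. The sketch below runs an INNER JOIN and a LEFT JOIN against an in-memory SQLite database through Python's `sqlite3` module. The `users` and `orders` tables and their contents are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE users (user_id INTEGER, name TEXT);
    CREATE TABLE orders (order_id INTEGER, user_id INTEGER, amount REAL);
    INSERT INTO users VALUES (1, 'Ana'), (2, 'Ben'), (3, 'Cy');
    INSERT INTO orders VALUES (10, 1, 25.0), (11, 1, 40.0), (12, 2, 15.0);
""")

# INNER JOIN: only users that have at least one order.
inner_rows = conn.execute("""
    SELECT u.name, o.amount
    FROM users AS u
    INNER JOIN orders AS o ON o.user_id = u.user_id
    ORDER BY o.order_id
""").fetchall()

# LEFT JOIN: every user; amount is NULL (None) when there is no order.
left_rows = conn.execute("""
    SELECT u.name, o.amount
    FROM users AS u
    LEFT JOIN orders AS o ON o.user_id = u.user_id
    ORDER BY u.user_id, o.order_id
""").fetchall()

print(inner_rows)
print(left_rows)
```

The user without orders is dropped by the INNER JOIN and kept, with a NULL amount, by the LEFT JOIN.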

Question 41

Q

ORDER BY

Answer

A

Sorts the result set by specified columns.

Question 42

Q

LIMIT

Answer

A

Restricts the number of rows returned in a query.
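The clauses above (SELECT, FROM, WHERE, GROUP BY, HAVING, ORDER BY, LIMIT) usually appear together in one query. A sketch using Python's `sqlite3` module and a hypothetical `events` table with made-up rows:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE events (user_id INTEGER, variant TEXT, converted INTEGER);
    INSERT INTO events VALUES
        (1, 'control', 0), (2, 'control', 1), (3, 'control', 0),
        (4, 'test', 1), (5, 'test', 1), (6, 'test', 0),
        (7, 'holdout', 1);
""")

rows = conn.execute("""
    SELECT variant, COUNT(*) AS users, AVG(converted) AS conversion_rate
    FROM events                       -- source table
    WHERE variant != 'holdout'        -- filter rows before grouping
    GROUP BY variant                  -- one output row per variant
    HAVING COUNT(*) >= 3              -- filter groups after aggregation
    ORDER BY conversion_rate DESC     -- best variant first
    LIMIT 2                           -- cap the number of rows returned
""").fetchall()

for variant, users, conversion_rate in rows:
    print(variant, users, round(conversion_rate, 3))
```

WHERE filters individual rows before they are grouped, while HAVING filters the groups after the aggregates have been computed.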

Question 43

Q

Subquery

Answer

A

A query nested within another query.

Question 44

Q

CTE (Common Table Expression)

Answer

A

A named temporary result set, defined with the WITH keyword, that can be referenced within a single SQL query.
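A short sketch of a CTE, which also happens to use a subquery, run through Python's `sqlite3` module. The `orders` table, the `user_totals` name, and the data are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (user_id INTEGER, amount REAL);
    INSERT INTO orders VALUES (1, 25.0), (1, 40.0), (2, 15.0), (3, 90.0);
""")

rows = conn.execute("""
    WITH user_totals AS (             -- CTE: total spend per user
        SELECT user_id, SUM(amount) AS total
        FROM orders
        GROUP BY user_id
    )
    SELECT user_id, total
    FROM user_totals
    WHERE total > (SELECT AVG(total) FROM user_totals)  -- subquery
    ORDER BY user_id
""").fetchall()

print(rows)
```

The CTE is defined once and referenced twice, which keeps the aggregation from being written out in both the outer query and the subquery.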

Question 45

Q

Pandas

Answer

A

A Python library for data manipulation and analysis.

Question 46

Q

NumPy

Answer

A

A Python library for numerical computations.

Question 47

Q

Matplotlib

Answer

A

A Python library for creating static, animated, and interactive visualizations.

Question 48

Q

Seaborn

Answer

A

A Python library, built on Matplotlib, for statistical data visualization.

Question 49

Q

scipy.stats

Answer

A

A module of the SciPy library for statistical functions and tests.

Question 50

Q

Statsmodels

Answer

A

A Python module for statistical modeling and hypothesis testing.

Question 51

Q

A/B Test Simulation

Answer

A

A process to mimic test results using random sampling or bootstrapping.
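As a sketch of the bootstrapping variant, the code below resamples each group with replacement many times and reads a 95% percentile interval off the resulting differences in conversion rate. The function name, group sizes, and conversion counts are hypothetical, and the seed is fixed only to make the run reproducible.

```python
import random

def bootstrap_diff(control, test, n_resamples=1000, seed=0):
    """95% percentile bootstrap interval for mean(test) - mean(control)."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_resamples):
        # Resample each group with replacement, keeping its original size.
        c = rng.choices(control, k=len(control))
        t = rng.choices(test, k=len(test))
        diffs.append(sum(t) / len(t) - sum(c) / len(c))
    diffs.sort()
    lo_idx = n_resamples * 25 // 1000        # 2.5th percentile
    hi_idx = n_resamples * 975 // 1000 - 1   # 97.5th percentile
    return diffs[lo_idx], diffs[hi_idx]

# Hypothetical outcomes: 1 = converted, 0 = did not convert.
control = [1] * 50 + [0] * 450   # 10% conversion
test = [1] * 75 + [0] * 425      # 15% conversion
low, high = bootstrap_diff(control, test)
print(f"bootstrap 95% interval for the lift: [{low:.3f}, {high:.3f}]")
```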

Question 52

Q

Data Visualization

Answer

A

Representing data graphically to communicate insights.

Question 53

Q

Dashboard

Answer

A

A visual interface that displays key performance metrics and data.

Question 54

Q

Power BI

Answer

A

A business analytics tool for creating dashboards and visualizations.

Question 55

Q

Tableau

Answer

A

A software tool for data visualization and business intelligence.

Question 56

Q

Funnel Analysis

Answer

A

A method to track user journey and identify drop-off points.
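A minimal sketch of the calculation behind a funnel report: the conversion rate between each pair of consecutive steps, and the step with the largest drop-off. The step names and counts are hypothetical.

```python
# Hypothetical number of users reaching each step, in order.
funnel = [("visit", 10000), ("signup", 2500), ("add_to_cart", 1000), ("purchase", 400)]

def step_conversion(steps):
    """Conversion rate from each step to the next one."""
    rows = []
    for (prev_name, prev_count), (name, count) in zip(steps, steps[1:]):
        rows.append((f"{prev_name} -> {name}", count / prev_count))
    return rows

rates = step_conversion(funnel)
worst_step = min(rates, key=lambda row: row[1])  # lowest step conversion

for label, step_rate in rates:
    print(f"{label}: {step_rate:.0%}")
print("largest drop-off:", worst_step[0])
```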

Question 57

Q

Cohort Analysis

Answer

A

Analyzing behavior over time by grouping users based on shared characteristics (e.g., signup month).

Question 58

Q

Customer Journey

Answer

A

The path a customer takes from initial interaction to conversion.

Question 59

Q

Clickstream Data

Answer

A

Data collected about user interactions on a website or app.

Question 60

Q

Hadoop

Answer

A

A framework for distributed storage and processing of large datasets.

Question 61

Q

Telemetry

Answer

A

The collection of data about the usage of a digital product.

Question 62

Q

Data Pipeline

Answer

A

A series of steps to process and analyze data from source to destination.

Question 63

Q

Hypothesis Validation

Answer

A

The process of testing assumptions with data.

Question 64

Q

Exploratory Data Analysis (EDA)

Answer

A

Initial analysis to summarize data characteristics.

Answer 65

A

A process for collecting, transforming, and storing data.
