Exam preparations Flashcards

Question

What is Zero conditional mean?

Answer 1

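Exogeneity. The error term u has an expected value of zero given any values of the independent variables: E(u | X) = 0. This ensures that the independent variables are uncorrelated with the error term, so the covariance between the independent variables and the error term is zero.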

Answer 2

Homoskedasticity

Answer 3

Omitted variables are important factors that influence the dependent variable Y but are not included as independent variables X in the model. This creates omitted variable bias.
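The size of the bias can be illustrated with a small, noise-free sketch. The data and the coefficients (2 for X, 3 for the omitted Z) are made-up assumptions chosen so the arithmetic is exact. The short regression of Y on X alone picks up the effect of Z in proportion to how strongly Z moves with X.

```python
from statistics import mean

def ols_slope(x, y):
    """Slope from a simple regression of y on x (with intercept)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

# Hypothetical data: z is the omitted variable and is correlated with x.
x = [0, 1, 2, 3, 4]
z = [0, 1, 1, 2, 3]
TRUE_BETA_X, TRUE_BETA_Z = 2.0, 3.0
y = [1 + TRUE_BETA_X * a + TRUE_BETA_Z * b for a, b in zip(x, z)]

short_slope = ols_slope(x, y)     # regression that omits z
delta = ols_slope(x, z)           # how z moves with x
bias = short_slope - TRUE_BETA_X  # equals TRUE_BETA_Z * delta

print(short_slope, delta, bias)
```

If Z were uncorrelated with X (delta = 0), omitting it would not bias the slope on X.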

Answer 4

1. For panel or longitudinal data, fixed effects can control for omitted variables that are constant within individuals or groups. 2. Randomized Control Trials

Answer 5

Overspecification occurs in regression analysis when the model includes too many independent variables, some of which are irrelevant or redundant. These extra variables do not improve the model’s ability to explain the dependent variable Y and can even harm its performance. Consequences: Multicollinearity

Answer 6

No. We can still have a good model despite, for example, negative skewness.

Answer 7

A Q-Q plot (Quantile-Quantile plot) is used to compare the distribution of a dataset to a theoretical distribution (e.g., normal distribution) by plotting their quantiles. Assess if residuals from a regression model are normally distributed.
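As a minimal sketch of what a normal Q-Q plot contains, the function below pairs standardized, sorted residuals with theoretical standard normal quantiles. The residual values and the plotting positions (i - 0.5)/n are illustrative assumptions; other plotting-position conventions exist. If the residuals are roughly normal, the points lie close to the 45-degree line.

```python
from statistics import NormalDist, mean, stdev

def qq_points(residuals):
    """Return (theoretical quantile, sample quantile) pairs for a normal Q-Q plot."""
    n = len(residuals)
    m, s = mean(residuals), stdev(residuals)
    # Standardize and sort the sample so it is comparable to N(0, 1).
    sample_q = sorted((r - m) / s for r in residuals)
    # Theoretical quantiles at plotting positions (i - 0.5) / n.
    theo_q = [NormalDist().inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return list(zip(theo_q, sample_q))

# Hypothetical regression residuals.
residuals = [-1.2, 0.3, 0.8, -0.5, 1.9, -0.1, 0.4, -1.6]
points = qq_points(residuals)
for theoretical, observed in points:
    print(f"{theoretical:6.2f}  {observed:6.2f}")
```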

Answer 8

It's used to show the relationship between two variables by plotting their values as coordinate points. Ex: Explore if study hours X and test scores Y have a linear relationship.

Answer 9

Sampling error is the difference between a sample statistic (e.g., sample mean, sample proportion) and the corresponding population parameter (e.g., population mean). For example: You survey 1,000 people from a city to estimate the average income. The sample mean might differ from the actual population mean due to sampling error. Reduce it by increasing the sample size.
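The effect of sample size can be checked exactly on a tiny, made-up population by enumerating every possible sample. The incomes below are hypothetical; the sketch computes the root mean squared sampling error of the sample mean for several sample sizes.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, pvariance

# Hypothetical population of 8 incomes (in thousands).
population = [20, 25, 30, 35, 40, 50, 60, 100]
mu = mean(population)

def rmse_of_sample_mean(n):
    """Root mean squared sampling error over all samples of size n (no replacement)."""
    errors = [mean(sample) - mu for sample in combinations(population, n)]
    return sqrt(mean([e * e for e in errors]))

for n in (2, 4, 6):
    print(n, round(rmse_of_sample_mean(n), 2))
```

The typical sampling error shrinks as n grows, and it reaches zero only when the whole population is sampled.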

Answer 10

Ensure the sample is chosen randomly so every member of the population has an equal chance of being selected. This reduces the likelihood of systematic bias.

Answer 11

Divide the population into subgroups (strata) based on characteristics like age, income, or region, and take random samples from each. Ensure that each subgroup is proportionally represented in the sample.
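A minimal sketch of proportional stratified sampling follows. The population, the stratum labels, and the function name are assumptions for illustration. Each stratum contributes a share of the sample equal to its share of the population.

```python
import random

def proportional_stratified_sample(population, stratum_of, sample_size, seed=0):
    """Draw a random sample from each stratum, proportional to stratum size."""
    rng = random.Random(seed)
    strata = {}
    for unit in population:
        strata.setdefault(stratum_of(unit), []).append(unit)
    sample = []
    for members in strata.values():
        # Rounding can make the total differ slightly from sample_size in general.
        k = round(sample_size * len(members) / len(population))
        sample.extend(rng.sample(members, k))
    return sample

# Hypothetical population: 60 units in the north region, 40 in the south.
population = [("north", i) for i in range(60)] + [("south", i) for i in range(40)]
sample = proportional_stratified_sample(population, lambda u: u[0], sample_size=10)
counts = {region: sum(1 for u in sample if u[0] == region) for region in ("north", "south")}
print(counts)
```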

Answer 12

Divide the population into clusters (e.g., geographic areas or naturally occurring groups) and randomly select clusters for sampling. Makes data collection more efficient by sampling whole clusters instead of individuals.

Answer 13

A quasi-experiment lacks the random assignment of participants to treatment and control groups that an RCT has.

Answer 14

Compares outcomes between a treatment group and a control group before and after an intervention.
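Assuming this card describes a difference-in-differences comparison, the estimate is the change in the treatment group minus the change in the control group. The outcome values below are made up for illustration.

```python
def diff_in_diff(treat_before, treat_after, control_before, control_after):
    """Change in the treated group minus change in the control group."""
    treated_change = treat_after - treat_before
    control_change = control_after - control_before
    return treated_change - control_change

# Hypothetical average outcomes before and after an intervention.
effect = diff_in_diff(treat_before=50, treat_after=62,
                      control_before=48, control_after=53)
print(effect)
```

The control group's change (5) stands in for what would have happened to the treated group without the intervention, which is valid only if both groups would otherwise have followed parallel trends.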

Answer 15

Panel data (also known as longitudinal data) is a type of dataset that contains observations of multiple entities (such as individuals, firms, countries, etc.) over multiple time periods. It combines elements of cross-sectional data (data collected at one point in time) and time-series data (data collected over time for a single entity).

Answer 16

Fixed effects control for factors unique to each entity that do not change over time (time-invariant factors, e.g. culture). The effect is fixed for each cross-sectional unit across time. Time-invariant factors are automatically controlled for in FE because these models use only **within-entity** variation over time.

**Fixed Effects (FE)** is a method used in panel data analysis to control for unobserved factors that don’t change over time but could influence the dependent variable.

**Key idea:** It focuses on changes **within** each individual (like a person, firm, or country) over time, removing the effect of time-invariant characteristics.

Simple steps:
1. **Remove what doesn’t change:** Fixed effects filter out any constant characteristics of individuals (e.g., gender, long-term preferences).
2. **Focus on within variation:** The analysis only uses differences **inside each individual unit** over time to estimate relationships.
3. **Control for bias:** This avoids bias from factors that are constant over time but differ between individuals.

Example: Imagine studying how work hours affect productivity across workers. Workers might differ in skills (constant over time). Fixed effects handle this by only looking at **how changes in work hours for the same worker** impact their productivity, ignoring cross-worker differences in skills.

When to use:
- When you suspect unobserved variables (e.g., personality, location) might bias your results, but these factors don’t vary over time.

Further examples:
1. Students and Test Scores
Question: How does study time affect test scores?
Problem: Some students are naturally smarter than others, which might bias the results.
Fixed Effects Solution: Compare each student’s test scores across the different tests they take. This removes the effect of natural intelligence (which doesn’t change over time) and focuses only on how changes in study time for the same student affect their scores.
2. Cities and Pollution
Question: Does traffic volume increase air pollution in a city?
Problem: Some cities are naturally more polluted than others due to geography or industry.
Fixed Effects Solution: Look at the same city over time and how changes in traffic volume affect pollution. This removes constant factors like geography and focuses only on how variations in traffic affect pollution within each city.
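The within transformation behind fixed effects can be sketched on the work-hours example. The numbers are hypothetical and noise-free: output is skill plus 2 times hours, and the more skilled worker happens to work fewer hours. A pooled regression is misled by the skill difference, while the regression on entity-demeaned data recovers the slope of 2.

```python
from statistics import mean

def ols_slope(x, y):
    """Slope from a simple regression of y on x (with intercept)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

def within_transform(values, ids):
    """Subtract each entity's own mean from its observations."""
    group_means = {i: mean([v for v, j in zip(values, ids) if j == i]) for i in set(ids)}
    return [v - group_means[i] for v, i in zip(values, ids)]

# Hypothetical panel: two workers observed in three periods each.
worker = ["A", "A", "A", "B", "B", "B"]
hours = [1, 2, 3, 4, 5, 6]
skill = {"A": 10, "B": 0}  # unobserved, constant over time
output = [skill[w] + 2 * h for w, h in zip(worker, hours)]

pooled_slope = ols_slope(hours, output)
within_slope = ols_slope(within_transform(hours, worker),
                         within_transform(output, worker))
print(round(pooled_slope, 3), round(within_slope, 3))
```

Because skill is constant within each worker, demeaning removes it completely, which is why time-invariant variables cannot themselves be estimated in a fixed effects model.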

Answer 17

Time-invariant variables can be estimated because random effects assume that these factors are **uncorrelated** with the independent variables.

Answer 18

Not exactly. A one-way panel regression model can be a fixed effects model, but it can also be a random effects model depending on how the unobserved effects are treated.

Answer 19

A one-way panel regression model accounts for unobserved effects that vary only across entities or only across time, but not both.

Answer 20

A two-way panel regression model accounts for unobserved effects that vary both across entities and over time. It includes both entity fixed effects and time fixed effects, making it more comprehensive.

Answer 21

Usually no, because it's not needed.

Answer 22

An event study is considered a quasi-experimental design because it has no randomization. It relies on naturally occurring events rather than a random assignment of treatment.

Answer 23

Causal inference is the process of determining whether and how a change in one variable (the cause) directly influences another variable (the effect). It aims to go beyond observing correlations or associations by establishing a cause-and-effect relationship between variables. Causal inference revolves around estimating the treatment effect by comparing observed outcomes to counterfactuals, often using control groups or statistical methods to approximate the unobservable counterfactual.

Answer 24

A counterfactual is the hypothetical scenario representing what would have happened in the absence of the treatment.

Answer 25

In a staggered rollout design, the treatment or intervention is introduced to different entities at different points in time. Ex. Different regions of the country implement the smoking ban in public places at different times: Region A in January 2023, Region B in June 2023, Region C in December 2023.

Answer 26

In a non-staggered rollout design, the treatment or intervention is introduced to all treated entities at the same point in time. Ex. A country implements a nationwide smoking ban in public places on January 1, 2023, applying to all regions simultaneously.

Answer 27

Event time refers to a time scale centered around a specific event or intervention (e.g., a policy implementation, a treatment, or a natural disaster). Day 0 (t = 0) is the day of the event, for example a merger. Time is measured relative to this event (e.g., Day -5, Day +1), much like we count the year 2025 because something happened 2025 years ago.

Answer 28

Calendar time refers to the actual date or time period when an observation occurs, using a universal, chronological scale (e.g., years, months, days). Time is measured in absolute terms (e.g., January 2022, Q3 2023). In other words, ordinary time as we normally talk about it.

Answer 29

Time periods before the event occurs. The pre-event window directly examines what happens before the event. Do not confuse it with the estimation window, which provides a benchmark for "normal" returns that is used to calculate abnormal returns.

Answer 30

Same as event time. It refers to the time period during which the event occurs, t = 0.

Answer 31

The post-event window includes the time periods after the event occurs

Answer 32

What would have happened if we had not had the treatment? In the context of CAPM, the counterfactual is what the return on an asset would have been if CAPM perfectly explained returns. CAPM only accounts for market risk (systematic risk), not firm-specific risk.

Answer 33

Two-Way Fixed Effects accounts for both entity-specific and time-specific unobserved factors. It isolates the effect of interest by controlling for unobserved heterogeneity

Answer 34

The calendar-time portfolio method analyzes portfolio returns over regular calendar periods (e.g., monthly or yearly) to evaluate long-term performance or strategies. For example, you might group stocks into value and growth portfolios based on their book-to-market ratios and track their average returns month by month to see if value consistently outperforms growth. The key difference is that the calendar-time method looks at overall trends across time, while the event study method zeroes in on the effects of a particular event.

Answer 35

Portfolio sorts refer to a method used in finance to group assets (e.g., stocks) into portfolios based on certain characteristics or metrics, such as size, value, momentum, or other financial variables. Once assets are sorted, they are grouped into portfolios and divided into quantiles or other bins.
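A single sort can be sketched as follows. The six stocks, their book-to-market ratios, and their returns are made-up values, and equal-weighted terciles are an assumption for illustration. Stocks are ranked on the characteristic, split into bins, and average returns are compared across bins, including a high-minus-low hedge portfolio.

```python
from statistics import mean

def single_sort(stocks, n_bins):
    """Sort stocks on a characteristic and return the average return per bin."""
    ranked = sorted(stocks, key=lambda s: s["characteristic"])
    size = len(ranked) // n_bins
    averages = []
    for b in range(n_bins):
        # The last bin absorbs any remainder when the split is uneven.
        end = (b + 1) * size if b < n_bins - 1 else len(ranked)
        averages.append(mean([s["ret"] for s in ranked[b * size:end]]))
    return averages

# Hypothetical stocks: characteristic = book-to-market ratio, ret = monthly return.
stocks = [
    {"characteristic": 0.8, "ret": 0.03},
    {"characteristic": 0.2, "ret": 0.01},
    {"characteristic": 1.2, "ret": 0.05},
    {"characteristic": 0.4, "ret": 0.02},
    {"characteristic": 1.0, "ret": 0.04},
    {"characteristic": 0.6, "ret": 0.02},
]
low, mid, high = single_sort(stocks, n_bins=3)
high_minus_low = high - low  # return on the hedge portfolio
print(low, mid, high, high_minus_low)
```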

Answer 36

Portfolio sorts are simple and intuitive, grouping assets into portfolios based on a characteristic (e.g., size or value) and comparing returns. They are robust to outliers, non-parametric (no linearity assumptions). Panel regressions allow for controlling multiple variables and provide more precise estimates but can be sensitive to outliers and require assumptions about linearity.

Answer 37

Sorting bias: sorting stocks into portfolios can be biased. Fix: Larger data set. Also: Portfolio sorts difficult to apply with more than two factors. Fix: Allow TWFE?

Answer 38

Assets are sorted into portfolios based on the primary characteristic while controlling for another characteristic. Sorting order is important. First split into large, medium, and small.

Answer 39

Each parameter is independently sorted. Assets are sorted into portfolios based on a single characteristic without considering any other variables.

Answer 40

Independent

Answer 41

To motivate and focus the research.

Answer 42

Motivate theory development

Answer 43

Is relevant, researchable, and represents a gap in knowledge.

Answer 44

Data is collected at a specific point in time from a cross-section of respondents, so the basis for causal inference is weak.

Answer 45

It is the plan for how a research project will be conducted.

Answer 46

Facilitates the formation of a research question that is relevant and researchable.

Answer 47

Think natural experiment: causality. An event occurs to a specific group of people outside the control of the researchers, but in such a way as to resemble random assignment. Data is collected from before and after the event, and causality is established.

Answer 48

You may answer a research question by posing hypotheses. The hypotheses must be empirically testable.

Answer 49

Operationalization specifies how a construct will be measured.

Answer 50

The researcher wants to get deep knowledge about a certain phenomenon.

Answer 51

The researcher, through a literature review, finds that there is a lack of theory explaining a certain phenomenon. Then, decides to investigate.

Answer 52

Should in detail describe how the research has been conducted, such as who did what, how, and when.

Answer 53

Start by looking at existing questionnaires on similar topics or theories.

Answer 54

You remember certain things better than others, which may bias your answers.

Answer 55

Conducting a survey on companies and sustainability, where only the companies interested in that area join.

Answer 56

Probability sampling

Answer 57

Not all members of the population have a known or equal chance of being included in the sample, i.e. there is no random selection.

Answer 58

A sampling list is a practical subset of the sampling frame, used for selecting the actual sample in a study.

Answer 59

Validity is how well a measure reflects what it intends to measure, and reliability is about the consistency of measurements.

Answer 60

It is the process of organizing and labeling data.

Answer 61

You move from data-text (empirics) to higher analytical levels by aggregating and condensing. (Condensing = reducing large amounts of data or information into a more concise and summarized form while retaining the most essential parts. Aggregating = combining or grouping multiple pieces of information, data points, or observations into larger units or categories based on shared characteristics.)

Answer 62

Inter-rater reliability

Answer 63

Unstandardized beta coefficients for the statistically significant independent variables.

Answer 64

Residual variance that is not explained by the regression coefficients.

Answer 65

The beta coefficient for the X variable indicates the slope of the regression line.

Answer 66

It is the variance that is not explained by the regression coefficients

Answer 67

It adjusts downward for each additional independent variable.
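Assuming this card refers to adjusted R², the penalty can be written out with the standard formula 1 - (1 - R²)(n - 1)/(n - k - 1), where n is the number of observations and k the number of independent variables. The input values below are hypothetical.

```python
def adjusted_r2(r2, n_obs, n_regressors):
    """Adjusted R-squared: penalizes R-squared for each added regressor."""
    return 1 - (1 - r2) * (n_obs - 1) / (n_obs - n_regressors - 1)

# Same R-squared and sample size, but more regressors gives a lower adjusted value.
print(round(adjusted_r2(0.5, 101, 10), 4))
print(round(adjusted_r2(0.5, 101, 20), 4))
```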

Answer 68

It is the explained variance in the regression equation.

Answer 69

The relative effect size of each independent variable on the dependent variable.

Answer 70

A too long model reduces the precision of the beta coefficients, whereas a too short model causes a systematic bias in the parameter estimates.

Answer 71

In simple regression (one X variable), the standardized beta is the slope of the regression line.

Answer 72

What should happen?

Answer 73

What is happening?

Answer 74

What might be happening?

Answer 75

Why or how is it happening?

Answer 76

Research question/purpose: To Design/Control. Researcher: Attached inside.

Answer 77

Research question/purpose: To describe/explain. Co-Produce Knowledge with Collaborators. Researcher: Attached Inside

Answer 78

Research question/purpose: To Design/Control. Normative questions. Researcher: Outside. This form of research goes beyond describing or explaining a social problem and also seeks to obtain evidence-based knowledge or relative success. Evaluation researchers typically take a distanced and outside perspective of the designs or policies being evaluated. Inquiry from the outside is necessary.

Answer 79

Research question/purpose: To describe and explain a social phenomenon. Basic Science with Stakeholder Advice. Researcher: The researcher is a detached outsider of the social system being examined. The researcher directs and controls all research activities.

Answer 80

Through interaction and reflection.

Answer 81

Recall bias

Answer 82

Is used to map change, and to understand the mechanisms whereby change happens.

Answer 83

All three answers.

Answer 84

Correlational

Answer 85

A variance approach is characterized by **analysis of variables seeking answers** to what causes what, whereas a process approach is characterized by studying interrelations between events, seeking answers to how things develop and change over time.

Answer 86

Delimited in space and time, in-depth, and context dependent.

Answer 87

Situating, grounding, diagnosing the problem and selecting a question.

Answer 88

Taking field notes on who said what and when.

Answer 89

A specific phenomenon present in the case. (An instrumental study is used to explore or understand broader phenomena by focusing on a specific case, event, or example. The primary goal is not the case itself but the insights it provides about a larger issue or theory.)

Answer 90

The difficulty in asserting ourselves in relation to other researchers and experts, or determining what we can claim is new and relevant with our research.

Answer 91

How knowledge is related to action, and how theory is related to practice.

Answer 92

Descriptive

Answer 93

Is analytical, researchable, and permits more than one answer.

Answer 94

All three answers

Answer 95

Trustworthiness, credibility and transferability.

Answer 96

Undertaken to describe, explain or predict a social phenomenon.

Answer 97

All three answers.

Answer 98

May motivate theory development.

Answer 99

Establishing general laws and empirical generalization.

Answer 100

When it is testable using empirical data.

Answer 101

Descriptive and exploratory.

Answer 102

Problem of chaos, representation and authority.

Answer 103

Both qualitative and quantitative data

Answer 104

A clear focus of a bounded situation or system, and intense examination of the setting

Answer 105

In order for others to be able to conduct a similar study.

Answer 106

An analysis focusing on the terms and labels that are found in the empirical data. NOT quotes and examples

Answer 107

Have it connected to the research question --> good qualitative research design.

Answer 108

1. Homoskedasticity 2. Normality 3. No Autocorrelation 4. Linearity

Answer 109

It creates a smiley because of heteroskedasticity

Answer 110

The central limit theorem states that when you take sufficiently large random samples from a population with any distribution (e.g., uniform, skewed, or normal), the sampling distribution of the sample mean will approximate a normal distribution.
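One way to see this without simulation is to compute the exact distribution of a sum of n independent draws from a skewed population and watch its skewness shrink toward zero, the value for a normal distribution. The three-point population below is a made-up assumption. Skewness is unaffected by rescaling, so the skewness of the sum equals that of the sample mean.

```python
from math import sqrt

def convolve(p, q):
    """Distribution of the sum of two independent discrete variables."""
    out = {}
    for a, pa in p.items():
        for b, pb in q.items():
            out[a + b] = out.get(a + b, 0.0) + pa * pb
    return out

def skewness(dist):
    """Third standardized moment of a discrete distribution."""
    m = sum(v * p for v, p in dist.items())
    var = sum((v - m) ** 2 * p for v, p in dist.items())
    third = sum((v - m) ** 3 * p for v, p in dist.items())
    return third / var ** 1.5

def distribution_of_sum(population, n):
    """Exact distribution of the sum of n independent draws."""
    dist = {0: 1.0}
    for _ in range(n):
        dist = convolve(dist, population)
    return dist

# Hypothetical, strongly right-skewed population (value: probability).
population = {0: 0.7, 1: 0.2, 5: 0.1}
skews = {n: skewness(distribution_of_sum(population, n)) for n in (1, 5, 30)}
for n, s in skews.items():
    print(n, round(s, 3))
```

The skewness falls at the rate 1/sqrt(n), so larger samples give a sampling distribution that looks increasingly symmetric and normal.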

Answer 111

1. Larger sample size 2. Deflate the variables: normalize the X variable so that it does not depend on size 3. Take the natural logarithm

Answer 112

1. Define the event and establish the event window. It should be short, usually 3 days (-1, 0, +1), because the information can reach the stock market early or late.
2. Define the estimation window (120 days in his example).
3. Define the post-estimation window. This is typically of little interest here, as opposed to in a regression.
4. Establish the firm selection criteria. Make sure that the shares are frequently traded during the event window. Frequent trading ensures accurate pricing: if shares are not frequently traded, the observed price may be outdated and not reflect the true market value during the event. Example: A stock that trades infrequently might show no price change during the event, not because the event had no impact, but because there were no trades to update the price.
5. Estimate the model parameters (alpha hat and beta hat) using data in the estimation window.
6. Measure the abnormal returns for the shares in the sample (CAR).
7. Conduct tests. Define null and alternative hypotheses. Measure the abnormal returns. Determine levels of significance for tests.
8. Present results and diagnostics.
9. Interpret results and draw inferences and conclusions.
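Steps 5 and 6 can be sketched with a market model, which is one common choice of benchmark and is assumed here. All returns are made up, and the estimation window is shortened to six days for readability (the card mentions 120). Alpha and beta are estimated in the estimation window, abnormal returns are actual minus predicted returns in the event window, and CAR is their sum.

```python
from statistics import mean

def fit_market_model(stock, market):
    """OLS estimates of alpha and beta in: stock = alpha + beta * market."""
    mm, ms = mean(market), mean(stock)
    beta = (sum((m - mm) * (s - ms) for m, s in zip(market, stock))
            / sum((m - mm) ** 2 for m in market))
    alpha = ms - beta * mm
    return alpha, beta

# Hypothetical estimation window (noise-free so the estimates are exact).
est_market = [0.010, -0.010, 0.020, 0.000, -0.020, 0.015]
est_stock = [0.001 + 1.5 * m for m in est_market]

# Hypothetical 3-day event window: days -1, 0, +1.
evt_market = [0.010, -0.020, 0.000]
evt_stock = [0.020, 0.030, 0.005]

alpha_hat, beta_hat = fit_market_model(est_stock, est_market)
# Abnormal return = actual return minus the return predicted by the model.
abnormal = [s - (alpha_hat + beta_hat * m) for s, m in zip(evt_stock, evt_market)]
car = sum(abnormal)  # cumulative abnormal return over the event window
print([round(a, 4) for a in abnormal], round(car, 4))
```

The predicted return plays the role of the counterfactual: what the stock would have earned in the event window had the event not occurred.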

Answer 113

Controlling for entity-specific and time-specific fixed effects. Focused on both dimensions: units and time. An event study needs both unit and time.

More detail on how to set it up: To set up a **two-way fixed effects (TWFE) study**, you aim to account for both **entity-specific** effects (e.g., individuals, firms, or countries) and **time-specific** effects (e.g., year, month). Here's a step-by-step guide:
1. Define your research question. Identify what you want to study, such as the effect of a policy or treatment on an outcome.
2. Collect panel data. Your dataset should have repeated observations for each entity over time (e.g., regions across multiple years). It needs:
- Dependent variable (Y): The outcome of interest (e.g., employment rates).
- Independent variable (X): The main variable of interest (e.g., minimum wage level).
- Other factors that might affect the outcome (e.g., GDP, population).
- Entity ID (e.g., region) and time period (e.g., year).
3. Specify the model.
4. Include fixed effects.
5. Ensure robustness. Use clustered standard errors to address heteroskedasticity and serial correlation, typically clustered at the entity level. Test for collinearity issues (e.g., perfectly collinear variables).
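For steps 3 and 4, a common way to write the TWFE specification is shown below. The notation is an assumption, not taken from the card: i indexes entities (e.g., regions), t indexes time periods (e.g., years), and Z collects the other control variables.

```latex
Y_{it} = \beta X_{it} + \gamma' Z_{it} + \alpha_i + \lambda_t + \varepsilon_{it}
```

Here \alpha_i is the entity fixed effect (absorbing everything constant within an entity), \lambda_t is the time fixed effect (absorbing shocks common to all entities in a period), and \beta is the effect of interest, such as the effect of the minimum wage level on employment rates.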

Answer 114

A scientific revolution refers to a fundamental transformation in the way science is conducted, leading to new paradigms or frameworks for understanding the world. Thomas Kuhn, a key figure in the philosophy of science, described it as a shift from "normal science" (routine work within an established framework) to revolutionary science, where anomalies and crises lead to a radical change in the underlying paradigm. The steps are:
1. Normal Science: Researchers work within a shared paradigm, solving problems using its methods and assumptions.
2. Crisis: Anomalies or unresolved problems challenge the paradigm, causing doubt among the scientific community.
3. Revolution: A new paradigm emerges, reshaping scientific theories, methods, and goals.
4. Return to Normal Science: Work resumes under the new paradigm.

Answer 115

There is an objective reality independent of human thought, and systematic inquiry can reveal universal truths about this reality (quantitative, generalizable). BUT it can oversimplify complex phenomena by ignoring subjective or contextual nuances.

Answer 116

Logical Positivism emphasizes objective observation and empirical verification. Only measurements are considered valid (quantitative). It ignores subjective (qualitative) phenomena that can't be measured or tested, such as social constructs. It rejects metaphysics.

Answer 117

Focuses on problem-solving and evaluates theories based on their practical utility rather than absolute truth (Quant & Qual). BUT can undervalue theoretical rigor.

Answer 118

Being scientific means adhering to the rules of science, rather than the topic investigated.
1. The goal is **inference**:
- descriptive inference
- causal inference
2. The procedures are **public**
3. The conclusions are **uncertain**
4. The content is the **method**

Answer 119

Grounding is the foundational step that ties the research question to a solid base of evidence, context, or theory, ensuring its relevance.

Answer 120

- Want to go deeper and explore a complex, vague, or new theme/phenomenon
- Dynamics of a process
- “Thick descriptions”
- Study meaning
- Detailed insights into why and how things happen

Answer 121

Grounded theory: Creating structure in coding, based on emergent, inductive ways of thinking. Open-minded. Conceptualisation of underlying patterns.
Content analysis: Good for large amounts of text. Umbrella term: word counts, spaces, important themes. Quantitative presentation.
Discourse analysis: Looking for meaning, deep, language as a form of social practice. Umbrella term: rhetoric, text/narrative analysis. Focus on language as social action, HOW things are said.

Answer 122

Claim - A statement that something is true or false, “it's dark outside”
Reasons - Something to support that claim, “when I look outside, I don't see any light”
Evidence - Something accepted as a fact
Convincing → Representing → linking claims to evidence
Claim that… because of reasons… which I base on this evidence...

Answer 123

Your own or others research; analyses, experiments, interviews etc. Authoritative statements and “accepted facts”.

Answer 124

Trustworthy: Research should have credibility and transferability. The text and language are central and should be interpreted as truthful.

Answer 125

Autocorrelation deals with correlations between residuals (temporal or spatial relationships), while heteroskedasticity focuses on unequal variability of residuals.

Answer 126

Overall plan for answering your research problem/question. A framework reflecting decisions regarding priority in relation to several aspects of the research process. The design should match the RQ. Research design turns a research question and objectives into a project. It is good if it considers strategies and choices for what data to collect, how to collect it, and how to access it. External validity (can be generalized to a broader population) → Internal validity (research design answers the RQ) → Reliability

Answer 127

Finding your way from “raw data” (rough and unsorted) to “making a statement”. Connect “claims” to “evidence”. Categorize data, organizing it into similar “chunks”. Inductive, Deductive, Grounded, Gioia etc.
Data (read/hear/see/feel) → Interpretation (make sense of data) → Statement (Claim/Argument)
Get to know your data → Mark the text → Code → Relate to theory

Answer 128

We add interpretation! Relate codes to the research question and to existing theory. Theorize: can we use this to make sense “at a higher level”? Read! → Code! → Analyze!

Answer 129

1. Linearity: Linear relationship between X and Y.
2. Normality of errors: The residuals (errors) are normally distributed.
3. Homoskedasticity: The residuals (errors) are evenly distributed. Constant error variance assumption.
4. No autocorrelation (independence): Residuals (errors) are not correlated across time.

Answer 130

Yes. Both rely on observed data and assumptions (e.g., no confounding trends) to infer causality without randomization. Often, in event study, the "control group" is just the counterfactual.

Answer 131

Portfolio sorts make it easy to see how average returns change across groups (e.g., quintiles of size or value), while regressions give coefficients that are harder to interpret. Sorts also handle extreme values better and don’t rely on strict linear assumptions.

Answer 132

Estimation Window: Provides a benchmark for "normal" returns, used to calculate abnormal returns. Usually 120 days. Do NOT confuse with pre-event window: directly examines what happens before the event.

Answer 133

Can only handle linear relationship. Difficult to handle inclusion/exclusion. Does not allow for a hedge portfolio.

Answer 134

Allows for non-linear relationship. Easy to handle inclusion/exclusion. Allows for hedge portfolio.

Answer 135

Only causality shows the direction of the relationship between variables

Answer 136

Cartesian dualism is the idea that the mind and body are two completely separate things. The mind is non-physical (thinking, consciousness), while the body is physical (material, extended in space). Knowledge comes from rational thinking and innate ideas that are certain and beyond doubt. Innate ideas are concepts or knowledge that are believed to be present in the mind from birth, without being learned through experience. Descartes argued that some truths, like the existence of God or basic logical principles, are built into the human mind naturally.

Answer 137

- **Explanandum**: The thing you want to explain (the phenomenon or question). Why is the sky blue?
- **Explanans**: The explanation or reason that answers it. Because sunlight scatters blue light more in the atmosphere.

Answer 138

The Lockean view sees knowledge as coming from sensory experience and reflection. According to John Locke, the mind starts as a blank slate and gains knowledge through perceptions and experiences, rejecting the idea of innate ideas.

Answer 139

Different groups compared at the same time.

Answer 140

generalizable to a larger population.

Answer 141

Credibility. External validity: can the result be generalized to a broader population? Internal validity: is the research design appropriate to answer the research question?

Answer 142

- When asking “how” and “why” questions
- When the investigator has little or no control over events
- When the focus is on a phenomenon in its real-life context, creating context-dependent knowledge
- For theory development, not statistical generalization; especially appropriate in new topic areas

Answer 143

A way of working. A step in the analytical process, but not everything. Coding is not necessary. Coding is most often inductive, but can also be deductive.

Answer 144

Content analysis

Answer 145

Focuses on deviations from entity-specific means and isolates within-entity variation while controlling for constant unobserved factors.
1. Avoids autocorrelation in the error term.
2. Preserves the degrees of freedom.
3. Does not rely on T-consistency (large time periods).
4. Is the standard method for fixed-effects regressions due to its simplicity and effectiveness.
First-differencing focuses on changes between consecutive periods, while time-demeaning focuses on deviations from the individual’s average.

Answer 146

Use the F-test.

Answer 147

Use the F-test.

(173 cards)