Last part Flashcards

(45 cards)

1
Q

logistic regression

A

used mainly when DV is NOT A # – used to predict categories

2
Q

basic types of logistic regression:

A

logistic or probabilistic: binary outcomes

multinomial: discrete outcomes, not in relation to each other (categorical)

ordered: discrete outcomes on a scale in relation to each other
- poor, fair, excellent

not logistic: count-based outcomes, even though they are whole numbers

3
Q

discrete variable

A

A variable that can take on specific, separate values — usually whole numbers — and nothing in between.

ex: # of children, # of protests

used in: OLS, logistic, ordered logistic

4
Q

continuous variable

A

can take on any #

ex: voter turnout %

5
Q

logistic or probabilistic regression

A

2 options (yes/no)
OLS reg does not work with this bc the distribution consists entirely of 0s and 1s

6
Q

logistic v. linear regression

A

logistic = binary DV
- OUTPUT: probability or log-odds (probability between 0 and 1)
- use logistic when outcome is yes/no OR categories
- change in coefficients: change in log-odds / odds ratio

linear = continuous DV
- OUTPUT: raw value
- use linear when outcome is a #
- change in coefficients: change in DV

(see the sketch below)
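A minimal sketch of the contrast, assuming statsmodels and made-up data (names and values are illustrative, not from the course):

```python
# Fit a linear and a logistic model on synthetic data to compare outputs.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x = rng.normal(size=200)
X = sm.add_constant(x)  # adds the intercept column

y_cont = 2.0 + 1.5 * x + rng.normal(size=200)     # continuous DV
p = 1 / (1 + np.exp(-x))                          # true P(y = 1)
y_bin = (rng.uniform(size=200) < p).astype(int)   # binary DV

linear = sm.OLS(y_cont, X).fit()   # coefficients: change in the raw DV
logit = sm.Logit(y_bin, X).fit()   # coefficients: change in log-odds

print(linear.params)
print(logit.params)
print(np.exp(logit.params))        # exponentiated: odds ratios
```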

7
Q

binary outcomes

A

any DV that takes on two outcomes

Distribution is 0 or 1, so the equation for OLS regression doesn't work

the output table can be read much like a linear regression's (same layout), but the coefficients are in log-odds

8
Q

ORDINARY LEAST SQUARES

A

Most common linear regression, used when DV is continuous

uses normal basic equation of a regression

OLS finds the best-fitting line through the data by minimizing the squared distance between the actual values and the predicted values.

That’s why it’s called least squares — it minimizes the sum of squared errors!
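In symbols (a standard statement of the estimator, not specific to this course):

$$\hat{\beta} = \arg\min_{\beta_0,\,\beta_1} \sum_{i=1}^{n} \left(y_i - \beta_0 - \beta_1 x_i\right)^2$$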

9
Q

use of logarithms

A

logarithm = inverse of an exponent

help us assess curves like we see in binary data

taken using exponents: log base b of x is the exponent y such that b^y = x

10
Q

natural logarithm

A

NATURAL LOG is base “e” (an irrational number) bc e^(ln x) = x

$e^{\ln x} = x$

It’s log base e, where e ≈ 2.71828.

It’s called “natural” because it arises naturally in calculus, growth processes, and probability models.

**Natural logarithms are used to linearize exponential relationships, model log-odds, handle skewed data, and measure growth rates — they’re essential in regression and probability modeling.**
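A quick numeric check, assuming numpy (values are illustrative):

```python
import numpy as np

print(np.log(np.e))             # 1.0: ln is log base e
print(np.log(np.exp(5.0)))      # 5.0: ln undoes exponentiation
# Linearizing exponential growth: ln of e**(0.3 * t) is a straight line in t.
t = np.arange(5)
print(np.log(np.exp(0.3 * t)))  # [0.  0.3 0.6 0.9 1.2]
```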

11
Q

why is the exponent negative? why is it sometimes written as a positive

A

negative exponent constrains the value between 0 and 1; the positive-exponent version is the same function, just algebraically rearranged (see below)
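In symbols (standard algebra, not course-specific): with $z = \beta_0 + \beta_1 x$,

$$p = \frac{1}{1 + e^{-z}} \in (0, 1), \qquad \frac{1}{1 + e^{-z}} = \frac{e^{z}}{1 + e^{z}}$$

As $z \to \infty$, $e^{-z} \to 0$ so $p \to 1$; as $z \to -\infty$, $e^{-z} \to \infty$ so $p \to 0$. The positive-exponent version comes from multiplying top and bottom by $e^{z}$.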

12
Q

why isn't the y-intercept constrained to 0 and 1?

A

bc coefficients in logistic models are in log-odds, which can take any value; only the predicted probability is constrained to 0 and 1

13
Q

IN A REGRESSION TABLE: INTERCEPT

A

Log-odds of the outcome (e.g., segregation) when all Xs = 0 (not directly interpretable)

14
Q

IN A REGRESSION TABLE: DV

A

in the model output, the first thing after “formula =” is the DV

15
Q

negative exponents

A

use these to constrain output between 0 and 1 (valid probabilities)

16
Q

y-intercept

A

NOTE: Linear y-intercept = value of Y when all Xs are 0. In logistic, remember we are predicting log-odds, NOT raw numbers.

y-intercept is the NEGATIVE of the location parameter divided by the rate parameter

(location parameter = point at which the probability = 0.5 // midpoint, basically)

(rate parameter = tells you how fast the probability changes)
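In symbols (a standard identity; here the card's rate parameter is read as the scale $s$ of the logistic curve):

$$p(x) = \frac{1}{1 + e^{-(x - \mu)/s}} = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x)}} \quad\Rightarrow\quad \beta_1 = \frac{1}{s}, \qquad \beta_0 = -\frac{\mu}{s}$$

where $\mu$ is the location (the x-value at which $p = 0.5$), so the intercept is the negative of the location divided by the scale.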

17
Q

why isn't the y-intercept between 0 and 1?

A

bc the regression is in log-odds, not raw probabilities

18
Q

odds ratios

A

ratio of an outcome relative to its alternative

AKA likelihood of y = 1 divided by likelihood of y = 0
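A quick worked example: if P(y = 1) = 0.8, the odds are 0.8 / 0.2 = 4, i.e. the outcome is four times as likely as its alternative.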

19
Q

when reading coefficients for a logistic regression…

A

coefficients are listed as odds ratios

read in relation to 1 NOT 0 – so lower than 1 is negative, higher than 1 is positive
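A minimal sketch, assuming numpy and hypothetical logit coefficients:

```python
import numpy as np

log_odds = np.array([0.69, -0.51])  # hypothetical logit coefficients
print(np.exp(log_odds))             # ~[2.0, 0.6]: the odds ratios
# 2.0 -> odds double per unit increase in X (positive effect)
# 0.6 -> odds shrink to 60% per unit increase (negative effect)
# 1.0 would mean no effect, which is why you read against 1, not 0
```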

20
Q

IYENGAR and WESTWOOD (2014)

A

differentiates between policy, identity and affective polarization

identity polarization: alignment with others based on party affiliation, not policy

affective polarization: hostility toward members of other political parties

21
Q

WHEN TO USE LOGISTIC REGRESSION

A

ALL ABOUT DISTRIBUTION!
- binary distributions don't follow OLS

!!!! Standard errors and test statistics work the same way – distance to the line !!!!

22
Q

what to do about categorical and ordinal variables

A

need to adjust the logistic regression… these variants are called multinomial and ordered logistic regression

23
Q

categorical DV

A

Takes on several discrete, non-ordered categories

outcomes not assessed in relation to each other

in formula, K is the baseline outcome

24
Q

ordinal DV

A

each category is ordered, so PROBABILITIES are SUCCESSIVE

each outcome has to be compared in relation to the other outcomes.

ex: a 5-point LIKERT scale (very poor, poor, fair, good, very good)

(see the sketch below)
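A minimal sketch of both adjustments (this card and the previous one), assuming statsmodels (>= 0.12 for OrderedModel) and made-up data with a 3-level outcome coded 0/1/2:

```python
import numpy as np
import statsmodels.api as sm
from statsmodels.miscmodels.ordinal_model import OrderedModel

rng = np.random.default_rng(1)
x = rng.normal(size=300)
y = np.clip(np.round(1 + 0.8 * x + rng.normal(size=300)), 0, 2).astype(int)

# Multinomial: unordered categories; one coefficient set per non-baseline
# category (the baseline is the K-th outcome the previous card mentions).
mnl = sm.MNLogit(y, sm.add_constant(x)).fit()

# Ordered: one slope plus estimated cutpoints between successive categories.
ordered = OrderedModel(y, x[:, None], distr="logit").fit(method="bfgs")

print(mnl.params)
print(ordered.params)
```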

25
Q

V-DEM index

A

Varieties of Democracy. five components of democracy w/ subcomponents. resulting index is normalized from 0 to 1

26
Q

inter-coder reliability

A

typically 4+ ppl collecting the same data and **cross-referencing results**. Teams of coders include project managers, research assistants, and experts.

27
Q

Compiling data

A

Standardized scales using aggregate probability distributions across coders

Alternate versions using average, median, ordinal scores, and high-low estimates

These are “confidence intervals” for probability densities

Each variable has around 10 versions available in the codebook
28
Q

objective vs subjective criteria

A

objective: facts based on real things like laws, stats, and official records.

subjective: used when things aren’t clear-cut, like judging if one party controls vote counting.

29
Q

vignette coding

A

a method where experts rate complex or subjective political phenomena (like attacks on the judiciary) using tools like Likert scales (from 1-5), rather than relying on hard data. **It helps measure concepts that can’t be captured with purely objective indicators.**

30
Q

LIKERT SCALES

A

A common survey tool that asks respondents to rate their **agreement** or **perception** on a **scale** (e.g., from “Strongly Disagree” to “Strongly Agree”), usually 5 or 7 points, used to quantify attitudes or assessments. Use in cross-sectional data requires more individual judgment from the researcher.

31
Q

CROSS-SECTIONAL DESIGN

A

Snapshot in time; no time variable. Best for variables that don’t change much over time (e.g., constitution). Useful for multilevel models (grouping individual data by country).

32
Q

time series design

A

data that changes over time (e.g., regime change). Easier at macro-level (countries), hard at individual level. Requires fixed effects to control for units (e.g., year, country).

33
Q

FIXED EFFECTS

A

Control for unobserved differences across groups (like countries or years). Fits separate regression lines within each group. Used only when you have data for the whole population. (see the sketch below)
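A minimal sketch of the dummy-variable (within-group intercepts) version, assuming pandas + statsmodels and a hypothetical country panel:

```python
import pandas as pd
import statsmodels.formula.api as smf

df = pd.DataFrame({
    "country": ["A", "A", "B", "B", "C", "C"],
    "turnout": [60, 62, 70, 71, 55, 58],
    "gdp":     [1.0, 1.2, 2.0, 2.1, 0.8, 0.9],
})

# C(country) adds a dummy per country: each group gets its own intercept,
# absorbing unobserved, time-invariant differences across countries.
fe = smf.ols("turnout ~ gdp + C(country)", data=df).fit()
print(fe.params)
```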
34
Q

SCALING

A

transforming, adjusting, or interpreting data so that it can be meaningfully compared or applied across different levels, units, or contexts.

- easiest way of scaling data is to set values between 0 and 1; in this case, a value corresponds to a percent of the max value
- data can also be normalized or transformed (set to a particular curve)
- transforming usually means using natural logs to account for skew

(see the sketch below)
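A minimal sketch of the three options, assuming numpy and made-up values:

```python
import numpy as np

x = np.array([3.0, 10.0, 40.0, 400.0])

share_of_max = x / x.max()                    # value as a % of the max
minmax = (x - x.min()) / (x.max() - x.min())  # forced onto [0, 1]
logged = np.log(x)                            # natural log to tame skew
print(share_of_max, minmax, logged, sep="\n")
```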
35
Q

NORMALIZING

A

Normalizing data means data is fit to a normal distribution.

- normalized data could be scaled, corresponding to a percentile in the normal distribution
- could also be a raw # of SDs away from the mean
- raw values of V-DEM data are usually normalized as STANDARD DEVIATIONS

(see the sketch below)
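A minimal sketch, assuming numpy: the z-score is the raw number of SDs away from the mean.

```python
import numpy as np

x = np.array([3.0, 10.0, 40.0, 400.0])
z = (x - x.mean()) / x.std()  # standard deviations away from the mean
print(z)
```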
36
Q

unit of analysis

A

under what **parameters** are you collecting data?

country-level data is exhaustive -- bc it pulls from the whole population. can also do this with any JURISDICTION - a district, province, etc.

individual-level data is not - it's pulled from a sample

37
Q

cross-sectional design

A

snapshot in time comparing macro-level effects w/out the element of time

country-level variables that don't change much over time, like constitutional scope

38
Q

simpsons paradox

A

It’s when a trend appears in aggregate data, but **disappears** or **reverses** when you break it into subgroups.

happens when u mix 2 diff levels of analysis; only detectable if you have complete data on the target population.

You might find a strong relationship at the country level (like: “More immigration restrictions → fewer women in parliament”), but that trend could vanish or reverse when looking at specific regions, income levels, or years.

39
Q

FIXED, RANDOM AND MIXED EFFECTS

A

FIXED: deployed when u have the whole population

RANDOM: unit controls when you have a partial population

MIXED: fixed and random effects together for multilevel data

40
Q

CROSS-SECTIONAL V. TIME SERIES V. FIXED EFFECTS

A

cross-sectional = snapshot in time

time series = observes changes over time

fixed effects = controls for group-level differences
41
Q

count models

A

count model = type of statistical model used when your dependent variable is a count of events -- like # of protests, # of laws passed, or # of times someone voted

negative binomial is more often used bc it is more robust compared to POISSON (see the sketch below)
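A minimal sketch comparing the two, assuming statsmodels and synthetic overdispersed counts:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
x = rng.normal(size=500)
p = 1 / (1 + np.exp(x))               # NB success probability per trial
counts = rng.negative_binomial(2, p)  # overdispersed count DV
X = sm.add_constant(x)

poisson = sm.Poisson(counts, X).fit()
negbin = sm.NegativeBinomial(counts, X).fit()  # extra dispersion parameter

print(np.exp(negbin.params[:2]))  # incidence rate ratios (IRRs)
```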
42
Q

origins of poisson distribution

A

examines the distribution of occurrences - the rate at which diff outcomes occur
45
how does negative binomial build on the poisson distribution
🔁** The Poisson Distribution (Baseline)** Used for count data (e.g. number of protests, number of asylum claims). Assumes that the **mean = variance** — this is called *equidispersion*. Good for when your data is neatly distributed with little variance. ❗ The Problem: **Overdispersion In real life, the variance is often greater than the mean** (i.e. overdispersion). Poisson struggles here — it underestimates standard errors and can inflate significance. ✅ The Negative Binomial: A Fix The Negative Binomial distribution generalizes Poisson by introducing an extra parameter to model overdispersion. Here’s how it builds on Poisson:gauges rate of failures given # of trials
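In symbols, under the common NB2 parameterization (a standard result):

$$\text{Poisson: } \operatorname{Var}(Y) = \mu \qquad\qquad \text{Negative binomial: } \operatorname{Var}(Y) = \mu + \alpha \mu^2$$

where $\alpha > 0$ is the overdispersion parameter; as $\alpha \to 0$, the negative binomial collapses back to the Poisson.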
44
Q

LAMBDA

A

rate parameter. It tells you the average number of events per interval; lambda = 1 means you expect 1 event per interval.
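In symbols (the standard Poisson pmf):

$$P(Y = k) = \frac{\lambda^{k} e^{-\lambda}}{k!}, \qquad E(Y) = \operatorname{Var}(Y) = \lambda$$

e.g. with $\lambda = 1$, the chance of exactly one event in an interval is $e^{-1} \approx 0.37$.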
45
Q

NEGATIVE BINOMIAL

A

Regression shows Incidence Rate Ratios -- outcomes divided by non-outcomes -- and it accounts for overdispersion.

The negative binomial distribution becomes more symmetric and “normal-looking” as r increases; when r is small (like r = 1), it looks very skewed, almost like an exponential or geometric distribution.