Lecture 1 Flashcards
Define Econometrics
Econometrics is the quantitative analysis (numbers) of actual economic phenomena (economic events)
It can also be defined as: the application of statistical methods to economic models
What is important to remember about economic data?
Economics is a non-experimental science (difficult to establish cause and effect as a result as there are so many factors at play - unlike physics, chemistry etc where extraneous variables are closely controlled to establish cause and effect)
As a result, the data is described as being weak/noisy etc and … the empirical evidence provided by econometrics is frequently inconclusive in that it struggles to establish cause and effect (confirming that a certain factor/factors cause economic phenomena/events)
What does Econometrics involve?
It may involve:
- developing statistical methods to estimate economic relationships
- testing economic theories
- evaluating economic policies
- forecasting future path of economic variables e.g. GDP, inflation etc (one of the most important applications of econometrics)
Give an example of the use of econometrics in a practical, real life situation
A University needs to estimate how much enrolment will fall by a £100 increase in tuition fees per semester and … predict whether its tuition revenue will rise or fall
State the econometric methodology in detail including the differences when investigating time-series, cross-sectional and panel data
1) Statement of theory or hypothesis (what you are investigating)
2) Specification of the mathematical model of the theory
3) Specification of the econometric model of the theory (normally same mathematical model but written with the inclusion of an error term or disturbance denoted by an epsilon)
4) Collect data (now the subscript, small letter(s) in front of each variable of the model, will vary depending on what type of data is used - note that the subscript is only put in front of the dependent, independent and extraneous variables (which is denoted by epsilon) and not any stand alone beta’s): time-series data will have subscript t for time, cross-sectional data will have subscript i for individual and panel data will have subscript it denoting both data for several individuals over time
5) Estimate the parameters of the econometric model (at this point you substitute values in for your beta(s) and denote your dependent variable (being measured typically left of equals sign alone) using its same symbol e.g. C but this time with a hat ^ on top of it which indicates that it is an estimation of the dependent variable
6) Conduct hypothesis testing (statistical test) to determine whether the data sufficiently supports the hypothesis/theory
7) Conduct forecasting or a prediction (I assume given that your hypothesis is statistically sound - so your hypothesis is true that’s when you can start using your model to make predictions and forecasts)
8) Use the model for control or policy purposes - basically just future use of model
What do we try to do using our econometric model and the data found from our sample?
Make an inference about the real world
Briefly state and describe the difference between the 2 types of statistics
1) Descriptive statistics
2) Inferential statistics
Descriptive statistics describe/summarise features from sample data whereas inferential statistics use the sample data to make predictions about the population
How important is data?
Data is extremely important in economics - it is described as being the ‘new oil’ - once converted into information it can be extremely useful for several purposes e.g. in making informed investment and policy decisions
In how many ways (that we look at in this module) can data be generated?
We look at 3 ways
What are the ways (that we look at in this module) can data be generated?
1) Experimental data
2) Quasi-experimental data
3) Observational data
Describe experimental data
- Involves control (receive no treatment - potentially using a placebo e.g. sugar tablets) and treatment/experimental groups - participants/subjects are randomly assigned to either the control or treatment group
- highly controlled laboratory conditions - typical of experiments in sciences like physics and chemistry (all variables and potential variables both exogenous/independent and endogenous/dependent are closely controlled so that only the independent variables have an effect on the dependent variable being measured/studied and no others)
- economics data may be collected experimentally but it can be difficult to show the affect of real world economic events and reactions in a lab - it is very difficult to predict what the effect of certain policies would be for example
Describe Quasi-experimental data
- here the main difference is that subjects aren’t randomly assigned to either the control or treatment group but instead they are assigned based on some criteria
- like experimental data, they aim to establish cause and effect between independent (exogenous) and dependent (endogenous) variables
Describe observational data
- this type of data is typical of social sciences like economics as it is often quite difficult to collect data in economics in a laboratory due to the immense number of factors at play and their unpredictable effect on the system
- here there is no control group as it is quite difficult to only observe yet somehow control which subjects receive treatment and which don’t
- there is also no assignment criteria here as there is no control group - all subjects that are observed are in the experimental/treatment group regardless of any criteria
What are exogenous and endogenous variables?
Exogenous (external) variables are the independent variables (being manipulated to see the affect on the dependent/endogenous variables) of a model - their cause is external to the model and their role is to explain other (dependent/endogenous variables and the outcomes of a model)
Endogenous (internal) variables, as mentioned above, are the dependent variables of a model which are measured and the effect on which is considered the outcome of the model - the effect on the endogenous variable is caused by its relationship with the independent variables of the model and potentially, if any, the extraneous variables of a model
How many types of dataset are there?
3 types
State the types of dataset
1) Cross-section data
2) Time-series data
3) Panel data