Chapter 1 - Introduction to data Flashcards
Randomized Experiment
An experiment in which individuals are randomly assigned to groups.
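Random assignment can be sketched in a few lines of Python. This is only an illustration: the group names "treatment" and "control", the helper assign_groups, and the patient labels are assumptions, not part of the flashcard.

```python
import random


def assign_groups(individuals, seed=None):
    """Randomly split individuals into two groups (hypothetical helper)."""
    rng = random.Random(seed)      # a seed makes the shuffle repeatable
    shuffled = list(individuals)   # copy so the input is left untouched
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    # The group names are assumed for illustration.
    return {"treatment": shuffled[:half], "control": shuffled[half:]}


patients = ["P1", "P2", "P3", "P4", "P5", "P6"]  # made-up labels
groups = assign_groups(patients, seed=1)
print(groups)
```

Because chance alone decides who lands in which group, each individual appears in exactly one group.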
Anecdotal evidence
Evidence based on very little data. If there is only one data point and the person dies after taking the medicine, that may be an extreme case, while the medicine may still be a good one.
Be careful of data collected in a haphazard fashion. Such evidence may be true and verifiable,
but it may only represent extraordinary cases.
What is a summary statistic?
A summary statistic is a single number summarizing a large amount of data.
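As a small illustration, the mean and the median are two common summary statistics. The interest rates below are made-up values, and the variable names are assumptions chosen for this sketch.

```python
from statistics import mean, median

# Hypothetical data: interest rates (in percent) for six loans.
interest_rates = [5.0, 7.5, 10.0, 12.5, 6.0, 7.0]

# Each call reduces the whole list to a single number.
average_rate = mean(interest_rates)
middle_rate = median(interest_rates)

print(average_rate, middle_rate)
```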
What are variables?
Columns represent characteristics, called variables.
What is a data matrix?
A table. A convenient and common way to organize data, especially if collecting data in a spreadsheet. Each row of a data matrix corresponds to a unique case (observational unit), and each column corresponds to a variable.
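A data matrix can be sketched in Python as a list of rows. The cases and variable names below are invented for illustration only.

```python
# Hypothetical data matrix: each dict is one case (row),
# and each key is one variable (column).
data_matrix = [
    {"name": "Anna", "age": 34, "homeownership": "rent"},
    {"name": "Bjarni", "age": 51, "homeownership": "own"},
    {"name": "Carla", "age": 27, "homeownership": "rent"},
]

variables = list(data_matrix[0].keys())       # the column names
n_cases = len(data_matrix)                    # the number of rows
ages = [case["age"] for case in data_matrix]  # one column of values

print(variables, n_cases, ages)
```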
What is a case or observational unit?
A row in a table.
What is a numeric variable?
A variable is numerical if it can take a wide range of numerical values and it is sensible to add, subtract, or take averages with those values.
What is a discrete variable?
A discrete variable is a numerical variable that can only take values with jumps between them, such as whole-number counts.
What is a continuous variable?
A continuous variable is a numerical variable that can take any value within a range, not only whole numbers.
What is a categorical variable?
A variable whose values are categories, usually recorded as text. The possible values of a categorical variable are called its levels.
What are ordinal and nominal variables?
An ordinal variable is a categorical variable whose levels have a natural ordering, while a categorical variable without such an ordering is called a nominal variable.
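The difference can be sketched in Python: the levels of an ordinal variable can be sorted meaningfully, while the levels of a nominal variable can only be counted. The education levels and colors below are hypothetical examples.

```python
from collections import Counter

# Hypothetical ordinal variable: levels listed in their natural order.
education_levels = ["primary", "secondary", "bachelor", "master"]
rank = {level: i for i, level in enumerate(education_levels)}

responses = ["master", "primary", "bachelor", "secondary", "primary"]
sorted_responses = sorted(responses, key=rank.get)  # sort by level order

# Hypothetical nominal variable: no natural order, so we only count levels.
colors = ["red", "blue", "red", "green"]
counts = Counter(colors)

print(sorted_responses)
print(counts)
```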
What is a scatterplot?
Scatterplots are one type of graph used to study the relationship between two numerical variables.
What is it called when two variables show some connection with one another?
When two variables show some connection with one another, they are called associated variables. Associated variables can also be called dependent variables, and vice versa.
What is it called when two variables don’t have a connection with one another?
Independent. If two variables are not associated, they are said to be independent.
What are explanatory and response variables?
When we suspect one variable might causally affect another, we label the first variable the
explanatory variable and the second the response variable.
For many pairs of variables, there is no hypothesized relationship, and these labels would not
be applied to either variable in such cases.