Data Collection Flashcards

1
Q

Design points for a survey

A
  • make all questions clear don’t use technical jargon
  • make sure each question only asks about one issue
  • make questions as short as possible
  • avoid negative items as they can confuse respondent
  • avoid biased items and terms
  • use a consistent response method such as a scale of 1 to 7 or yes or no
  • sequence questions from general to specific
  • make the questions as easy to answer as possible
  • Define any unicorn usual terms for example when you were conducting a survey about open space zoning be sure to define what the term means
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

3 steps for the statistical process

A
  1. collect data (ie surveys)
  2. Describe and summarize the distribution of the values
  3. Interpret by means of inferential statistics and statistical modeling. (Ie draw general conclusions for the population based of the sample)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Types of measurement

A
  1. nominal data
  2. Ordinal data
  3. Interval data
  4. Ratio data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Types of variables

A
  1. Quantitative
  2. Qualitative
  3. Continuous
  4. Discrete
    4a. Binary
    4b. Dichotomous
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Nominal data

A

Nominal data are classified at a mutually exclusive groups or categories and lack intrinsic order.

Zoning classification, Social Security number, and sex are examples of nominal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Ordinal data

A

Ordinal data are ordered categories implying a ranking of the observations.

Even though ordinal data maybe give a numerical values such as 1, 2, 3, 4, the values themselves are meaningless, only the rank counts. So, even though one might be tempted to infer that 4 is twice 2, that is not correct. Examples of ordinal data or letter grades suitability for development in response scales on a survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Interval data

A

Interval data is data that has an ordered relationship where the difference between the scales has a meaningful interpretation.

The typical example of interval data is temperature, where the difference between 40 and 30° is the same as between 30 and 20°, but 20° is not twice as cold as 40°

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Ratio data

A

Ratio data is the gold standard of measurement we’re both absolute and relative differences have a meeting.

The classic example of ratio data is a distance measure, where the difference between 40 and 30 miles is the same as the difference between 30 and 20 miles, and in addition, 40 miles is twice as far as 20 miles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Continuous variables

A

Continuous variables can take an infinite number of values, both positive and negative, and with as find a degree of precision as desired.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Discrete variables

A

Discrete variables can only take on a finite number of distinct values. An example is the count of the number of events, such as the number of accidents per month.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Binary and dichotomous variables

A

A special case of discreet variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Inferential statistic

A

Inferential statistics his probability Siri to determine characteristics of a population based on observations made on a sample from the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Distribution

A

Distribution is the overall shape of all observed data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Gaussian distribution

A

The normal bell curve.
This distribution is symmetric and has the additional property that the spread around the mean can be related to the proportion observed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Symmetrical distribution

A

An equal number of observations are below and above the mean. (This is the case for normal distribution)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Skewed

A

More observations below the mean or more about the mean.

17
Q

Skewed to the right

A

When the bulk of the values are about the mean.

18
Q

Skewed to the left

A

Small values (such as zero) pull the distribution to the left.

19
Q

Central tendency

A

Typical or representative value for the distribution of observed values.

Measurements of central tendency are mean, median and mode.

20
Q

Mean

A

Is the average distribution

21
Q

Media

A

Is the middle value of a ranked distribution.

22
Q

Mode

A

Is the most frequent number in the distribution.

23
Q

Coefficient variation

A

Measured the relative dispersion from the mean by taking the standard deviation and dividing by the mean.

24
Q

Z-score

A

This is the standardization of the original variable by subtracting the mean and dividing by the standard deviation.

25
Q

Inter-quartile range or IQR

A

An alternative measure of dispersion.

IQR forms the basis for an alternative concept of outliers. Two fences are computed as the first quartile less 1.5 times the IQR and the third quartile plus 1.5 times the IQR. Observations that are outside these fences are termed outliers. This is visualized in a box plot aka box and whiskers plot.

26
Q

Statistical inference

A

The process for drawing conclusions about the characteristics of a distribution from a sample of data.

27
Q

Hypothesis test

A

A statement about a particular characteristic of a population.

28
Q

Null hypothesis

A

Point of departure or reference.

29
Q

Alternative hypothesis

A

The research hypothesis one wants to find support for by rejecting the null hypothesis.