Chapter 2: Data Collection Flashcards

1
Q

Observation

A

A single member of a collection of items that we want to study, such as a person, firm, or region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Variable

A

Characteristic of the subject or individual, such as an employee’s income or an invoice amount

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data set

A

Consists of all the values of the variables for all of the observations we have taken as a whole

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data

A

Used as a plural. Data usually are entered into a spreadsheet or database as an n x m matrix

Specifically, each column is a variable (m columns) and each row is an observation (n rows)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Univariate data sets

A

Data sets with one variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Bivariate data sets

A

Data sets with two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Multivariate data sets

A

Data sets with more than two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Types of data - Categorical

A

Qualitative.
Values that are described by words rather than by numbers.

Verbal label such as vehicle type, pay type )car, truck , salary, hourly, etc) or coded (1, 2, 3 )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Types of data - Numerical

A

Quantitative.
Values that are described by numbers rather than words, such as counting, measuring something.

Discrete (ie. broken eggs in a carton, annual dental visits) or Continuous (patient waiting time or customer satisfaction percentages)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Coding

A

When values of categorical variable are represented using numbers.

Ie. 1 = cash 2 = check 3 = credit etc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Binary variables

A

Categorical variables that only have two values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Discrete

A

A variable with a countable number of distinct values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Continuous

A

A numerical variable that can have any value within an interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Time series data

A

If each observation in the sample represents a different equally spaced point in time (years, months, days)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Periodicity

A

The time between observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Cross- sectional data

A

If each observation represents a different individual unit (a person, firm, geographic area) at thee same point in time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Sample

A

A subset of the population that we will actually analyze

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Population

A

All of the items that we are interested in

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Census

A

An example of all items in a defined population.

A sample involves looking only at some items selected from the population while the census is an examination of all the items.

20
Q

Parameter

A

A measurement or characteristic of the population (eg. a mean or a proportion) Usually unknown because we can rarely observe the entire population.

21
Q

Statistic

A

A numerical value calculated from a sample (eg. a mean or proportion)

22
Q

Target population

A

Contains all the individuals in which we are interested

23
Q

Sampling frame

A

The group from which we take the samples (ex. are phone directories, voter registration lists, alumni associations mailing lists, or marketing databases)

24
Q

Random sampling

A

Items are chosen by randomization or a chance procedure

25
Q

Non-random sampling

A

Less scientific but it is sometimes used for expediency

26
Q

Simple random sample

A

Every item in the population of N items has the same chance of being chosen in the sample of n items

27
Q

Random number

A

A sampling to chose at random

Excels function =RANDBETWEEN(1,4) or any set of numbers

28
Q

Sampling without replacement

A

Once an item has been selected to be included in the sample, it can not be considered for the sample again

29
Q

Sampling with replacement

A

Once an item has been selected, it can be selected again

=RANDBETWEEN (a,b) function uses sampling with replacement

30
Q

Infinite population

A

When the sample is less than 5 percent of the population (ie. when n/N is less than or equal to .05), then the population is effectively infinite.

An equivalent statement is that a population is effectively infinite when it is at least 20 times as large as the sample (or when N/n is more than or equal to 20)

31
Q

Systematic sampling

A

Choosing every kth item from a sequence or list, starting from a randomly chosen entry among the first k items on the list

A systematic sample of n items from a population of N items requires the periodicity k be approximately N/n

32
Q

Strata

A

Homogeneous subgroups of known size

33
Q

Stratified sampling

A

Within each stratum, a simple random sample of the desired size could be taken.

Alternatively, a random sample of the whole population could be taken, and then individual strata estimates could be combined using appropriate weights

34
Q

Cluster samples

A

Taken from strata consisting of geographical regions

We divide a region (a city) into sub regions (blocks, subdivisions, school districts)

35
Q

Judgement sampling

A

Non-random sampling method that relies on the expertise of the sampler to choose items that are representative of the population

36
Q

Convenience sampling

A

Quick. Grabbing whichever sample is available and handy.

37
Q

Focus group

A

A panel of individuals chosen to be representative of a wider population, formed for open-ended discussion and idea gathering about an issue

38
Q

Non-response bias

A

Occurs when those who respond have characteristics different from those who don’t respond

39
Q

Selection bias

A

Self selected samples, ie someone who volunteers for a survey

40
Q

Response error

A

Occurs when respondents deliberately give false information to mimic socially acceptable answers, to avoid embarrassment, or protect personal information

41
Q

Coverage error

A

Occurs when some important segment of the target population is systematically missed

42
Q

Measurement error

A

Results when the survey questions do not accurately reveal the construct being assessed

43
Q

Interviewer error

A

When the interviewer’s facial expressions, tone of voice, or appearance influences the responses

44
Q

Sampling error

A

Uncontrollable random error that is inherent in any random sample

45
Q

Valid survey

A

A survey that measures what the researcher wants to measure

46
Q

Reliable survey

A

A survey that is consistent. In other words, will the responses from similar respondents stay the same over time?