Stata Lecture 1 Flashcards

1
Q

What are the two types of quantitative data?

A

Discrete (numerical) and Continuous

2
Q

What does standard deviation mean?

A

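A measure of spread: roughly the typical distance between individual results and the mean result
Calculated as the square root of the average squared difference from the mean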
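As a rough illustration of the definition above, here is a minimal Python sketch that computes the standard deviation by hand. The function names and the `scores` data are made up for the example. It shows both the divide-by-n version and the divide-by-(n - 1) version that is normally used for a sample.

```python
import math
import statistics

def population_sd(values):
    """Square root of the mean squared deviation from the mean (divide by n)."""
    mean = sum(values) / len(values)
    squared_deviations = [(x - mean) ** 2 for x in values]
    return math.sqrt(sum(squared_deviations) / len(values))

def sample_sd(values):
    """Same idea, but dividing by n - 1 (the usual choice for a sample)."""
    mean = sum(values) / len(values)
    squared_deviations = [(x - mean) ** 2 for x in values]
    return math.sqrt(sum(squared_deviations) / (len(values) - 1))

scores = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up example data, mean = 5

print(population_sd(scores))  # 2.0
print(sample_sd(scores))      # slightly larger, matches statistics.stdev(scores)
```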

3
Q

What is another term for the 50th percentile?

A

Median

4
Q

How is quantitative data described?

A

Mean
Range
Symmetry
Quartiles

5
Q

What is numerical/discrete data?

A

Can only take particular values e.g. How many apples?

6
Q

What is continuous data?

A

Can occupy any value over a continuous range e.g. height

7
Q

What are the 3 graphical methods by which quantitative data can be represented?

A

Histograms
Dotplots
Box and Whisker plots

8
Q

What does positively skewed mean?

A

More scores towards the lower end

Long tail towards the higher end

9
Q

When would you use a dotplot instead of a histogram?

A

When the sample size is small (less than 50)

10
Q

How is the length of the whiskers in a box and whisker plot determined?

A

Extend from the bottom/top of the box to the furthest observation that lies within 1.5 × the IQR
So a whisker is never longer than 1.5 × the IQR, and is shorter if the furthest observation is closer than that

Anything outside this is an outlier
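The whisker rule above can be sketched in Python as follows. This is only an illustration: the function name and the data are hypothetical, and the quartiles use the `"inclusive"` method of `statistics.quantiles`, which is one of several conventions, so other software may give slightly different quartiles.

```python
import statistics

def whiskers_and_outliers(values):
    """Return (lower whisker end, upper whisker end, outliers) using the 1.5 x IQR rule."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_limit = q1 - 1.5 * iqr
    upper_limit = q3 + 1.5 * iqr
    # Whiskers stop at the furthest observations that are still inside the limits
    inside = [x for x in values if lower_limit <= x <= upper_limit]
    outliers = [x for x in values if x < lower_limit or x > upper_limit]
    return min(inside), max(inside), outliers

data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]  # made-up data with one extreme value
low, high, outliers = whiskers_and_outliers(data)
print(low, high, outliers)  # 1 9 [30]
```

Here the quartiles are 3.25 and 7.75, so the IQR is 4.5 and the upper limit is 14.5. The upper whisker stops at 9, the furthest observation inside that limit, and 30 is plotted as an outlier.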

11
Q

What is a binary variable?

A

Only has 2 categories

12
Q

What are the types of categorical data?

A

Nominal and Ordinal

13
Q

What is nominal data?

A

Categories with no order e.g. male/female

Also includes binary data

14
Q

What is ordinal data?

A

Categories with ranked order e.g. 1 star rating, 2 star rating etc…
Usually a small set

15
Q

What is a variable?

A

A characteristic of the participants in a study that can take different values

16
Q

What are the two types of variable?

A

Quantitative and Categorical

17
Q

What graphical method is used to represent categorical data?

A

Bar Chart

18
Q

How is the median found?

A

Put the scores in order, then:
Middle score (if odd number of scores)
Mean of the two middle scores (if even number of scores)
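The two cases above can be written as a short Python function. This is a minimal sketch with a hypothetical function name. The standard library's `statistics.median` does the same job.

```python
def median(scores):
    """Middle score of the sorted data, or the mean of the two middle scores."""
    ordered = sorted(scores)
    n = len(ordered)
    middle = n // 2
    if n % 2 == 1:
        # Odd number of scores: take the single middle one
        return ordered[middle]
    # Even number of scores: average the two middle ones
    return (ordered[middle - 1] + ordered[middle]) / 2

print(median([3, 1, 2]))     # 2
print(median([4, 1, 3, 2]))  # 2.5
```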
19
Q

What is the mode?

A

Most frequently occurring score
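As a small illustration, Python's standard `statistics` module can find the most frequent score. The data below is made up. `multimode` returns every value that is tied for most frequent, which is a simple way to spot a sample with two equally common scores.

```python
import statistics

one_mode = [1, 2, 2, 3]           # made-up data: 2 occurs most often
two_modes = [1, 2, 2, 3, 3, 4]    # made-up data: 2 and 3 are tied

print(statistics.mode(one_mode))        # 2
print(statistics.multimode(two_modes))  # [2, 3]
```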

20
Q

What is numerically defined as an outlier?

A

Values outside the 95% range

21
Q

2 disadvantages of using the range to describe variation in a sample?

A

Extremely sensitive to outliers

Dependent on sample size (can only stay the same or get bigger as sample size increases)
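Both disadvantages can be seen in a short Python sketch. The function name and the numbers are made up for illustration.

```python
def value_range(values):
    """Range = largest value minus smallest value."""
    return max(values) - min(values)

sample = [12, 15, 11, 14, 13]  # made-up data

print(value_range(sample))         # 4
# A single extreme value changes the range dramatically
print(value_range(sample + [40]))  # 29
# Adding an ordinary observation can never make the range smaller
print(value_range(sample + [13]))  # 4
```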

22
Q

What is a bimodal data set?

A

Two peaks

23
Q

What is a uniform data set?

A

Even distribution of values over the range

24
Q

When should you use the mean/SD vs median/IQR to describe your data set?

A

Mean/SD = symmetrical, normally distributed data
Median/IQR = skewed/asymmetrical data
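To see why the choice matters, the Python sketch below computes both summaries for a small, made-up positively skewed sample. The quartiles use the `"inclusive"` method of `statistics.quantiles`, which is one convention among several and is an assumption of this example.

```python
import statistics

skewed = [1, 2, 2, 3, 3, 3, 4, 4, 50]  # made-up data with a long upper tail

mean = statistics.mean(skewed)
sd = statistics.stdev(skewed)
median = statistics.median(skewed)
q1, _, q3 = statistics.quantiles(skewed, n=4, method="inclusive")
iqr = q3 - q1

print(f"mean = {mean}, SD = {sd:.1f}")
print(f"median = {median}, IQR = {iqr}")
```

The single large value pulls the mean up to 8 and inflates the SD, while the median (3) and IQR (2) still describe where most of the scores lie.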
25
Q

What is a dichotomous variable?

A

Another word for binary variable

Two possible categories