introduction Flashcards

1
Q

a variable is

A

a set of characteristics that describe ana spect of participants in research e.g. gender, blood pressure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

two main types of variable

A

categorical

quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

categorical data

A

usually facts rather than numerical

- participants are classified into categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

two types of categorical

A

nominal and ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

nominal variables

A

unordered labelled characteristics

  • binary variables (just two categories)
  • observations can be assigned a code in the form of a number where the numbers are simply labels
  • can count but not order
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

example of a nominal variable

A

blood group: A, B, AB, O

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

ordinal variable

A
  • small set of ordered categories
  • categories might be labels or numbers
  • obs can be ranked
    e. g. house numbers and swimming level
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

example of ordinal variable

A

disease severity: none, mild, mod, sevre

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

categorical data are recorded as

A

numbers which represent specific categories e.g.

Gender: (1) male, (2) female

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

quantitative variables

A

valies have quantitative meaning. The higher the number the more there is of the concept
e.g. the tiger number for age means you’re older

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

quantitative variables are also known as

A

continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

distribution def

A

refers to the diff values that occur and the frequency with which they occur for a given variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

categorical data can be described using

A
  • frequency tables

- bar charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

how to describe quanitiaitve data

A

average. variation, symmetry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

average

A

what value characterises the middle of distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

variation

A

speed, dispersion, how far apart the values are from each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

symmetry

A

for each person that has a score below the average is there a corresponding person with the score the same distance about the avergae

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

types of average

A

mean, median mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

mean

A

sum of soccer divided by the number of scores

20
Q

median

A

rank the scores in order and the median is the value that divides the data in 2

21
Q

mode

A

the most frequently occurring score

22
Q

when is mode not useful

A

in quantitive data-there may be more than one mode, each value in study might appear only once- mode could be low/high

23
Q

disadvantages of the range

A
  • sensitive to unusually extreme values (outliers)

- dependent on sample size- as sample size gets larger the range cannot get smaller, but it can get larger.

24
Q

symmetry of quantitative variable

A

o Examples of symmetrically distributed date:
• 1,2,3,4,5
• -2,0,2
• 3,3,10,17,17

25
Q

graphical summary of quantitative data

A
  • histograms
  • dotplots
  • box and whisker plot
26
Q

what is the definition of standard deviation

A

the spread of data around the mean

- the average difference between the spare and the mean

27
Q

standard error

A

The standard error is the estimated standard deviation or measure of variability in the sampling distribution of a statistic. A low standard error means there is relatively less spread in the sampling distribution. The standard error indicates the likely accuracy of the sample mean as compared with the population mean.

28
Q

the lower the standard deviation

A

the more accurate the data

29
Q

Inter-quartile rane

A

IQR spans the middle 50% of score i.e. range between the lower and upper quartiles

30
Q

lower quartile

A

is the value below which 1/4 of socks lie

1/4(n+1)th score (25th percentile)

31
Q

upper quartile

A

is the value above which 1/4 of score lie

3/4(n+1)th score (75th percentile)

32
Q

histogram

A

a graph where the heights of the rectangular bars/bins are used to indicate the relative frequency which with values in specific ranges occur

33
Q

for histograms to show a clear shape, at least…

A

50 observations are required

34
Q

symmetrical histogram

A

bell curve

35
Q

positive skewed histogram

A

skewed to the left

36
Q

negative skewed histogram

A

skewed to the right

37
Q

bimodal histogram

A

will have two peaks

38
Q

uniform histogram

A

rectangle shape

39
Q

dot plot

A

is like a histogram that is turned back to front and flipped on its side

40
Q

in a dot plot each dot represents

A

an Ob

41
Q

the length of the brass of dot plots indicate

A

how common the value is

42
Q

when are dot plot especially useful

A

when plotting distributions for small sample sizes

43
Q

The Box and whisker lot graph indicated

A
  • median
  • lowe quartile
  • upper quartile
  • rage that contains most values
  • outliers
44
Q

outliers

A

extreme observations which are either low or high values

45
Q

in box and whiskers what would represent a positive skew

A

if the top part of the box is thicker than the bottom part and top whisker is slightly longer than the bottom

46
Q

what sort of summary statistics should be used for symmetric data (bell curve)

A

summary statistics that make use of all the data

47
Q

what sort of summary statistics should be used for asymmetric data (positive or negatively skewed)

A

summary stats that are insensitive to extreme values