Biostatistics Flashcards

1
Q

What is statistics (using information and data)

A

Statistics is a body of techniques and tools used in the collection, organization, analysis, interpretation and presentation of information that can be stated numerically

It is the collection, presentation, analysis and interpretation of numerical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Explain the two types of statistics

A

Descriptive:
Describes the population. Summarizes measurements .
Involves: frequencies, proportions, measures of central tendency, measures of dispersion/variation.
e.g weight of final year medical students

Inferential:
Uses data from a sample to represent the population which the sample came from
e.g weight of final year medical students to represent weight of medical students as a whole

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe types of variables

A

Quantitative/Numerical:
Are just numbers, whether whole/integers(Discrete) or fractions(continuous). Any thing that can be counted or measured.

Qualitative/Categorical
Describes data that fits into categories.
3 types: Binary, nominal and ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

meaning of observations

A

Any subject that serves as the data source e.g people, schools

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

meaning of variables

A

The thing that can be measured e.g blood pressure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

meaning of values

A

The actual result gotten from measuring a variable e.g 130/75mmHg

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

how many people are at least 7 years old? Is this a qualitative or quantitative variable.

A

Quantitative because the people can be counted. Specifically discrete quantitative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are measures of central tendency/location

A

They are tools used to summarize entire quantitative datasets into the most likely value (basically like the average)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the 3 measures of central tendency

A

Mean
Median
Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

mean

A

Simply the arithemetic average
m = ∑x/n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Mean for grouped data

A

Mean = ∑fx /n

Where f = frequency of each group or class
x = mean value of the group
n = number of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

median

A

The mid value of a series of data
(n + 1)/2
Best for skewed data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Mode

A

Most frequently occurring observation in a series

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

4 common measures of variation/dispersion

A

Range
Interquartile range
Variance
Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Range

A

This is the difference between the largest and smallest values.

For grouped data, it is the difference between the mid-points of the extreme categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Interquartile range

A

This indicates the spread of the middle 50% of the data

IQR = upper quartile – lowest quartile

17
Q

Upper quartile and lower quartile

A

Find the median of the entire number series.
Find the median of the lower and upper halves, these two numbers are the lower and upper quartile respectively

18
Q

Variance

A

variance = (summation(x - m)squared)/(n-1)

Where x = each data point/value
m = mean
n = size of the sample

19
Q

Standard deviation

A

aka root mean square deviation
It is the square root of the variance

20
Q

Which method of dispersion to use for skewed data

A

range and IQR

21
Q

Formula for obtaining the standard deivation of grouped data

A

square root of (∑x² - (∑x)² /n)/(n-1)

22
Q

3 methods of data presentation

A

Text
Tables/charts
Graphs

23
Q

When can data be used within a text

A

When there are only two data points being compared

24
Q

Guidelines for drawing a good table

A

Should be able to stand on its own

Tables should be numbered in Arabic numerals in the order in which they appear (Table 1, 2 etc)

Title should be informative and written above the table

Better to remove grid lines
Use footnotes to explain abbreviations or symbols

25
Q

When can data be used within a table

A

When the data is more complex

26
Q

When can data be used within a graph

A

Useful when there are few data points or categories. Also to show trends

27
Q

Guidelines for drawing a good table

A

Should have an informative title

Title should be written below the graph

Figures should be numbered in Arabic numerals according to the order in which they appear

Legends should be clear (a legend is that colour-coded thing that gives you more information about the graph)

Use strong contrasting colours

28
Q

Guidelines when making a pie chart

A

Ensure the wedges all add up to 100%
Begin at 12 o’clock position
Go clockwise from largest to smallest
Show no more than 7 wedges
Use distinct colours for wedges

29
Q

What is the major difference between histograms and bar charts

A

Bar charts are used to represent discrete data, that’s why there are spaces
Histograms are used to represent continuous data

30
Q

Mention 7 types of graphs used for data presentation

A

Line graph
Stem leaf
Box and whisker plot
Frequency polygon
Histogram
Pie chart
Bar chart

31
Q

What are scales of measurement

A

Ways in which variables are defined and categorized. It determines the type of statistical analysis that is done.

32
Q

What are the 4 scales of measurement

A

Nominal
Ordinal
Interval
Ratio

33
Q

Nominal scale

A

For unordered categorical data
Places people or objects in mutually exclusive categories
Eg. gender

34
Q

Ordinal scale

A

For ordered categorical data
Ranks objects in order
Eg. Level of education

35
Q

Interval scale

A

For discrete dataFor discrete data
Units of measurement are equal throughout the full range of the scale but has no ‘true zero’ point
Zero does not represent the absolute lowest value
Addition and subtraction operations can be performed
Eg. Measurement of temperature in degrees

36
Q

Ratio scale

A

For continuous data
Has a true zero point (No numbers exist below zero)
Can calculate ratios between scale values
All 4 mathematical operations (addition, subtraction, multiplication and division) can be performed
Eg. Height, serum Calcium