Lesson 2.1: Data, Variables & Samples Flashcards

Central Tendencies and Variability

1
Q

population

A

Entire group of individuals or objects to be studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

sample

A

Subset of population that is being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

individual

A

Person or object that is part of the population being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

statistics

A

Numerical summary of a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

parameter

A

Numerical summary of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

inferential statistics

A

uses methods thattake a result from a sample, extend it to the
population, and measure reliability of the result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

variable

A

Variables are characteristics of an individual within a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Qualitative / Categorical Variables

A
  • Allow for classification of individuals based on some attribute
  • Cannot perform arithmetic operation on this data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Quantitative / Numerical Variables

A

Provide numerical measure of individuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Qualitative Variables

Types (3)

A
  1. Dichotomous
  2. Nominal
  3. Ordinal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Qualitative Variables

Dichotomous

A
  • Only 2 values
  • eg. present/absent, alive/dead
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Qualitative Variables

Nominal

A
  • Unordered
  • eg. A, B, AB, O Blood Type
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Qualitative Variables

Ordinal

A
  • ordered
  • eg. Rate Pain scale (mild to severe)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Quantitative Variables

Discrete

A
  • certain values gaps
  • eg. sick days per year
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Quantitative Variables

Continuous

A
  • no value gaps
  • eg. blood glucose levels
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Quantitative / Numerical

Interval Scale

A
  • Numerical data measured using an ordered scale
  • Difference between measurement is meaningful
  • But does NOT have a true zero

Example: How satisfied are you
from airline service
- Scale: 10, 8, 6 ,4
- ACT or SAT scores

17
Q

Quantitative / Numerical

Ratio Scale

A
  • Numerical data measured using
    an ordered scale
  • Difference between
    measurement is meaningful
  • And involves a true zero point

Most of the numerical data is ratio type because it has a true
zero
- Example: Height, Weight

18
Q

Categorical Variables in R

Factors

A
  • nominal or ordinal variables
  • factor(…) converts string variables to factors
  • factor(status, order=TRUE) for ordinal variables
  • factors (categories) listed alphabetically by default
19
Q

Categorical Variables in R

Factors: specific order

R code

A

factor(variable, order=TRUE, levels = c(…)

  • eg. status <- factor(status, order=TRUE, levels=c(“Poor”, “Improved”, “Excellent”)
  • assigns numbers to factors (eg. 1,2,3 for E, I, P )
20
Q

class(…)

A
  • displays the data type
  • eg. ordered, factor, string, numeric, etc.
21
Q

Bias

Types (3)

A

Sampling Bias
- Technique used to obtain the individuals to be in the sample tends to favor one part of the population

Non response bias
- Individual selected to be in the sample who do not respond

Response Bias
- Answer on the survey do not represent true feeling of the respondent

22
Q

Sampling in R

set.seed()

A
  • creates sampling population
  • need to set new population (…) every time
23
Q

sampling

sample(x,y)
sample(x, y, replace=T)

A
  • generate a set size (y) of randomly sampled values (x)
  • sample (1:10, 5) = 5 sampled values from range 1:10
  • sample(1:10, 5, replace = T) = can replace selected value for reselection
24
Q

nrow(DataFile)

A

returns number of rows in Data file

25
Q

floor(DataFile)

A

returns lowest integer value from calculation
rewatch video