C4: Graphical Techniques Flashcards

1
Q

Data

A

= a set of specific values observed in an experiment.

- comprises of a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sample

A

= sub-set collected from a popn.

• used to deduce some properties about the sample space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Statistical methods

A

Methods that enable us to look at information from a small collection of subjects/samples & make deductions about a larger collection of subjects/popn.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Individuals

A

Subjects/objects included in the study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Descriptive statistics

A

Branch of statistics that deals with the organising & presenting of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Variable

A

A characteristic of the individual to be measured/observed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Types Of Variables(2)

A

[A] Quantitative variable
= has a numerical measurement for which arithmetic operations makes sense.

[B] Qualitative variable
= describes an individual by placing an individual into a category/group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data sources

A

Where data is obtained.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Types Of Data(2)

A

[A] Population data
= data are from every individual of interest.
• popn parameter = numerical measure that describes an aspect of a popn.

[B] Sample data
= data are from only some of the individuals of interest.
• sample statistic = numerical measure that describes an aspect of a sample.
• varies from sample to sample.
• incomplete.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Raw data

A

Collected data that hasn’t been organized numerically in any way.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Array

A

An arrangement of raw data either in ascending or descending order of magnitude.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Scales Of Measurements(4) NOIR

A

[A] Nominal scale
= measures type of data that consists of names/labels/categories.
• order is not important.
eg. Eye colour.

[B] Ordinal scale
= categorized data that can be ordered/ranked.
• given positions from smallest to largest.
• differences between data values indeterminate & meaningless.
eg. Grades.

[C] Interval scale
= applies to data that can be arranged in order.
• difference between data values is meaningful.
eg. Temperature.

[D] Ratio scale
= data that can be arranged in order.
• differences between values & ratios of data values are meaningful.
• data at ratio level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Qualitative data

A

Nominal & Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Quantitative data

A

Interval & Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Data collection

A

The methods used to obtain pertinent information from the elementary units drawn into a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Approaches To Data Collection(3) DIE

A

[A] Direct observation
• done by asking questions.
• gives more accurate information.
eg. Human populations.

[B] Interview methods
• similar if not number 1.

[C] Experimentation.

17
Q

Terminology: Graphical methods

Range

A

Difference between the largest value and smallest value of the data set.

18
Q

Cluster

A

A group of points that fall very close together.

19
Q

Outlier

A

An observation that’s usually far from the rest of the data.

20
Q

Relative frequency

A

= A proportion of the data having a certain property.

• gives the proportion of the observations falling in a particular group.

21
Q

Frequency distribution

A

= an arrangement of data that allows the frequency of the occurrence of the values in each of several size classes.
• simplifies data without losing too many details.

22
Q

Plot Types(7) DPB2LSH

A
  • Dot plot.
  • Pie chart.
  • Box plot.
  • Bar graph.
  • Line graph.
  • Stem & Leaf plot.
  • Histogram.
23
Q

Dot plot

A

= plot of a batch of no. on a vertical or horizontal scale.
• each no. = a dot.
• dots should be distinct.

24
Q

Pie charts

A

= a circle divided into segments.
• size of each segment proportional to the importance of data category of random variable relative to the whole.
• used for categorical data —> ordinal scale.

25
Q

Bar graph

A

= a simple presentation of the categorical/discrete data via bars put side to side.
• bars should be of same thickness.

26
Q

Line graph

A

= graph used to show the value of a random variable over time.
• used to investigate trends in 2 variables which are related.

27
Q

Stem & Leaf plot

A

= a method of EDA(Exploratory Data Analysis) used to ran-order and arrange data into groups.
• good for plotting data with many observations (~ 15-50).
• numbers are split into 2 parts: stem & leaf.
• has a key.
• displays shape of the data distribution.
• shows us all data (in array format).
• helps us to spot cluster data values.
☆ when stem has too many leaves, split the stem into 2 or more line.

28
Q

Histogram

A
  • constructed from frequency table.
  • bars are continuous.
  • x-axis –> intervals.
  • y-axis –> no. of scores in each interval is height of a rectangle.
  • used for continuous variables & discrete variables.
  • shape varies.
  • shows distributional aspects of a continuous variable (symmetry + slowness of the data).
29
Q

Histogram Types(2)

A
  • Equal width Histogram.

* Unequal width Histogram.

30
Q

Histogram Interpretation

A
  • bars are disjoint + exhaustive.
  • frequency = height of histogram bars.
  • frequency density = area under histogram bar.
31
Q

Frequency polygon

A
  • obtained by joining the top midpoint of each bar of the Histogram to each other/join top points of class midpoint + frequency density of each class.
  • doesn’t represent basic data well.
  • area under it is not proportional to frequencies.
32
Q

Frequency curve

A
  • obtained by smoothing out frequency polygon.
  • used to compare distribution of data sets.
  • taken to represent the true distr. of popn from which sample is drawn.
33
Q

Class boundary

A

The exact value for each set of grouped data where one set of values ends & the other begins.