Lectures 3- Tables and Charts Flashcards
What is an experiment?
is any activity aimed at collecting data.
Why do we present data?
we can communicate the salient
features of the data to others with little or no
difficulty. Example-Let us assume for the moment that we collected data on total
household income from a random sample of 1000 households drawn from St. Lucia, St. Vincent, St. Kitts and Trinidad & Tobago. The 1000 values of household income so collected constitute a dataset. It is physically impossible to look at a dataset of size 1000 and
successfully identify the salient features of the dataset. Summarising the data is necessary for analysis of a dataset.
Basic forms of a summary of a dataset
Summary Table and Frequency Table
Other forms of summary
The Cumulative Frequency Table
–The Relative Frequency Table
–The Percentage Frequency Table
Summary Table
provide users of the data with a summary
*bring together a ‘mass’ of connected information for digestion ‘at a glance’
*help both the researcher and the user to draw preliminary conclusions from the data
*can become cluttered very quickly and hence, confusing; be wary of truncated axes and changing the scales of either axis.
Frequency Table
are comprised of two columns
*the left column comprises a listing of the values assumed by the variable in the case of quantitative data or a listing of the categories/attributes in the case of qualitative data
*the listing of values in the quantitative case is presented either as grouped data or ungrouped data
*the right column displays the corresponding frequencies
*all the guidelines for table construction would apply
Constructing the grouped frequency table
Divide the range of values into a finite number of equal sub-intervals otherwise called classes
*Classes must be defined so that no observation from the survey data could fall into more than one class
*Too many classes will give the table a cluttered appearance. Suggest that you use between 6 and 15 classes
*Class Limits – lower limit and upper limit of the class
*Class Mark – midpoint of the class
*Class Width – difference between two successive class marks
*Class Frequency – number of observations that fall into the class
Frequency table grouped data
Raw data for the recorded ages at diagnosis of 80 patients with stage 1 carcinoma of the cervix.
84 20 31 43 24 76 67 55 46 59 48 52 74 23 35 65 36 63 51 49 52 48 53 47 38 68 37 67 45 55 48 58 47 57 72 23 65 35 61 31 53 43 54 44 55 45 56 46 33 63 34 64 51 41 52 42 53 43 54 44 55 45 32 62 33 63 34 64 27 77 56 46 57 47 58 48 59 49 34 74
1.Scan the data for highest and lowest observations
2.Find the difference between the highest and lowest score to determine range of values which must be divided into class intervals.
Class limits
Class limits, boundaries, width, midpoint
Age range (Years)
20 – 29
30 – 39
40 – 49
50 – 59
60 – 69
70 – 79
80 - 89
Consider the second interval 30 – 39 30 and 39 are referred to as the class limits. 30 is the lower limit of the second class interval. 39 is the upper limit of that interval.
Class boundaries
Suppose an individual is 29 years and 10 months of age. In which interval would you record their age?
20 – 29
30 – 39
40 – 49
50 – 59
60 – 69
70 – 79
80 - 89
30 – 39 Now, suppose an individual is 29 years and 2 months of age. In which interval would you record their age?
20 - 29
This implies that the 30 – 39 interval records ages greater than and equal to 29.5 and up to ages less than 39.5
Let us again consider the second interval 30 – 39 The difference between the boundaries give the class width or class size Class size/width: UCB – LCB = 39.5 – 29.5 = 10
UCB = upper class boundary
LCB = lower class boundary
The values 29.5 and 39.5 are the class boundaries of the second interval.
Midpoint
What is the midpoint of the second interval 30 – 39?
We can determine the mid point by
30, 31, 32, 33, 34, 35, 36, 37, 38, 39
So midpoint = (34 + 35)/2 = 34.5
Even easier, the midpoint is equal to the sum of the limits divided by 2
Midpoint = (30+39)/2 = 34.5
15
Cumulative Frequency Table
A derivative of a Frequency Table
*Applicable to quantitative data only
*We can accumulate the values in the left column in any of four (4) ways
–less than
–less than or equal to
–greater than
–greater than or equal to
A corresponding accumulation of the frequencies in the right column must be done
*The result is a table comprised of two columns; the cumulative frequencies are displayed in the right column.
*Care should be taken to adjust the column headings
*Rules for table construction apply
Relative Frequency Table
derivative of a Frequency Table
*Applicable for both qualitative and quantitative data
*Relative Frequency is defined as Frequency divided by Total Frequency
*Convert all Frequencies in the right column into Relative Frequencies
*The result is a table comprised of two columns; the relative frequencies are displayed in the right column.
*Care should be taken to adjust the column headings
Charts and Graphs
Once we have constructed a Frequency Table,
we may seek to create a pictorial representation
of it.
Types of Charts and Graphs
The Pie chart
*Histogram
*The Frequency Graph/Polygon
*The Ogive
*The Stem and Leaf Display
*The Box Plot
*The Scattergraph