Organization and presentation of data Flashcards
Forms of arranging data
Arrays
Frequency tables
Stem and leaf plot
Tabulations: Simple tables and cross tabulation
Arrays
Simple arrays
Frequency arrays
Simple arrays
It is an arrangement of data in an ascending or descending order
Frequency arrays
It is a series formed on the basis of frequency with which each item is repeated in a series.
Steps in constructing a frequency array
- Prepare a table with three columns
- Put the item in the first column in an ascending order such that one item is recorded only
once - Prepare the tally sheet in the second column marking one bar for an item. Make blocks of
five tally bars to avoid mistakes in counting. Every fifth bar is shown by crossing the first
four bars like ////. - Count the tally bar and record the total number in the third column.
Main limitation: it does not give the idea of the characteristics of groups. For example, it does not tell us how many students have obtained marks between 45 -50. It’s not possible to
compare characteristics of different groups
Frequency distribution tables
- Lists categories of items along with their corresponding frequencies.
- The frequency for a particular category or class is the number of original items that fall into that class.
- The classes or categories refer to the groupings of a frequency table.
- The range is the difference between the highest value and the lowest value.
(R = highest value – lowest value). - The class width or class interval is the difference between two consecutive lower class limits or
class boundaries. - The class limits are the smallest or the largest numbers that can be included in the class: Lower class limits represents the smallest data value that can be included in the class. Upper class limits represents the highest data value that can be included in the class.
- The class boundaries are used to separate the classes so that there are no gaps in the frequency distribution. They are obtained by increasing the upper class limits and decreasing the lower
class limits by the same amount so that there are no gaps between consecutive under classes. The
amount to be added or subtracted is 0.5 the difference between the upper limit of one class and the lower limit of the following class. - The class limits should have the same decimal place value as the data,
but the class boundaries should have one additional place value. - Class marks/mid value/mid points is the average value of two limits of a point.
Guidelines for constructing a frequency distribution table
- The classes must be mutually exclusive. That is, each score must belong to exactly one class.
- The classes must be exhaustive meaning there should be enough classes to accommodate
all data. Include all classes, even if the frequency might be zero. - All classes should have the same width, to avoid a distorted view of data. One exception
occurs when a distribution has a class that is open – ended such as “65 years or older”. - The number of classes should be between 5 and 20.
Process of Constructing a Frequency Table
- Determine the range. (R = Highest Value – Lowest Value).
- Determine the tentative number of classes (k). The number of classes is the least value of k such that 2^k>N. This involves some trying for different values of k e.g. if N=45 we can try 2^2=32 , this is less than 45, 2^6=64 which satisfies the condition 2^k>N therefore k=6.
- Find the class width by dividing the range by the number of classes.(Always roundoff).
- Write the classes or categories starting with the lowest score. Stop when the class already includes the highest score.
- Determine the frequency for each class by referring to the tally columns and present the results in a table.
Limitation of frequency distribution table
, while the frequency of each class is easy to see, the original data points have been lost.
Stem-and-Leaf Plots
It shows us potential patterns in the responses that may not be apparent in the original listing of the data
Steps of constructing a stem and leaf plot
- Find the least number and the greatest number in the data set.
- Draw a vertical line and write the digits in the tens places from 1 to 3 on the left of the
line. The tens digit form the stems. - Write the units digit to the right of the line. The units digits form the leaves.
- Rewrite the units digits in each row from the least to the greatest.
- Include an explanation. To analyze a stem and leaf plot look for peaks, gaps and symmetry.
Tabulation:
Simple (one way)
Cross tabulation
A table is a systematic arrangement of statistical data in columns and rows.
Rows are horizontal arrangements while columns are vertical arrangements.
The purpose of a table is to simplify the presentation and to facilitate comparison.
Essential parts of a table
Table number Title of the table Caption (Headings of columns) Stub (Headings of rows) Body Head note (unit of measurement of data) Foot note
Simple (One-Way) Tables
only one characteristic is shown.
Cross Tabulation
More than one characteristic is included in a table. They are a good way to compare two
subgroups of information.
Cross tabs allow you to compare data from two questions to determine
if there is a relationship between them.
Cross tabs are used most frequently to look at answers to a question among various demographic
groups. The intersections of the various columns and rows, commonly called cells, are the
percentages of people who answered each of the responses.