Quantitative Methods - Sources of Data Flashcards

Question

What is categorical or nominal data?

Answer 1

Data classified into a number of distinct categories. Collection of this data is seen on census forms and market research questionnaires, where a box is ticked in response to questions such as 'which of the following newspapers do you read (followed by a list of popular dailies), do you drive a car, did you vote in the last election etc When processing this data on a computer, we may assign a number to each band. However this number does not convey any other information and cannot be used to calculate such statistics as the standard deviation. Such data can only be used as simple statistic, such as 30% of people drive cars

Answer 2

Data classified into a number of distinct ranked categories. The star system for hotels is an example or the classification of university degrees When assigning numbers to this type of data for processing, care should be taken in trying to draw statistical conclusions. Standard deviation would be inappropriate and only such measures which are based on the position within the order, such as the median (also covered later), should be considered.

Answer 3

It groups data into bands of specific value and displays the frequency of occurrence of arch band. Tabulating into a frequency distribution represents a very powerful way of presenting and summarising data, though care needs to be taken in the selection of the size of the bands.

Answer 4

It displays the same data as a percentage of the sample or population size, rather than as actual observed frequencies A relative frequency distribution would, possibly, be more appropriate where a more direct comparison between ban dings is desired or where the sample size has been exceptionally large and the scale of the numbers may obscure their understanding Relative frequency distribution is useful for determining the relative historical frequency of occurrence that may, in turn, be useful for determining the probable future distribution. In this context it may be referred to as a probability distribution. In probability occurrence pathetic sum of all the probabilities must add up to to 1 or 100%. Probability and relative frequency are synonymous, for example the probability of a fair coin landing on heads when tossed is 0.5 or 50%, there are two sides and it will land on them with equal frequency.

Answer 5

Shows the number/percentage of a sample or population with a value less than or equal to a given figure. It can be used in addition to either frequency distribution or relative frequency distribution.

Answer 6

A popular non-random sampling technique where a sample is selected which is believed to be representative of the full population. Help for this may be obtained from census info which enables us to get a picture of the proportion of the population exhibiting a range of characteristics. A sample can then be selected which displays these characteristics and, hence, should be reasonably representative. This is the typical approach used in market research

Answer 7

Selection is very important It is perhaps an even more acute problem when we are considering continuous data where we must be very careful to ensure that all items are included within a band, but only within one band. Data must be described as thus. Greater than or equal to 20 but less than 24 etc

Answer 8

The detail in numbers may obscure the understanding slightly. It is possible that the same level of info can be conveyed more easily by charts and tables

Answer 9

Method of representing relative frequency by dividing a circle into sections whose area is proportionate to the relative frequency. It is of most use when communication categorical data. 1% represents 3.6 degrees on a pie chart

Answer 10

A chart which represents through the height of the bar, he number or percentage of items displaying a particular characteristic.

Answer 11

Convey more info than a bar chart where each column is broken down to show e.g the number of stores in the north who sold between 20-29, 30-39 etc whilst still showing the total number of stores in the north

Answer 12

It displays the number or percentage of items falling within a given band through the area of a bar. Usually a histogram is used to describe circumstances where one bar is used to represent a range of values for continuous data. Where discrete data is grouped, it may be represented as a histogram as if it were continuous.

Answer 13

When extreme bands are described as greater or less than something. Bands must be bounded, but what width should they be made. There are no rules for this and require judgement by the researcher. If a definite upper and lower limit is known then, these will provide obvious bounds. If they are unknown, a bound must be assumed since this histogram cannot be drawn unless all areas are bound.

Answer 14

How tall they need to be made. If any bands are wider than other, their heights will have to be scaled down proportionally to ensure that the area of that band still reflects the number of items. This problem is most likely to arise in the context of extreme bands however it may also be applicable to other bands of differing width

Answer 15

The x-axis is the horizontal one and the y-axis is the vertical one

Answer 16

The variable thought to be responsible for causing the change

Answer 17

Independent variable

Answer 18

The variable whose value is driven by the x value and whose change we are seeking to predict

Answer 19

The dependent variable

Answer 20

The independent variable is plotted along the x-axis whilst the dependent variable is plotted on the y-axis

Answer 21

A frequent requirement is plot how something has changed with time. Here the item alters with time, meaning the item is the dependent variable and time is the independent one. Time can never be altered and therefor is always the independent variable and always on the x-axis

Answer 22

They do not lend themselves very well to extrapolation, or predicting forward, meaning this may not be the preferred presentation and a semi-logarithmic graph may be more useful

Answer 23

The value should be representative of the data being graphed

Answer 24

It plots the log of the value instead of the value itself on the y-axis

Answer 25

If something is growing at a constant rate it will appear as a straight line which is much more useful for prediction purposes. Any move away from steady growth would be shown. If growth increased the line would get steeper whilst flatter is growth decreased

Quantitative Methods - Sources of Data Flashcards

(49 cards)