Descriptive Analytics: Data Management Flashcards
It refers to a scientific body of knowledge that deals with: collection of data; organization and presentation of data and analysis and presentation of data
Statistics
It is statistical procedure concerned with describing the characteristics and properties of a group of persons, places or thongs; it is based on easily verifiable facts; organize presentation, description, and interpretation of data gathered. It includes the study of relationships among variables.
Descriptive Statistics
It encompasses the set of techniques that describes what has happened in the past. It is a statistical method that is used to search and summarize historical data in order to identify patterns or meaning.
Descriptive Analytics
These are the facts and figures collected, analyzed, and summarized for presentation and interpretation
Data
It is a characteristic or a quantity of interest that can take on different values, also known as?
Variable
What are the 2 Categories of Data?
- Categorical Data
- Numerical or Continuous Data
It is where arithmetic operation cannot be performed are nominal and ordinal scales; use non parametric statistics.
Categorical Data
It consists of a finite sets of possible values having no particular order.
Ex. gender, mode of transportation, nationality, occupation, civil status.
Nominal Scales
It is a set of possible values having specific order:
Ex. pain level, social status, attitude toward a subject.
Ordinal Scales
It is numeric and arithmetic operation can be perform; ratio and interval scales; use parametric statistics.
Numerical or Continuous Data
These are measured on continuum and differences between any two numbers of known size: temperature, tons of garbage, number of arrest, income and age.
Interval Scales
These are numerical in nature and meaningful arithmetic can be done; age, weekly allowance, income of parents.
Quantitative Data
It assumes exact value only and can be obtained by counting
Ex. number of students
Discrete Data
It assumes infinite values within a specified interval and can be obtained by measurement
Ex. height
Continuous Data
These are attributes which cannot be subjected to meaningful arithmetic.
Ex. gender
Qualitative Data
It is where the data can be categorized in several ways based on how they are collected and the type collected.
Population and Sample Data
It is not feasible to collect data from the population of all elements of interest. In such instances, we collect data from a subset of the population known as?
Sample / Sample Data
These data are collected from several entities at the same, or approximately the same, point in time.
Cross-Sectional Data
These are the data collected over several time periods. Graphs of data are frequently found in business and economic publications. Such graphs help analysts understand what happened in the past, identify trends over time, and project future levels for the time series.
Time Series Data
It is where a variable of interest is first identified. Then one or more other variables are identified and controlled or manipulated so that data can be obtained about how they influence the variable of interest.
Experimental Study
The studies make no attempt to control the variables of interest. A survey is perhaps the most common type of observational study.
Non-experimental or Observational
This is where the data gathered shall be presented; analyzed and interpreted that can be easily understood by the reader.
Presentation of Data
It is presented in paragraph or in sentences are said to be in textual form.
Textual Data
It uses statements with numerals in order to describe the data for the concrete information and in expository form. It is to discuss the data and the information and interpretation it carries.
Textual Presentation
This is a table which shows data arrange into different classes, and the number of cases which fall into each class.
The Frequency Distribution Table
It uses statistical table to directly display the quantities or values collected as data.
Tabular Presentation
It adds life and beauty to one’s work, but more than this, it helps facilitate comparisons and interpretation without giving through the numerical data. These are devices that help minimize the “thinking through” process as one analyzes statistical or quantitative data.
Graphs / Graphical
It illustrates data in a form of a graph, aiding readers to understand the text easily, A graph is the most attractive, effective and convincing way. There are various types of graphs we can prepare like bar graph, line graph and pictograph.
Graphical Presentation
It represents by either vertical or rectangular rectangles whose bases represent the class intervals, and whose height represents the frequencies, It is used for discrete variables.
Bar Chart
It is a circle graph showing the proportion of each class, through the relative or percentage frequencies. Legends are used to provide a clearer distinction of categories and types (of business firms for example). Graphs complement the tabular presentation of data.
Pie Chart
It is a type of chart used to show information that changes over time. These are created by plotting a series of several points and connecting them with a straight line, and used to track changes over short and long periods.
Line Chart
It is where a grouping of the data into categories showing the number of observations in each of the non-overlapping classes.
Frequency Distribution
It refers to the data gathered where it should be properly organized in to grouped data called frequency distribution.
Grouped Data
It can be used to provide estimates of the relative likelihoods of different values of a random variable.
Percent Frequency Distribution
It is a tabular summary of data showing the relative frequency for each bin.
Relative Frequency Distribution
It refers to a graph in which the classes are marked on the horizontal axis (x axis ) and the class frequencies on the vertical axis (y axis). It focusses on the frequency for each class and sacrifices whatever information is contained in the actual observation.
Histogram
It is a graph that displays the data using points which are connected by lines. The frequencies are represented by the heights of the points at the midpoint of the classes.
Frequency Polygon
It shows the number of data items with values less than or equal to the upper class limit of each class.
Cumulative Frequency Distribution
It is a graph that displays the cumulative frequencies or the classes in a frequency distribution.
Cumulative Frequency Polygon