The Audit Process: Use of Statistics (18) Flashcards
Mean
Average of numerical dataset - sum of values divided by number of items, denoted with μ
Median
Central point in the dataset when arranged in ascending order
Mode
Most frequently occurring number in dataset
Outlier
Data point significantly different from other data points in dataset (might indicate misstatement)
Standard Deviation
Measure of dispersion of dataset showing average distance between values and the mean, denoted with σ. Datasets with a wider spread will have a higher standard deviation.
Benford’s Law
Probability distribution of likelihood of first digit in set of numbers (1 is most common first digit in a dataset, descending to 9 - requires large dataset)
Regression analysis
A statistical method for estimating the relationship among variables based on past relationship.
The aim is to understand the relationship between an independent variable and a dependent variable and is often performed through ADA.
What happens when correlation is 0?
There is no correlation between the two factors.
Dependent variable
The variable we are trying to understand or predict
Independent variable
The variable that may impact dependent variable
Regression line
Statistically determined relationship between dependent and independent variables in historical data that we can compare new data against to identify anomalies.
How can Benford’s Law be applied within the audit process?
It can be used to identify where anomalies (including incidents of fraud) appear in the dataset. Allowing the auditor to identify a misstatement in relation to error or fraud during the audit process.
How can regression analysis be applied within the audit process?
Historical data can be collected from a client’s (financial data and non-financial data) system, allowing the auditor to understand a historical relationship between variables.
The variables will be plotted on a scatter chart and a line (regression line) will be drawn through the middle.
The current year data can be compared with this line and any outliers that appear may indicate misstatements in the dependent variable being audited (Sales being overstated or cost of sales being understated).
Correlation
Correlation is the statistical association or relationship between two variables. Calculated between +1 and -1.
Correlation of 0 indicates no relationship.
Correlation of +1 indicates a strong relationship
Correlation of -1 indicates an inverse relationship.
Correlation vs Causation
Important to remember that correlation does not necessarily imply causation.
Both variables could alter dependent on a shared independent variable i.e. sales of Christmas trees and mulled wine are correlated but there is no causation between them.