Final Review Flashcards
What is Big Data?
Large & complex data sets
4 Vs
Volume
Variety (data sources of unstructured/structured data)
Velocity
Veracity (data quality - clean & credible)
Pros/Cons of Structured Data
Hard to collect
Limited Insights
Affordable
Active participation
Transparent
Pros/Cons of Unstructured Data
Easy to collect
Pricy
Unlimited Insights
Presence
Lack of transparency
What is Analytics?
ETL data to gain valuable insights to inform decision making
Requires critical thinking & judgement
IMPACT
I - Identify questions
M - Master the data
P - Perform the test plan
A - Address & refine results
C - Communicate insights
T - Track Outcomes
Descriptive
What happened?
Diagnostic
WHY did it happen? Root causes?
Prescriptive
What if scenarios
Optimize performance based on constraints
Predictive
What WILL happen (future outlook)? Probability? Forecasting
MASTER THE DATA
Appropriateness - can the data answer the questions
Accessibility - cost of acquisition, sources of data
Reliability - data integrity (accurate, valid, consistent)
Financial Accounting Data Sources
- XBRL.gov
-SEC EDGAR
-Company websites/press releases
-Fee based databases: Dow Jones, CRSP
-Internal data - journal entries, general ledger, subledgers
Audit Data Sources
- PCAOB (audit regulators)
- Auditor Search
-Audit Analytics (audit report, fees, restatement data) - Firm transparency reports - insight into audit culture
Managerial Accounting Data Sources
- Budget Variance
- Point of Sale Transaction
- Potential cost drivers
- Supply chain
- CRM, HRM, ERM
Other Relevant Data Sources
- Government Data (GDP, CPI, Census)
- Sustainability Reports
- Current & Historical Stock Prices
- Earnings Forecast
Alternative Data
- Social media
- Cell phone location
- Geospatial
- Employee Sentiments (Glassdoor)
- Foot traffic
What is Blockchain?
A decentralized digital ledger that records transactions
(Visibility for all parties on all transactions occurring on the same chain that is solidified by a hash - unable to go back to alter data)
Benefits of Blockchain
- Verified transactions
- Almost impossible to manipulate data
Limitations of Blockchain (Benefits of Relational Databases)
-Centralization of data
- Limitation of access to particular data tables
- Embedded checks through linking of tables with PK & FK
Delimiter
Smith | David or Smith, David
(Intentional separation of values for table column headings)
Qualifier
“Property, Plant and Equipment”
(Double quotes indicate keeping the text together)
Categorical Data
Data divided by grouping (composed of nominal and ordinal data)
Nominal Data
Gender, eye color, dates, account #
Ordinal data
Ranking (gold, silver, bronze)
Numerical data
Used for calculations (composed of interval & ratio)
Interval Data
No “absolute zero” –> temperature (0 degrees does not mean there is no temperature anymore)
Ratio Data
Defined zero value (sales, net income)
Skewed right
tail is to the right which is driven by outliers (mean > median)
Skewed left
tail is to the left (mean < median)
Correlation
measure relationship between 2 variables that ranges from -1 to 1
p-value > 0.05
fail to reject null hypothesis - not statistically significant
p-value < 0.05
reject null hypothesis - statistically significant
R^2
fit of data (increased R^2 = good fit)
Regression Analysis
Diagnostic Test (measure relationship) or Predictive Test (estimation of dependent variable value based on independent variable inputs)
Linear Regression Formula
y = B1X1 + y-intercept + error
Descriptive Analytics Tests
- Sums, min/max, standard deviation, counts, difference in means
- horizontal & vertical analysis
- ratio analytics
- Dupont analysis
- IQR
Diagnostic (Drill down or Root Cause) Tests
- Hypothesis Testing (Regression)
- Clustering
-Grouping/Filtering in PivotTables - Benford’s Law
- Fuzzy Matching
- Identify outliers/anomalies through IQR
Prescriptive Tests
-What-if scenarios
-Sensitivity analysis
Predictive Tests
-Regression Analysis
Where in the audit process could you use ADA?
-Risk assessment phase –> fraud risk/material misstatements
- Testing phase –> tests of controls, substantive tests
Common Types of Analytics in Management Accounting
Descriptive: KPI, Clustering suppliers, customers, locations
Diagnostic: Comparing KPIs, Budget Variance, Regression (cost behavior)
Prescriptive: sensitivity, capital budgeting, goal-seek
Predictive: forecasting
Cost Behavior (Regression Analysis)
Management’s attempt to understand how operating costs change in relation to a change in organizational activity
Variable (independent) + Fixed (y-intercept)
Capital Budgeting
process of analyzing/deciding which long-term investment to make using NPV or IRR
NPV
used to determine current value of all future cash flows generated by a project + initial capital investment
IRR
time-adjusted rate of return for an investment
(NPV = 0 or IRR > r –> making $ or break even)
In a balanced scorecard, what should be aligned with strategic goals of the organization?
Objectives
Common Types of Analytics in Financial Accounting
Descriptive: horizontal, vertical & ratio analysis
Diagnostic: benchmarking/comparative analysis, DuPont
Prescriptive: sensitivity analysis
Predictive: bankruptcy predictions
XBRL
tagging & reporting financial information in a computer readable format
XBRL Taxonomy
defines/describes each key element & relationship between each element
Strengths of XBRL
tagging allows data to be quickly transmitted & can extend taxonomy to include custom tags
Weaknesses of XBRL
Concerns regarding data quality (accuracy, consistency, reliability?)
All of the following are ways XBRL tags can be useful except for:
helping auditors assess where accounting errors may occur
helping financial analysts value a company
helping regulators determine if audit firms are in compliance with audit standards (correct)
helping regulators see if companies are in compliance with regulations
Liquidity
measures short-term ability of company to pay its maturing obligations & meet unexpected needs of $
Current ratio
short-term debt paying ability
Cash debt coverage ratio
short-term debt paying ability on a cash basis
Receivables Turnover Ratio
of times a company collects receivables a year
Average Collection period
converts RTR into days
Inventory Turnover ratio
of times a inventory was sold a year
Days in Inventory
Avg # of days inventory is held
Solvency
measures the ability of a company to survive for a long duration
Debt to Assets Total Ratio
% of total assets provided by creditors
Times Interest Earned Ratio
company’s ability to meet interest payments
Cash Debt Coverage Ratio
long-term debt paying ability on a cash basis
Free cash flow
cash available to pay dividends or expand operations
Profitability
measure income or operating success of a company
Return on Stockholder’s Equity
$ of net income earned/$ invested
Return on assets
overall profitability of assets
Profit Margin ratio
net income generated by each $ of sales
Asset turnover
how efficiently assets are used to generate sales
Gross Profit Rate
margin between selling price & COGS
Earnings per Share
net income earned on each dollar of common stock
P-E Ratio
increase means investors believe company future earnings will grow
Payout Ratio
% of earnings distributed in cash dividends
Basic DuPont Model
ROE = Profit Margin x Total Asset Turnover x Financial Leverage (higher value –> more risk as reliant on debt for funding)
Increased ROE
increase in PM, TAT or FL
Decreased ROE
decrease in PM, TAT, or FL
Bankruptcy Classification (Altman’s Z-score)
Z = 1.2X1 + 1.4X2 + 3.3X3 + 0.6X4 + 1.0X5
Variable Meanings
X1 = liquidity level
X2 = long-term profitability
X3 = short term profitability
X4 = solvency
X5 = asset efficiency (TAT)
If Z < 1.80
Significant risk of bankruptcy - “distress zone”
If 1.8 <= Z >= 3.0
At risk of bankruptcy - “gray zone”
If Z > 3.0
Not at risk - “safe zone”
A data point with which of the following z-scores would likely alert an auditor to the existence of a potential outlier that warrants further scrutiny?
3.5
RPA
-Form of digital labor
-Software robots to automate repetitive tasks (scanning, reading docs, downloading/merging files, converting currency)
-Less flexible
-Does not make decisions
-Simple & quick implementations to deploy
AI
-Tasks that require human intelligence
-Capable of tasks that require cognitive abilities, recognizing patterns, and making predictions
-Adapt and learn from new data
-Complex/resource intensive
-Make decisions based on data analysis/learned patterns