27.08.2015 Views

AUDIT ANALYTICS AUDIT

x8YaD9

x8YaD9

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

ESSAY 5: DATA <strong>ANALYTICS</strong> FOR FINANCIAL STATEMENT <strong>AUDIT</strong>S<br />

Figure 5-9: Distribution of (Current Ratio, ROA) Distances from the<br />

Median Point<br />

Distribution of (Current Ratio, ROA) Distances from Median<br />

Where multivariate ratio analysis is used, the DA software would<br />

compute distance of the entity’s multivariate ratio point from the median<br />

point and the auditor would compare that distance with the distribution<br />

of distances (for example, figure 5-9) to determine the extent to which the<br />

entity is an outlier. Distance as measured in this multivariate example<br />

accords with our everyday notion of distance—the length of the straight<br />

line joining two points. Data science also recognizes other distance<br />

metrics as well (Starkweather 2013; Tan, et al. 2006). Because ensembles<br />

of multiple approaches have been shown to be particularly effective in<br />

identifying true outliers (Zimek et al. 2014), DA applications seeking true<br />

outliers in a multivariate ratio analysis might compute distance from the<br />

median in several different ways and then calculate a composite outlier<br />

score. For example, if four different measures are calculated, they can be<br />

normalized to lie between 0 and 1 so that the composite outlier score lies<br />

between 0 and 4, with 0 indicating that the point coincides exactly with<br />

the median and 4 indicating that the point is a maximal outlier. Scores<br />

might also be weighted in the construction of the composite.<br />

The literature on multivariate ratio analysis in auditing is thin to<br />

non-existent (Google hits: zero) and the discussion in the preceding<br />

paragraph is speculative. The distance measures of data science are<br />

principally used in applications such as text analysis, and the extent to<br />

127

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!