03.04.2017 Views

CoSIT 2017

Fourth International Conference on Computer Science and Information Technology ( CoSIT 2017 ), Geneva, Switzerland - March 2017

Fourth International Conference on Computer Science and Information Technology ( CoSIT 2017 ), Geneva, Switzerland - March 2017

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Computer Science & Information Technology (CS & IT) 159<br />

Table 8. Regular expressions contained in the “Items Algorithm”<br />

Notes: The table presents the regular expressions contained in the modified “Items Algorithm” for<br />

extracting particular items from the annual report on Form 10-K. RegExes 1.1-6.1 modify the text version<br />

of a financial statement to be able to extract (clear) textual information from particular items. RegExes 7.1-<br />

7.21 represent the actual regular expressions designed to extract particular sections from the text version of<br />

the annual report.<br />

Figure 2. Examples of the extraction result of the “Items Algorithm”<br />

Notes: The figure presents extraction results from Coca Cola´s 2015 annual report on Form 10-K filed with<br />

the SEC. The first part of the figure displays Item 1A (Risk Factors) embedded in the overall 10-K section.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!