29.12.2020 Views

OUR WORLD - Perspectives on Artificial Intelligence

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

PERSPECTIVES ON ARTIFICIAL INTELLIGENCE<br />

disincentives, but is well established EU policy in many of the network industries. Under what c<strong>on</strong>diti<strong>on</strong>s would the benefits of forced<br />

sharing outweigh the costs?<br />

The emergence of ML as a general-purpose technology raises difficult empirical and normative questi<strong>on</strong>s. Does the relati<strong>on</strong>ship between<br />

data accumulati<strong>on</strong> and ec<strong>on</strong>omic returns give data-rich incumbents a significant and self-reinforcing advantage? Are competiti<strong>on</strong> authorities<br />

equipped to discern and analyse data-driven m<strong>on</strong>opolistic returns? These questi<strong>on</strong>s are high <strong>on</strong> the new European Commissi<strong>on</strong>’s agenda,[24]<br />

and for good reas<strong>on</strong>s. M<strong>on</strong>opolistic behaviour by ML providers could slow the adopti<strong>on</strong> of technology critical for EU competitiveness,<br />

especially hitting those smaller firms that lack the knowledge and resources to build alternative capacity in-house. If technological revoluti<strong>on</strong>s<br />

are distributi<strong>on</strong>al earthquakes, competiti<strong>on</strong> authorities should work to ensure that every<strong>on</strong>e lands <strong>on</strong> their feet.<br />

[1] This post focuses <strong>on</strong> ML applicati<strong>on</strong>s.<br />

[2] See https://medium.ec<strong>on</strong>omist.com/will-big-data-create-a-new-untouchable-business-elite-8dc23bcaa7cb<br />

[3] DG COMP 2019, citing https://medium.com/machine-intelligence-report/data-not-algorithms-is-key-to-machine-learning-success-69c6c4b79f33, https://www.<br />

edge.org/resp<strong>on</strong>se-detail/26587, and http://www.spacemachine.net/views/2016/3/datasets-over-algorithms.<br />

[4] While this post focuses <strong>on</strong> issues pertaining to the volume of data, other characteristics of data are just as important for generating value. These include the<br />

other so-called ‘4Vs’ of data: volume, but also velocity (i.e. frequency), variety (e.g. administrative data, social media data, pictures, etc), and veracity (i.e. representative<br />

of the target populati<strong>on</strong>, free of bias, etc). For a firm to have a competitive advantage over these other characteristics can also generate important ec<strong>on</strong>omic benefits.<br />

For the purpose of this blog, I note that securing a sufficient volume of data appears to be necessary but not sufficient to having a competitive AI/ML business.<br />

[5] Varian (2018) and Bajari et al. (2018)<br />

[6] In particular, Bajari et al. (2018) find that the length of histories is robustly helpful in improving the demand forecast quality, but at a diminishing rate; whereas<br />

the number of products in the same category is not (with a few excepti<strong>on</strong>s where it exhibits diminishing returns to scale).<br />

[7] Agrawal, Ajay, Joshua Gans, and Avi Goldfarb. 2018a. Predicti<strong>on</strong> Machines: The Simple Ec<strong>on</strong>omics of <strong>Artificial</strong> <strong>Intelligence</strong>. Cambridge, MA: Harvard Business<br />

Review Press.<br />

[8] As related in Goldfarb et al. (2018)<br />

[9] See the deal’s press release: https://news.microsoft.com/2009/07/29/microsoft-yahoo-change-search-landscape/<br />

[10] See https://www.cnet.com/news/googles-varian-search-scale-is-bogus/<br />

[11] See Hal Varian in a CNET interview: “the scale arguments are pretty bogus in our view” (https://www.cnet.com/news/googles-varian-search-scale-is-bogus/)<br />

[12] “the amount of traffic that Yahoo, say, has now is about what Google had two years ago” and “when we do improvements at Google, everything we do<br />

essentially is tested <strong>on</strong> a 1 percent or 0.5 percent experiment to see whether it’s really offering an improvement. So, if you’re half the size, well, you run a 2 percent<br />

experiment.” Source: ibid<br />

[13] i.e. in performing tasks such as crawling, index, or ranking.<br />

[14] The European Commissi<strong>on</strong>’s DG COMP made similar claims in the c<strong>on</strong>text of the Google Shopping case. DG COMP claimed that general search service has<br />

to receive at least a certain minimum volume of queries in order to improve the relevance of its results for uncomm<strong>on</strong> queries because users evaluate the relevance<br />

of a general search service <strong>on</strong> the basis of both comm<strong>on</strong> and uncomm<strong>on</strong> queries. See para. 288 of the EC decisi<strong>on</strong> (https://ec.europa.eu/competiti<strong>on</strong>/antitrust/cases/<br />

dec_docs/39740/39740_14996_3.pdf)<br />

[15] Cockburn et al. (2019).<br />

[16] An podcast episode from the Ec<strong>on</strong>omist brings this point to life (https://www.ec<strong>on</strong>omist.com/podcasts/2019/10/09/the-promise-and-peril-of-ai)<br />

[17] See Schaefler and al. (2018): “In perhaps no other market has the questi<strong>on</strong> of the role of data stirred such a vivid discussi<strong>on</strong> am<strong>on</strong>g industry participants,<br />

academic experts, and policy advocates than in general internet search.”<br />

[18] Glen Weyl and Eric Posner, 2018. Radical Markets<br />

[19] https://cs.stanford.edu/people/eroberts/cs181/projects/1997-98/microsoft-vs-doj/ec<strong>on</strong>omics/returns.html<br />

[20] See Calvano et al. (2020) for a survey of the literature around these issues in digital markets.<br />

[21] Commissi<strong>on</strong>er Terrell McSweeny, Opening Remarks for a Panel Discussi<strong>on</strong>, “Why Regulate Online Platforms?: Transparency, Fairness, Competiti<strong>on</strong>, or<br />

Innovati<strong>on</strong>?” at the CRA C<strong>on</strong>ference in Brussels, Belgium, at 5 (Dec. 9, 2015), https://www.ftc.gov/system/files/documents/public_statements/903953/mcsweeny_-_<br />

cra_c<strong>on</strong>ference_remarks_9-12-15.pdf.<br />

[22] includes data generated <strong>on</strong>line and by IoT and c<strong>on</strong>nected devices. Source: Word Ec<strong>on</strong>omic Forum citing Rac<strong>on</strong>teur (https://www.weforum.org/agenda/2019/04/<br />

how-much-data-is-generated-each-day-cf4bddf29f/)<br />

[23] Note that a range of issues lie at the intersecti<strong>on</strong> of privacy and competiti<strong>on</strong>, including data ownership, reuse, transparency, sharing. These issues are bey<strong>on</strong>d<br />

the score of this post and will not be explored here.<br />

[24] See the missi<strong>on</strong> statement of European Commissi<strong>on</strong> President Ursula v<strong>on</strong> der Leyen, which instructs Margarethe Vestager: “In the first 100 days of our<br />

mandate, you will coordinate the work <strong>on</strong> a European approach <strong>on</strong> artificial intelligence, including its human and ethical implicati<strong>on</strong>s. This should also look at how<br />

we can use and share n<strong>on</strong>-pers<strong>on</strong>alised big data to develop new technologies and business models that create wealth for our societies and our businesses.” (https://<br />

ec.europa.eu/commissi<strong>on</strong>/sites/beta-political/files/missi<strong>on</strong>-letter-margrethe-vestager_2019_en.pdf)<br />

Anders<strong>on</strong>, J. (2020), ‘The dynamics of data accumulati<strong>on</strong>’, Bruegel Blog, 11 February.<br />

https://bruegel.org/2020/02/the-dynamics-of-data-accumulati<strong>on</strong>/<br />

<str<strong>on</strong>g>OUR</str<strong>on</strong>g> <str<strong>on</strong>g>WORLD</str<strong>on</strong>g> | January 2021<br />

45

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!