A study of Patent Thickets (1.31Mb) - UK Intellectual Property Office
A study of Patent Thickets (1.31Mb) - UK Intellectual Property Office
A study of Patent Thickets (1.31Mb) - UK Intellectual Property Office
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
72 A Study <strong>of</strong> <strong>Patent</strong> <strong>Thickets</strong><br />
Since patent records do not include the registered number <strong>of</strong> a company even if the applicant<br />
is a registered business, it is not possible to merge data sets using a unique firm identifier;<br />
instead, applicant names in the IP documents and firm names in FAME have to be matched.<br />
Both a firm’s current and previous name(s), were used for matching in order to account for<br />
changes in firm names. Matching on the basis <strong>of</strong> company names requires names in both data<br />
sets to be `standardized’ prior to the matching process in order to ensure that small (but <strong>of</strong>ten<br />
systematic) differences in the way names are recorded in the two data sets do not impede the<br />
correct matching. For more details on the matching see Helmers et al. (2011).<br />
7.2 Methodological analysis <strong>of</strong> the triples<br />
measure<br />
This appendix contains results <strong>of</strong> a statistical test <strong>of</strong> the triples measure similar to that provided<br />
by Milo et al. (2002; 2004a) . As discussed in Section 3.2.2 above counts <strong>of</strong> triples have been<br />
shown to be statistically significant measure <strong>of</strong> the structure <strong>of</strong> networks in a number <strong>of</strong> contexts<br />
such as the World Wide Web. To date such an analysis has not been performed in the context<br />
<strong>of</strong> patent data. The aim <strong>of</strong> the analysis is to determine whether triples or any other network motif<br />
are likely to arise with such frequency randomly that their occurrence in the real data we analyze<br />
cannot be interpreted as a signal <strong>of</strong> any kind <strong>of</strong> real structure.<br />
We use the FANMOD s<strong>of</strong>tware developed by Wernicke and Rasche (2006) to count and test the<br />
significance <strong>of</strong> various network motifs. The test <strong>of</strong> the significance <strong>of</strong> these motifs is undertaken<br />
by comparing the frequency <strong>of</strong> occurrence <strong>of</strong> a given motif in the data obtained from the EPO<br />
with the occurrence <strong>of</strong> the motif in 1000 comparison datasets obtained by perturbing the original<br />
data randomly. We set the parameters <strong>of</strong> the s<strong>of</strong>tware such that the randomly created<br />
comparison data preserve the characteristics <strong>of</strong> the original data as closely as possible.<br />
To perform this test we segmented patent applications at the EPO into 34 technology areas<br />
based on the 2008 version <strong>of</strong> the ISI-OST-INPI technology classification (Schmoch, 2009). We<br />
then used data on critical references in three <strong>of</strong> these areas (Telecommunications, Optics and<br />
Medical Technology) for the periods 1997-1999 and 2003-2005 to analyze the significance <strong>of</strong><br />
different network motifs as measures that could characterize the structure <strong>of</strong> the underlying<br />
data. We chose these three technology areas because it is well known that Telecommunications<br />
is affected by a patent thicket, while we expect Optics and Medical Technology to be much less<br />
affected. In the next subsection we show that Optics presents an intermediate case between<br />
Telecommunications and Medical Technology.<br />
Area Years Z-Score<br />
Medical Technology 1997-99 115.6<br />
Optics 1997-99 24.8<br />
Telecommunications 1997-99 9.7<br />
Medical Technology 2003-05 81.6<br />
Optics 2003-05 39.2<br />
Telecommunications 2003-05 9.0<br />
[Table 7.1: Z-Scores for the Triples Measures in Selected Technology Areas]