Scientific Concept of the National Cohort (status ... - Nationale Kohorte
Scientific Concept of the National Cohort (status ... - Nationale Kohorte
Scientific Concept of the National Cohort (status ... - Nationale Kohorte
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
A.4<br />
A.4 Integrated Data Management<br />
cost <strong>of</strong> setting up and maintaining a greater number <strong>of</strong> specialized units each with <strong>the</strong>ir own<br />
infrastructure. Currently planned competence centers are:<br />
� Imaging: Handling <strong>of</strong> all MRI, DXA, ultrasound, and echocardiography data. This includes<br />
providing teleradiology services to different places where experts on specific<br />
parts <strong>of</strong> <strong>the</strong> images are located; running separate competence centers for this would<br />
not be cost efficient.<br />
� Physical activity and ECG: Both <strong>of</strong> <strong>the</strong>se instruments require application <strong>of</strong> algorithms<br />
on <strong>the</strong> collected data to derive characteristic values. For this reason, <strong>the</strong>y could be<br />
handled toge<strong>the</strong>r in one competence center with a common infrastructure and supported<br />
by <strong>the</strong> same IT personnel. While <strong>the</strong> algorithms are relatively well-established<br />
for ECG, methods for deriving measures <strong>of</strong> physical activity from accelerometry data<br />
are in a more experimental state and will probably be improved continuously and reapplied<br />
multiple times while <strong>the</strong> study is running.<br />
� Secondary data: The selection <strong>of</strong> external sources and data elements, <strong>the</strong> collection<br />
<strong>of</strong> data from those sources into temporary storage, and <strong>the</strong> mapping <strong>of</strong> such data<br />
into <strong>the</strong> common data model <strong>of</strong> <strong>the</strong> study database requires not only a considerable<br />
amount <strong>of</strong> manual work but also specialized hardware and s<strong>of</strong>tware. Possible procedures<br />
for <strong>the</strong> use <strong>of</strong> health insurance data and social insurance data are described in<br />
Sect. A.7.2.3 <strong>of</strong> this document.<br />
� Biobanking: A biobank is similar to a competence center as it uses special infrastructure<br />
and processing, even though not (only) for data but for biosamples.<br />
In addition to <strong>the</strong>se proposed competence centers, a number <strong>of</strong> competence panels will be<br />
set up for <strong>the</strong> quality management <strong>of</strong> data collection instruments not covered by competence<br />
centers and also in <strong>the</strong> field <strong>of</strong> disease-specific follow-up.<br />
A.4.2.4 Trust center<br />
The trust center generally takes on all tasks in which personal data <strong>of</strong> <strong>the</strong> study subjects<br />
need to be handled at <strong>the</strong> national level:<br />
� Generation <strong>of</strong> unique pseudonyms for (potential) study subjects: Although it is possible<br />
to generate pseudonyms at <strong>the</strong> study centers and make <strong>the</strong>m unique, for example,<br />
by adding a study center ID, central generation <strong>of</strong> pseudonyms is considered as <strong>the</strong><br />
preferred alternative because it <strong>of</strong>fers <strong>the</strong> additional possibility to recognize and avoid<br />
repeated entry <strong>of</strong> a single person via different study centers with different pseudonyms<br />
(so-called synonyms); such synonyms become more likely when sampling is<br />
done in multiple tranches. For <strong>the</strong> best effect, potential participants will be checked for<br />
double entry immediately after sampling and before contacting <strong>the</strong>m.<br />
� Central back-up storage <strong>of</strong> study participant data: Personal identifying data for <strong>the</strong><br />
study subjects are kept at <strong>the</strong> trust center as a back-up for cases <strong>of</strong> data loss in <strong>the</strong><br />
study centers. Especially in later phases <strong>of</strong> <strong>the</strong> study, <strong>the</strong>se data may be useful for<br />
reidentifying participants (e.g., in case <strong>of</strong> incidental findings).<br />
� Data linkage with external sources at <strong>the</strong> national level: External sources <strong>of</strong> secondary<br />
data that operate at a national level must also be queried centrally (ra<strong>the</strong>r than by<br />
each <strong>of</strong> <strong>the</strong> study centers individually). Such queries must contain personal identifying<br />
information about <strong>the</strong> study subjects, which must not be stored centrally toge<strong>the</strong>r with<br />
medical study data. Therefore, queries for data from external sources are prepared<br />
by <strong>the</strong> competence center for secondary data and supplemented afterwards with personal<br />
identifying information about <strong>the</strong> respective subjects by <strong>the</strong> trust center (which,<br />
however, never sees <strong>the</strong> replies to those queries). See Sect. A.7.2.3 for details.<br />
150