Best practices for multimodal clinical data management and integration: An atopic dermatitis research case

  • Ohta Tazro
    Medical Data Mathematical Reasoning Team, Advanced Data Science Project, RIKEN Information R&D and Strategy Headquarters, RIKEN; Institute for Advanced Academic Research, Chiba University; Department of Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University
  • Hananoe Ayaka
    Medical Data Mathematical Reasoning Team, Advanced Data Science Project, RIKEN Information R&D and Strategy Headquarters, RIKEN; Laboratory for Developmental Genetics, RIKEN Center for Integrative Medical Sciences, RIKEN; Department of Dermatology, Keio University School of Medicine
  • Fukushima-Nomura Ayano
    Department of Dermatology, Keio University School of Medicine
  • Ashizaki Koichi
    Laboratory for Developmental Genetics, RIKEN Center for Integrative Medical Sciences, RIKEN; Department of Dermatology, Keio University School of Medicine; Advanced Data Science Project, RIKEN Information R&D and Strategy Headquarters, RIKEN
  • Sekita Aiko
    Laboratory for Developmental Genetics, RIKEN Center for Integrative Medical Sciences, RIKEN
  • Seita Jun
    Laboratory for Integrative Genomics, RIKEN Center for Integrative Medical Sciences, RIKEN; Medical Data Deep Learning Team, Advanced Data Science Project, RIKEN Information R&D and Strategy Headquarters, RIKEN; Medical Data Sharing Unit, Infrastructure Research and Development Division, RIKEN Information R&D and Strategy Headquarters, RIKEN
  • Kawakami Eiryo
    Medical Data Mathematical Reasoning Team, Advanced Data Science Project, RIKEN Information R&D and Strategy Headquarters, RIKEN; Institute for Advanced Academic Research, Chiba University; Department of Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University
  • Sakurada Kazuhiro
    Advanced Data Science Project, RIKEN Information R&D and Strategy Headquarters, RIKEN; Department of Extended Intelligence for Medicine, The Ishii-Ishibashi Laboratory, Keio University School of Medicine
  • Amagai Masayuki
    Department of Dermatology, Keio University School of Medicine; Laboratory for Skin Homeostasis, RIKEN Center for Integrative Medical Sciences, RIKEN
  • Koseki Haruhiko
    Laboratory for Developmental Genetics, RIKEN Center for Integrative Medical Sciences, RIKEN
  • Kawasaki Hiroshi
    Laboratory for Developmental Genetics, RIKEN Center for Integrative Medical Sciences, RIKEN; Department of Dermatology, Keio University School of Medicine; Laboratory for Skin Homeostasis, RIKEN Center for Integrative Medical Sciences, RIKEN

Abstract

Background: In clinical research on multifactorial diseases such as atopic dermatitis, data-driven medical research has become more widely used as a means to clarify diverse pathological conditions and to realize precision medicine. However, modern clinical data are large-scale, multimodal, and multi-center, which makes them difficult to integrate and manage and limits productivity in clinical data science.

Methods: We designed a generic data management flow to collect, cleanse, and integrate the different types of data generated at multiple institutions by 10 types of clinical studies. We developed MeDIA (Medical Data Integration Assistant), software for browsing the data in an integrated manner and extracting subsets for analysis.

Results: MeDIA integrates and visualizes data and information on research participants obtained from multiple studies. It provides a sophisticated interface that supports data management and helps data scientists retrieve the data sets they need. Furthermore, the system promotes the use of unified terms, such as identifiers and sampling dates, to reduce the cost of pre-processing for data analysts. We also propose best practices for the clinical data management flow, which we learned from the development and implementation of MeDIA.

Conclusions: The MeDIA system solves the problem of multimodal clinical data integration, from complex text data such as medical records to big data such as omics data from a large number of patients. The system and the proposed best practices can be applied not only to allergic diseases but also to other diseases to promote data-driven medical research.
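The role of unified identifiers and sampling dates can be illustrated with a minimal Python sketch. It is not MeDIA's implementation, which the abstract does not describe. The record fields (`participant`, `sampled`, `modality`, `value`), the `AD-0007` identifier layout, and the accepted date formats are all hypothetical choices made for illustration.

```python
from collections import defaultdict
from datetime import date, datetime

# Hypothetical date layouts that different sites might use (assumption).
DATE_FORMATS = ("%Y-%m-%d", "%Y/%m/%d", "%d.%m.%Y")


def normalize_id(raw: str) -> str:
    """Map site-specific spellings such as ' ad-007 ' or 'AD_7' to 'AD-0007'."""
    cleaned = raw.strip().upper().replace("_", "-")
    prefix, _, number = cleaned.partition("-")
    return f"{prefix}-{int(number):04d}"


def normalize_date(raw: str) -> date:
    """Parse a sampling date written in any of the assumed layouts."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(raw.strip(), fmt).date()
        except ValueError:
            continue
    raise ValueError(f"unrecognized sampling date: {raw!r}")


def integrate(records):
    """Group measurements from all studies by (participant ID, sampling date)."""
    merged = defaultdict(dict)
    for rec in records:
        key = (normalize_id(rec["participant"]), normalize_date(rec["sampled"]))
        merged[key][rec["modality"]] = rec["value"]
    return dict(merged)


# Illustrative records only; these are not data from the study.
records = [
    {"participant": " ad-007 ", "sampled": "2021/04/01",
     "modality": "clinical_score", "value": 12},
    {"participant": "AD_7", "sampled": "2021-04-01",
     "modality": "omics_file", "value": "rna_0007.tsv"},
]
integrated = integrate(records)
```

Once both records share the same normalized key, a single lookup returns every modality collected for that participant on that date, which is the kind of pre-processing cost that unified terms are meant to remove.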
