SHI7 Is a Self-Learning Pipeline for Multipurpose Short-Read DNA Quality Control

  • Gabriel A. Al-Ghalith
    Bioinformatics and Computational Biology, University of Minnesota—Twin Cities, Minneapolis, Minnesota, USA
  • Benjamin Hillmann
    Computer Science, University of Minnesota—Twin Cities, Minneapolis, Minnesota, USA
  • Kaiwei Ang
    Computer Science, University of Minnesota—Twin Cities, Minneapolis, Minnesota, USA
  • Robin Shields-Cutler
    Biotechnology Institute, University of Minnesota—Twin Cities, Minneapolis, Minnesota, USA
  • Dan Knights
    Bioinformatics and Computational Biology, University of Minnesota—Twin Cities, Minneapolis, Minnesota, USA

説明

<jats:p>Quality control of high-throughput DNA sequencing data is an important but sometimes laborious task requiring background knowledge of the sequencing protocol used (such as adaptor type, sequencing technology, insert size/stitchability, paired-endedness, etc.). Quality control protocols typically require applying this background knowledge to selecting and executing numerous quality control steps with the appropriate parameters, which is especially difficult when working with public data or data from collaborators who use different protocols. We have created a streamlined quality control pipeline intended to substantially simplify the process of DNA quality control from raw machine output files to actionable sequence data. In contrast to other methods, our proposed pipeline is easy to install and use and attempts to learn the necessary parameters from the data automatically with a single command.</jats:p>

収録刊行物

  • mSystems

    mSystems 3 (3), 2018-06-26

    American Society for Microbiology

被引用文献 (2)*注記

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ