NanoGalaxy: Nanopore long-read sequencing data analysis in Galaxy

  • Willem de Koning
    Department of Pathology, Clinical Bioinformatics Unit, Erasmus University Medical Centre, Wytemaweg 80, 3015 CN, Rotterdam, the Netherlands
  • Milad Miladi
    Department of Computer Science, Bioinformatics Group, University of Freiburg, 79110 Freiburg im Breisgau, Germany
  • Saskia Hiltemann
    Department of Pathology, Clinical Bioinformatics Unit, Erasmus University Medical Centre, Wytemaweg 80, 3015 CN, Rotterdam, the Netherlands
  • Astrid Heikema
    Department of Medical Microbiology and Infectious Diseases, Erasmus University Medical Centre, ’s Gravendijkwal 230, 3015 CE, Rotterdam, the Netherlands
  • John P Hays
    Department of Medical Microbiology and Infectious Diseases, Erasmus University Medical Centre, ’s Gravendijkwal 230, 3015 CE, Rotterdam, the Netherlands
  • Stephan Flemming
    Department of Computer Science, Bioinformatics Group, University of Freiburg, 79110 Freiburg im Breisgau, Germany
  • Marius van den Beek
    Department of Stem Cells and Tissue Homeostasis, Institut Curie, PSL Research University, 75005 Paris, France
  • Dana A Mustafa
    Department of Pathology, Tumor Immuno-Pathology Laboratory, Erasmus University Medical Centre, ’s Gravendijkwal 230, 3015 CE, Rotterdam, the Netherlands
  • Rolf Backofen
    Department of Computer Science, Bioinformatics Group, University of Freiburg, 79110 Freiburg im Breisgau, Germany
  • Björn Grüning
    Department of Computer Science, Bioinformatics Group, University of Freiburg, 79110 Freiburg im Breisgau, Germany
  • Andrew P Stubbs
    Department of Pathology, Clinical Bioinformatics Unit, Erasmus University Medical Centre, Wytemaweg 80, 3015 CN, Rotterdam, the Netherlands

Description

<jats:title>Abstract</jats:title> <jats:sec> <jats:title>Background</jats:title> <jats:p>Long-read sequencing can be applied to generate very long contigs and even completely assembled genomes at relatively low cost and with minimal sample preparation. As a result, long-read sequencing platforms are becoming more popular. In this respect, the Oxford Nanopore Technologies–based long-read sequencing “nanopore" platform is becoming a widely used tool with a broad range of applications and end-users. However, the need to explore and manipulate the complex data generated by long-read sequencing platforms necessitates accompanying specialized bioinformatics platforms and tools to process the long-read data correctly. Importantly, such tools should additionally help democratize bioinformatics analysis by enabling easy access and ease-of-use solutions for researchers.</jats:p> </jats:sec> <jats:sec> <jats:title>Results</jats:title> <jats:p>The Galaxy platform provides a user-friendly interface to computational command line–based tools, handles the software dependencies, and provides refined workflows. The users do not have to possess programming experience or extended computer skills. The interface enables researchers to perform powerful bioinformatics analysis, including the assembly and analysis of short- or long-read sequence data. The newly developed “NanoGalaxy" is a Galaxy-based toolkit for analysing long-read sequencing data, which is suitable for diverse applications, including de novo genome assembly from genomic, metagenomic, and plasmid sequence reads.</jats:p> </jats:sec> <jats:sec> <jats:title>Conclusions</jats:title> <jats:p>A range of best-practice tools and workflows for long-read sequence genome assembly has been integrated into a NanoGalaxy platform to facilitate easy access and use of bioinformatics tools for researchers. NanoGalaxy is freely available at the European Galaxy server https://nanopore.usegalaxy.eu with supporting self-learning training material available at https://training.galaxyproject.org.</jats:p> </jats:sec>

Journal

  • GigaScience

    GigaScience 9 (10), 2020-10

    Oxford University Press (OUP)

Citations (1)*help

See more

Report a problem

Back to top