PRINSEQ++, a multi-threaded tool for fast and efficient quality control and preprocessing of sequencing datasets
-
- Vito Adrian Cantu
- Computational Science Research Center, San Diego State University, San Diego, California, United States
-
- Jeffrey Sadural
- Department of Computer Science, San Diego State University, San Diego, California, United States
-
- Robert Edwards
- Department of Computer Science, San Diego State University, San Diego, California, United States
抄録
<jats:p>PRINSEQ++ is a C++ implementation of the very popular software prinseq-lite for quality control and preprocessing of sequencing datasets. PRINSEQ++ can run multi-threaded processes, which makes it more than 10 times faster than the original version. It can read from, and write to, compressed files, drastically reducing the use of hard-drive. PRINSEQ++ can filter, trim and reformat sequences by a variety of options to improve downstream analysis. PRINSEQ++ is freely available on GitHub (https://github.com/Adrian-Cantu/PRINSEQ-plus-plus) and runs on all Unix-like systems.</jats:p>