Identification of RNA Virus–Derived RdRp Sequences in Publicly Available Transcriptomic Data Sets
- Ingrida Olendraite
- Division of Virology, Department of Pathology, Addenbrookes Hospital, University of Cambridge, Cambridge, United Kingdom
- Katherine Brown
- Division of Virology, Department of Pathology, Addenbrookes Hospital, University of Cambridge, Cambridge, United Kingdom
- Andrew E Firth
- Division of Virology, Department of Pathology, Addenbrookes Hospital, University of Cambridge, Cambridge, United Kingdom
- Thomas Leitner
- editor
Bibliographic details
- Publication date
- 2023-04-01
- Rights
- https://creativecommons.org/licenses/by/4.0/
- DOI
- 10.1093/molbev/msad060
- Publisher
- Oxford University Press (OUP)
Description
Abstract
RNA viruses are abundant and highly diverse and infect all or most eukaryotic organisms. However, only a tiny fraction of the number and diversity of RNA virus species have been catalogued. To cost-effectively expand the diversity of known RNA virus sequences, we mined publicly available transcriptomic data sets. We developed 77 family-level Hidden Markov Model profiles for the viral RNA-dependent RNA polymerase (RdRp), the only universal “hallmark” gene of RNA viruses. By using these to search the National Center for Biotechnology Information Transcriptome Shotgun Assembly database, we identified 5,867 contigs encoding RNA virus RdRps or fragments thereof and analyzed their diversity, taxonomic classification, phylogeny, and host associations. Our study expands the known diversity of RNA viruses, and the 77 curated RdRp Profile Hidden Markov Models provide a useful resource for the virus discovery community.
Published in
- Molecular Biology and Evolution
Molecular Biology and Evolution 40 (4), 1-, 2023-04-01
Oxford University Press (OUP)