reactIDR: Evaluation of the statistical reproducibility of high-throughput structural analyses for a robust RNA reactivity classification

Description

<jats:title>Abstract</jats:title><jats:sec><jats:title>Motivation</jats:title><jats:p>Recently, next-generation sequencing techniques have been applied for the detection of RNA secondary structures called high-throughput RNA structural (HTS) analy- sis, and dozens of different protocols were used to detect comprehensive RNA structures at single-nucleotide resolution. However, the existing computational analyses heavily depend on experimental data generation methodology, which results in many difficulties associated with statistically sound comparisons or combining the results obtained using different HTS methods.</jats:p></jats:sec><jats:sec><jats:title>Results</jats:title><jats:p>Here, we introduced a statistical framework, reactIDR, which is applicable to the experimental data obtained using multiple HTS methodologies, and it classifies the nucleotides into three structural categories, <jats:italic>stem, loop</jats:italic>, and <jats:italic>unmapped</jats:italic>. reactIDR uses the irreproducible discovery rate (IDR) with a hidden Markov model (HMM) to discriminate accurately between the true and spurious signals obtained in the replicated HTS experiments. In reactIDR, IDR and HMM parameters are efficiently optimized by using an expectation-maximization algorithm. Furthermore, if known reference structures are given, a supervised learning can be applicable in a semi-supervised manner. The results of our analyses for real HTS data showed that reactIDR achieved the highest accuracy in the classification problem of stem/loop structures of rRNA using both individual and integrated HTS datasets as well as the best correspondence with the three-dimensional structure. Because reactIDR is the first method to compare HTS datasets obtained from multiple sources in a single unified model, it has a great potential to increase the accuracy of RNA secondary structure prediction at transcriptome-wide level with further experiments performed.</jats:p></jats:sec><jats:sec><jats:title>Availability</jats:title><jats:p>reactIDR is implemented in Python. Source code is publicly available at <jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/carushi/reactIDRhttps://github.com/carushi/reactIDR">https://github.com/carushi/reactIDRhttps://github.com/carushi/reactIDR</jats:ext-link>.</jats:p></jats:sec><jats:sec><jats:title>Contact</jats:title><jats:p><jats:email>kawaguchi-rs@aist.go.jp</jats:email></jats:p></jats:sec><jats:sec sec-type="supplementary-material"><jats:title>Supplementary information</jats:title><jats:p>Supplementary data are available at online.</jats:p></jats:sec>

References(33)*help

See more

Related Articles

See more

Details 詳細情報について

Report a problem

Back to top