RefSeq Data & Scripts for ORF dominance

メタデータ

公開日
2022-01-01
DOI
  • 10.6084/m9.figshare.7269500.v1
  • 10.6084/m9.figshare.7269500
公開者
figshare
データ作成者 (e-Rad)
  • NAGAI, Momoko
  • Makino, Takashi
  • Kogashi, Hiroyuki
  • Nakatani, Kazuma
  • Kobatake, Miho
  • Suenaga, Yusuke
  • Kato, Mamoru
  • Yokoi, Sana

説明

This is the fileset for the article "Protein-coding potential of RNAs measured by open reading frame dominance" by Y.Suenaga, et al.<br><br>This fileset consists of the datasets of human (Feb 2015 and April 2018) and 8 spicies given in the article.<br><br>[Dataset]<br>- RefSeq gzipped fasta files (data/RefSeq)<br>- Scripts to generate open reading frame (ORF) dominance score and other information from RefSeq data. (script)<br><br>[How to Run Scripts]<br>- 1. Unpack a tar.xz file.<br>- 2. Run script/01_MergeFa.sh.<br>- 3. Run script/02_ORFdominance.sh.<br>- 4. Run script/03_Format.sh<br>Under data/03_Format, you can get NM.txt and NR.txt as the result.<br><br>[Note]<br>- Scripts require Linux, bash, and perl.<br>- Scripts use randomized data. Consequently, results differ slightly for the same input data.<br>- Scripts are the same files in "Scripts for ORF dominance" (DOI: 10.6084/m9.figshare.7269518).<br><br><br>

関連論文

もっと見る

詳細情報 詳細情報について

ページトップへ