Improving conditional random field model for prediction of protein‐RNA residue‐base contacts

DOI PDF Web Site Web Site 参考文献38件 オープンアクセス
  • Morihiro Hayashida
    <!--1--> Department of Electrical Engineering and Computer Science National Institute of Technology Matsue College Shimane 690‐8518 Japan
  • Noriyuki Okada
    <!--1--> Department of Electrical Engineering and Computer Science National Institute of Technology Matsue College Shimane 690‐8518 Japan
  • Mayumi Kamada
    <!--2--> Graduate School of Medicine Kyoto University Kyoto 606‐8507 Japan
  • Hitoshi Koyano
    <!--3--> Riken Quantitative Biology Center Hyogo 650‐0047 Japan

説明

<jats:sec><jats:title>Background</jats:title><jats:p>For understanding biological cellular systems, it is important to analyze interactions between protein residues and RNA bases. A method based on conditional random fields (CRFs) was developed for predicting contacts between residues and bases, which receives multiple sequence alignments for given protein and RNA sequences, respectively, and learns the model with many parameters involved in relationships between neighboring residue‐base pairs by maximizing the pseudo likelihood function.</jats:p></jats:sec><jats:sec><jats:title>Methods</jats:title><jats:p>In this paper, we proposed a novel CRF‐based model with more complicated dependency relationships between random variables than the previous model, but which takes less parameters for the sake of avoidance of overfitting to training data.</jats:p></jats:sec><jats:sec><jats:title>Results</jats:title><jats:p>We performed cross‐validation experiments for evaluating the proposed model, and took the average of AUC (area under receiver operating characteristic curve) scores. The result suggests that the proposed CRF‐based model without using <jats:italic>L</jats:italic><jats:sub>1</jats:sub>‐norm regularization (lasso) outperforms the existing model with and without the lasso under several input observations to CRFs.</jats:p></jats:sec><jats:sec><jats:title>Conclusions</jats:title><jats:p>We proposed a novel stochastic model for predicting protein‐RNA residue‐base contacts, and improved the prediction accuracy in terms of the AUC score. It implies that more dependency relationships in a CRF could be controlled by less parameters.</jats:p></jats:sec>

収録刊行物

参考文献 (38)*注記

もっと見る

関連プロジェクト

もっと見る

問題の指摘

ページトップへ