Statistical properties of probabilistic context-sensitive grammars

書誌事項

公開日
2024-08-26
資源種別
journal article
権利情報
  • https://creativecommons.org/licenses/by/4.0/
DOI
  • 10.1103/physrevresearch.6.033216
公開者
American Physical Society (APS)

説明

<jats:p>Probabilistic context-free grammars (PCFGs), which are commonly used to generate trees randomly, have been well analyzed theoretically, leading to applications in various domains. Despite their utility, the distributions that the grammar can express are limited to those in which the distribution of a subtree depends only on its root and not on its context. This limitation presents a challenge for modeling various real-world phenomena, such as natural languages. To overcome this limitation, a probabilistic context-sensitive grammar (PCSG) is introduced, where the distribution of a subtree depends on its context. Numerical analysis of a PCSG reveals that the distribution of a symbol does not constitute a qualitative difference from that in the context-free case, but mutual information does. Furthermore, a novel metric introduced to directly quantify the breaking of this limitation detects a distinct difference between PCFGs and PCSGs. This metric, applicable to an arbitrary distribution of a tree, allows for further investigation and characterization of various tree structures that PCFGs cannot express.</jats:p> <jats:sec> <jats:title/> <jats:supplementary-material> <jats:permissions> <jats:copyright-statement>Published by the American Physical Society</jats:copyright-statement> <jats:copyright-year>2024</jats:copyright-year> </jats:permissions> </jats:supplementary-material> </jats:sec>

収録刊行物

参考文献 (31)*注記

もっと見る

関連プロジェクト

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ