An Exploratory Investigation of IPA Texts with LDA: Using Alfred Tennyson's Poems

Bibliographic Information

Other Title
  • LDAトピックモデルによるIPAテキスト分析の試み : アルフレッド・テニスンの韻文を用いて
  • LDA トピック モデル ニヨル IPA テキスト ブンセキ ノ ココロミ アルフレッド・テニスン ノ インブン ヲ モチイテ

Description

This paper aims to investigate whether Latent Dirichlet Allocation(LDA) topic modelling can also detect the rhymes in poetry. LDA is a stylometric method that detects hidden semantic structures in texts and is now used primarily to examine prose texts. Among a relatively small number of studies on poems using LDA, Navarro-Colorado(2018) suggested that LDA can also detect rhymes in Spanish-written poetry corpus. However, speaking of the Einglish language, the spellings and the pronounciations/sounds do not always work correspond. Additionally, numerous rhymes can be found structured by the units of syllables or phonemes. Thus, applying LDA on raw English texts may not be the optimised method for detectig rhymes in texts. Considering the dissidence between spelling and sound of Einglish, this study analyses two texts-types, written in International Phonetic Alphabets(IPA) and a text of which all words in IPA are devided into vowels and consonants. Through the analysis and discussion of the emerging resurlts of LDA, this paper provides the possibility of employing LDA on IPA texts as well as issue of analysing IPA and more small-unit texts.

Journal

Details 詳細情報について

  • CRID
    1390574036160477312
  • DOI
    10.18910/88358
  • HANDLE
    11094/88358
  • Text Lang
    ja
  • Article Type
    departmental bulletin paper
  • Data Source
    • JaLC
    • IRDB

Report a problem

Back to top