An Exploratory Investigation of IPA Texts with LDA: Using Alfred Tennyson's Poems
Bibliographic Information
- Other Title
-
- LDAトピックモデルによるIPAテキスト分析の試み : アルフレッド・テニスンの韻文を用いて
- LDA トピック モデル ニヨル IPA テキスト ブンセキ ノ ココロミ アルフレッド・テニスン ノ インブン ヲ モチイテ
Description
This paper aims to investigate whether Latent Dirichlet Allocation(LDA) topic modelling can also detect the rhymes in poetry. LDA is a stylometric method that detects hidden semantic structures in texts and is now used primarily to examine prose texts. Among a relatively small number of studies on poems using LDA, Navarro-Colorado(2018) suggested that LDA can also detect rhymes in Spanish-written poetry corpus. However, speaking of the Einglish language, the spellings and the pronounciations/sounds do not always work correspond. Additionally, numerous rhymes can be found structured by the units of syllables or phonemes. Thus, applying LDA on raw English texts may not be the optimised method for detectig rhymes in texts. Considering the dissidence between spelling and sound of Einglish, this study analyses two texts-types, written in International Phonetic Alphabets(IPA) and a text of which all words in IPA are devided into vowels and consonants. Through the analysis and discussion of the emerging resurlts of LDA, this paper provides the possibility of employing LDA on IPA texts as well as issue of analysing IPA and more small-unit texts.
Journal
-
- 言語文化共同研究プロジェクト
-
言語文化共同研究プロジェクト 2021 15-38, 2022-03-31
Graduate School of Language and Culture, Osaka University
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1390574036160477312
-
- DOI
- 10.18910/88358
-
- HANDLE
- 11094/88358
-
- Text Lang
- ja
-
- Article Type
- departmental bulletin paper
-
- Data Source
-
- JaLC
- IRDB