Extraction of Similar Words Based on Time-correlation and Co-occurrence Probability from Tweets of the Same Topic

Hisano Yuichiro, Sawase Kazuhito, Nobuhara Hajime

doi:10.14864/fss.28.0_394

Bibliographic Information

Other Title

同一ハッシュタグツイート群における時空間相関情報に基づく単語類似度の計量
ドウイツハッシュタグツイートグンニオケルジクウカンソウカンジョウホウニモトズクタンゴルイジドノケイリョウ

Search this article

Abstract

In order to reduce various onomastic expressions for efficient tweet topic retrieval/clustering, a construction method of twitter dictionaries based on tweets extraction and their time-correlation is proposed. In the proposed method, similarities between keywords are calculated by the time-correlation of each word and co-occurrence probability. Furthermore, the proposed method divides the target time line to reduce the computational cost of twitter dictionaries construction. Through experiments with 101,714 tweets with the hashtags related to ``NHK kohaku-utagassen'', the effectiveness of the proposed division method compared with the method calculated using entire time line region is confirmed.

Journal

Proceedings of the Fuzzy System Symposium

Proceedings of the Fuzzy System Symposium 28 (0), 394-397, 2012

Japan Society for Fuzzy Theory and Intelligent Informatics

Keywords

Details 詳細情報について

CRID: 1390282680650369152

NII Article ID: 130005456159

NII Book ID: AA12165648

ISSN: 18820212

DOI: 10.14864/fss.28.0_394

NDL BIB ID: 023989447

Web Site: http://id.ndl.go.jp/bib/023989447; https://ndlsearch.ndl.go.jp/books/R000000004-I023989447

Data Source

JaLC
NDL
CiNii Articles

Abstract License Flag: Disallowed

Export