ブログにおける記述者本人の氏名判別手法の提案

森, 加夢偉, Kamui, Mori

近年，個人情報保護法の施行や，個人情報の電子化が急速に進むに伴い，Ｗｅｂでは個人情報の保護に対する意識が高まっている．一方，個人のインターネットリテラシの欠如が一部でみられ，特にブログでは無意識な自分のブログにおいて自身の個人情報の漏えいが起きている．こうした個人情報の不用意な露出を防止するためには，個人情報の露出状況を本人に自覚させることが，一つの解決策である．しかし，自覚させるためには、個人情報の露出状況を何らかの方法でフィードバックする必要があり，そのため客観的かつ自動的な評価が必須となる．自動化するに当たっては，自由な形式で書かれているブログ記事に出現する個人情報を抽出し本人のものかどうか判別する必要がある．本稿では，個人情報の中でも特に重要な氏名に着目し，本人のものか他人のものかを判別するモデルを提案する．また，実際のブログ記事データを用いパラメータを決定し評価値のしきい値を明らかにするとともにモデルを検証する．

In recent years, individual information has been rapidly digitized and people have had great concern about personal information protection with enactment of the Personal Information Protection act in Japan. On the other hand, incidents of personal information exposure occur frequently. It is important to recognize the situation of information exposure to prevent the danger by inform the situation to the person. It is necessary to objective evaluation of the situation and automatic evaluation. Technology information is to distinguish self personal information on the blog article is important to extract personal information. In this paper, we focus on "Name" which is most important information as personal information and a model to extract self name from blog articles is proposed. In addition, we estimate the parameter and threshold value are and evaluate the model using actual blog articles.

ブログにおける記述者本人の氏名判別手法の提案

書誌事項

説明

収録刊行物

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

ブログにおける記述者本人の氏名判別手法の提案

書誌事項

説明

収録刊行物

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について