The Extent of the Deep Web in Japanese Institutional Repositories

MIYATA Yosuke, AGATA Teru, IKEUCHI Atsushi, ISHITA Emi, UEDA Shuichi

doi:10.20651/jslis.58.2_97

Bibliographic Information

Other Title

深層ウェブの実態とその要因 : 機関リポジトリに登録された文献を用いた調査
シンソウウェブノジッタイトソノヨウイン : キカンリポジトリニトウロクサレタブンケンオモチイタチョウサ

Search this article

Description

The more the size of Web increases, the more serious the problem of the deep Web (the Web not accessible to search engines) becomes. McCown et al. (2006) and Hagedorn & Santelli (2008) surveyed extent of deep Web using metadata contained in institutional repositories. In this research, applying the method used in that previous work, we measured the extent of the deep Web on a larger scale using PDF file URLs contained in institutional repositories in Japan in September 2009. The results show that the coverage rate of major search engines (Google, Yahoo! and Bing) is 72%, leaving 28% as the maximum extent of the deep Web. And examination of the characteristics of the files revealed that dynamic URLs and longer URLs are associated with decreased coverage rates for search engines.

Journal

Journal of Japan Society of Library and Information Science

Journal of Japan Society of Library and Information Science 58 (2), 97-109, 2012

Japan Society of Library and Information Science

Details 詳細情報について

CRID: 1390001204568588160

NII Article ID: 110009479379

NII Book ID: AA11333306

DOI: 10.20651/jslis.58.2_97

ISSN: 24324027; 13448668

NDL BIB ID: 023789682

Web Site: http://id.ndl.go.jp/bib/023789682; https://ndlsearch.ndl.go.jp/books/R000000004-I023789682

Text Lang: ja

Article Type: journal article

Data Source

JaLC
NDL Search
CiNii Articles
KAKEN

Abstract License Flag: Disallowed

Export

Report a problem