[Updated on Apr. 18] Integration of CiNii Articles into CiNii Research

The Extent of the Deep Web in Japanese Institutional Repositories


Bibliographic Information

Other Title
  • 深層ウェブの実態とその要因 : 機関リポジトリに登録された文献を用いた調査
  • シンソウ ウェブ ノ ジッタイ ト ソノ ヨウイン : キカン リポジトリ ニ トウロク サレタ ブンケン オ モチイタ チョウサ

Search this article


The more the size of Web increases, the more serious the problem of the deep Web (the Web not accessible to search engines) becomes. McCown et al. (2006) and Hagedorn & Santelli (2008) surveyed extent of deep Web using metadata contained in institutional repositories. In this research, applying the method used in that previous work, we measured the extent of the deep Web on a larger scale using PDF file URLs contained in institutional repositories in Japan in September 2009. The results show that the coverage rate of major search engines (Google, Yahoo! and Bing) is 72%, leaving 28% as the maximum extent of the deep Web. And examination of the characteristics of the files revealed that dynamic URLs and longer URLs are associated with decreased coverage rates for search engines.


Citations (0)*help

See more


See more

Related Articles

See more

Related Data

See more

Related Books

See more

Related Dissertations

See more

Related Projects

See more

Related Products

See more


Report a problem

Back to top