-
- Swarup Acharya
- Information Sciences Research Center, Bell Laboratories, Lucent Technologies, 600, Mountain Avenue, Murray Hill, NJ
-
- Viswanath Poosala
- Information Sciences Research Center, Bell Laboratories, Lucent Technologies, 600, Mountain Avenue, Murray Hill, NJ
-
- Sridhar Ramaswamy
- Epiphany Inc., 2300 Geng Road, Suite 200, Palo Alto, CA and Information Sciences Research Center, Bell Laboratories, Lucent Technologies, 600, Mountain Avenue, Murray Hill, NJ
説明
<jats:p> Selectivity estimation of queries is an important and well-studied problem in relational database systems. In this paper, we examine selectivity estimation in the context of Geographic Information Systems, which manage spatial data such as points, lines, poly-lines and polygons. In particular, we focus on point and range queries over two-dimensional rectangular data. We propose several techniques based on using spatial indices, histograms, binary space partitionings (BSPs), and the novel notion of spatial skew. Our techniques carefully partition the input rectangles into subsets and approximate each partition accurately. We present a detailed experimental study comparing the proposed techniques and the best known sampling and parametric techniques. We evaluate them using synthetic as well as real-life TIGER datasets. Based on our experiments, we identify a BSP based partitioning that we call <jats:italic>Min-Skew</jats:italic> which consistently provides the most accurate selectivity estimates for spatial queries. The Min-Skew partitioning can be constructed efficiently, occupies very little space, and provides accurate selectivity estimates over a broad range of spatial queries. </jats:p>
収録刊行物
-
- ACM SIGMOD Record
-
ACM SIGMOD Record 28 (2), 13-24, 1999-06
Association for Computing Machinery (ACM)