Robust and Fast Cardinality Estimator by Non-Autoregressive Model

伊藤, 竜一, 佐々木, 勇和, 肖, 川, 鬼塚, 真, Ryuichi, Ito, Yuya, Sasaki, Chuan, Xiao, Makoto, Onizuka

データベースシステムにおいて，カーディナリティ推定はクエリ応答性能に大きな影響力を持つ重要な要素技術である．既存のデータベースシステムでは属性間の依存関係を考慮しないため，カーディナリティ推定の性能を悪化させる原因となっている．近年，機械学習による属性間の依存関係を考慮したカーディナリティ推定技術が提案されているが，学習時の属性順序に依存して推定精度が大きく左右され推論速度も遅いという問題がある．本論文では，属性間の依存関係を非自己回帰モデルで学習し，推論時に与えられたクエリに応じたカーディナリティ推定を行う手法を提案する．既存技術と異なり属性順序に依存しないため，少ない推論ステップ数で安定したカーディナリティ推定が可能である．複数のベンチマークにおいて，既存手法のピーク性能と同程度の性能を安定して達成したうえで2～3倍の高速化に成功した．また，クエリオプティマイザへの応用を想定した実験を行ったところ，想定されるコストの大きいクエリを中心に既存のデータベースシステムと比較して改善が確認された．

In database systems, cardinality estimation is a fundamental technology that significantly impacts query performance. Recently, machine-learning techniques are employed for cardinality estimation, which learn dependencies among attributes. However, they have a problem that the estimation accuracy is unstable and the inference speed is slow. In this paper, we propose a robust and fast cardinality estimation method that learns dependencies among attributes by a non-autoregressive model and performs estimation in fewer steps and proper order according to a given query at the inference phase. On multiple benchmarks, our method achieves performance comparable to or better than the peak performance of existing methods, while achieving a 2-3x speedup. Experiments conducted for application to a query optimizer showed improvement compared to existing database systems, mainly for queries with high expected costs.

Robust and Fast Cardinality Estimator by Non-Autoregressive Model

Bibliographic Information

Search this article

Description

Journal

Keywords

Details 詳細情報について

Export

Report a problem