Chrome Extension
WeChat Mini Program
Use on ChatGLM

Learned Probing Cardinality Estimation for High-Dimensional Approximate NN Search

ICDE(2023)

Cited 0|Views19
No score
Abstract
Approximate nearest neighbor (ANN) search in high-dimensional space plays an essential role in a variety of real-world applications. A well-known solution to ANN search, inverted file product quantization (IVFPQ) adopts inverted files to avoid exhaustive examination and compresses vectors using product quantization to reduce the space overhead. However, existing implementations use the same fixed probing cardinality (i.e., the number of cells to probe) setting for all queries, which leads to too many or too few cell examinations, thus increasing the average query latency or reducing the recall. To achieve a better trade-off between latency and accuracy, we enable probing cardinality estimation for high-dimensional ANN search by using deep learning techniques. We develop HBK-means, a hierarchical balanced clustering algorithm that reduces the data distribution imbalance of cells to enable a better estimation. Next, we develop PCE-Net, an encoder-decoder based neural network for estimating query-dependent minimum probing cardinality. In addition, we introduce two query optimization strategies: lower bound sorting based pruning (LBS-Pruning) and early termination (ET), to further reduce query latency. Extensive experiments with real-world data offer evidence that the proposed solution is capable of achieving better performance than IVFPQ and its variants.
More
Translated text
Key words
approximate nearest neighbor search,cell examinations,compresses vectors,data distribution,deep learning techniques,early termination,encoder-decoder based neural network,ET,HBK,hierarchical balanced clustering algorithm,high-dimensional ANN search,high-dimensional approximate NN search,high-dimensional space,inverted file product quantization,IVFPQ,LBS-pruning,lower bound sorting based pruning,PCE-Net,query optimization strategies,query-dependent minimum probing cardinality
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined