Compaction techniques for nextword indexes

SPIRE(2001)

引用 23|浏览14
暂无评分
摘要
Most queries to text search engines are ranked or Boolean. Phrase querying is a powerful technique for rening searches, but is expensive to implement on conventional indexes. In other work, a nextword index has been proposed as a structure specically designed for phrase queries. Nextword indexes are, however, relatively large. In this paper we introduce new compaction techniques for nextword indexes. In contrast to most index compression schemes, these techniques are lossy, yet as we show allow full resolution of phrase queries without false match checking. We show experimentally that our novel techniques lead to signican t savings in index size.
更多
查看译文
关键词
concrete,information retrieval,search engine,compaction,search engines,computer science,information technology,indexation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要