Query optimization of distributed pattern matching

ICDE(2014)

引用 42|浏览80
暂无评分
摘要
Greedy algorithms for subgraph pattern matching operations are often sufficient when the graph data set can be held in memory on a single machine. However, as graph data sets increasingly expand and require external storage and partitioning across a cluster of machines, more sophisticated query optimization techniques become critical to avoid explosions in query latency. In this paper, we introduce several query optimization techniques for distributed graph pattern matching. These techniques include (1) a System-R style dynamic programming-based optimization algorithm that considers both linear and bushy plans, (2) a cycle detection-based algorithm that leverages cycles to reduce intermediate result set sizes, and (3) a computation reusing technique that eliminates redundant query execution and data transfer over the network. Experimental results show that these algorithms can lead to an order of magnitude improvement in query performance.
更多
查看译文
关键词
query latency,external storage,cycle detection based algorithm,pattern matching,greedy algorithms,subgraph pattern matching operations,query optimization techniques,dynamic programming,distributed graph pattern matching,distributed processing,query processing,data models,distributed databases,optimization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要