【24h】

XQzip: Querying Compressed XML Using Structural Indexing

机译:XQzip:使用结构索引查询压缩的XML

获取原文
获取原文并翻译 | 示例

摘要

XML makes data flexible in representation and easily portable on the Web but it also substantially inflates data size as a consequence of using tags to describe data. Although many effective XML compressors, such as XMill, have been recently proposed to solve this data inflation problem, they do not address the problem of running queries on compressed XML data. More recently, some compressors have been proposed to query compressed XML data. However, the compression ratio of these compressors is usually worse than that of XMill and that of the generic compressor gzip, while their query performance and the expressive power of the query language they support are inadequate. In this paper, we propose XQzip, an XML compressor which supports querying compressed XML data by imposing an indexing structure, which we call Structure Index Tree (SIT), on XML data. XQzip addresses both the compression and query performance problems of existing XML compressors. We evaluate XQzip's performance extensively on a wide spectrum of benchmark XML data sources. On average, XQzip is able to achieve a compression ratio 16.7% better and a querying time 12.84 times less than another known queriable XML compressor. In addition, XQzip supports a wide scope of XPath queries such as multiple, deeply nested predicates and aggregation.
机译:XML使数据的表示形式灵活并且可以轻松地在Web上移植,但是由于使用标签描述数据,因此也大大增加了数据大小。尽管最近提出了许多有效的XML压缩程序(例如XMill)来解决此数据膨胀问题,但它们并未解决在压缩的XML数据上运行查询的问题。最近,已经提出了一些压缩器来查询压缩的XML数据。但是,这些压缩器的压缩率通常比XMill和通用压缩器gzip的压缩率差,而它们的查询性能和所支持的查询语言的表达能力却不足。在本文中,我们提出了XQzip,它是一种XML压缩器,它通过在XML数据上施加索引结构(我们称为结构索引树(SIT))来支持查询压缩的XML数据。 XQzip解决了现有XML压缩器的压缩和查询性能问题。我们在各种基准XML数据源上广泛评估XQzip的性能。与其他已知的可查询XML压缩器相比,平均而言,XQzip的压缩率提高了16.7%,查询时间缩短了12.84倍。另外,XQzip支持广泛的XPath查询,例如多个深度嵌套的谓词和聚合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号