Compressing and maintaining statistics information about resource occurrences in a distributed RDF store
In distributed RDF stores, triples are assigned to one or several storage and compute nodes. To perform query planning and optimization, statistical information about the occurrences of IRIs and literals on the individual storage and compute nodes is needed. In this paper, we present our novel compressed storage format for statistical information, which can be updated with a single read and write operation if resources occur on only a few storage and compute nodes. In our experiments, this novel storage format reduced the time to collect statistical information by up to 97% and the required space by up to 99%.
Janke, Daniel
Staab, Steffen
October 2018
Janke, Daniel and Staab, Steffen (2018) Compressing and maintaining statistics information about resource occurrences in a distributed RDF store. CEUR Workshop Proceedings, 2180.
More information
Published date: October 2018
Identifiers
Local EPrints ID: 427318
URI: http://eprints.soton.ac.uk/id/eprint/427318
ISSN: 1613-0073
PURE UUID: 8845cec9-600c-4642-bde8-f6494eee7a53
Catalogue record
Date deposited: 11 Jan 2019 17:30
Last modified: 06 Jun 2024 01:54
Contributors
Author: Daniel Janke
Author: Steffen Staab