The University of Southampton
University of Southampton Institutional Repository

Compressing and maintaining statistics information about resource occurrences in a distributed RDF store

Compressing and maintaining statistics information about resource occurrences in a distributed RDF store
Compressing and maintaining statistics information about resource occurrences in a distributed RDF store

In distributed RDF stores triples are assigned to one or several storage and compute nodes. In order to perform query planning and optimization, statistical information about the occurrences of IRIs and literals on the individual storage and compute nodes is needed. In this paper, we present our novel compressed storage format for statistical information that can be updated with a single read and write operation if resources occur on few storage and compute nodes only. In our experiments this novel storage format reduced the time to collect statistical information by up to 97% and the required space by up to 99%.

1613-0073
Janke, Daniel
4a5d4f28-8add-435f-a223-e2a19d423012
Staab, Steffen
bf48d51b-bd11-4d58-8e1c-4e6e03b30c49
Janke, Daniel
4a5d4f28-8add-435f-a223-e2a19d423012
Staab, Steffen
bf48d51b-bd11-4d58-8e1c-4e6e03b30c49

Janke, Daniel and Staab, Steffen (2018) Compressing and maintaining statistics information about resource occurrences in a distributed RDF store. CEUR Workshop Proceedings, 2180.

Record type: Article

Abstract

In distributed RDF stores triples are assigned to one or several storage and compute nodes. In order to perform query planning and optimization, statistical information about the occurrences of IRIs and literals on the individual storage and compute nodes is needed. In this paper, we present our novel compressed storage format for statistical information that can be updated with a single read and write operation if resources occur on few storage and compute nodes only. In our experiments this novel storage format reduced the time to collect statistical information by up to 97% and the required space by up to 99%.

This record has no associated files available for download.

More information

Published date: October 2018

Identifiers

Local EPrints ID: 427318
URI: http://eprints.soton.ac.uk/id/eprint/427318
ISSN: 1613-0073
PURE UUID: 8845cec9-600c-4642-bde8-f6494eee7a53
ORCID for Steffen Staab: ORCID iD orcid.org/0000-0002-0780-4154

Catalogue record

Date deposited: 11 Jan 2019 17:30
Last modified: 06 Jun 2024 01:54

Export record

Contributors

Author: Daniel Janke
Author: Steffen Staab ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×