University of Southampton Institutional Repository

Decentralized search over personal online datastores: architecture and performance evaluation


Ragab, Mohamed, Savateev, Yury, Oliver, Helen, Tiropanis, Thanassis, Poulovassilis, Alexandra, Chapman, Age and Russos, George (2024) Decentralized search over personal online datastores: architecture and performance evaluation. 24th International Conference on Web Engineering, Finland. 17 - 20 Jun 2024. (In Press)

Record type: Conference or Workshop Item (Paper)

Abstract

Data privacy and sovereignty are open challenges in today’s Web, which the Solid ecosystem aims to meet by providing personal online datastores (pods) where individuals can control access to their data. Solid allows developers to deploy applications with access to data stored in pods, subject to users’ permission. For the decentralised Web to succeed, the problem of search over pods with varying access permissions must be solved. The ESPRESSO framework takes the first step in exploring such a search architecture, enabling large-scale keyword search across Solid pods with varying access rights. This paper provides a comprehensive experimental evaluation of the performance and scalability of decentralised keyword search across pods on the current ESPRESSO prototype. The experiments specifically investigate how controllable experimental parameters influence search performance across a range of decentralised settings. This includes examining the impact of different text dataset sizes (0.5 MB to 50 MB per pod, divided into 1 to 10,000 files), different access control levels (10%, 25%, 50%, or 100% file access), and a range of configurations for Solid servers and pods (from 1 to 100 pods across 1 to 50 servers). The experimental results confirm the feasibility of deploying a decentralised search system to conduct keyword search at scale in a decentralised environment.

This record has no associated files available for download.

More information

Accepted/In Press date: 27 March 2024
Venue - Dates: 24th International Conference on Web Engineering, Finland, 2024-06-17 - 2024-06-20

Identifiers

Local EPrints ID: 489925
URI: http://eprints.soton.ac.uk/id/eprint/489925
PURE UUID: b2e64f63-31d1-4a62-aa8d-709869e49d37
ORCID for Thanassis Tiropanis: orcid.org/0000-0002-6195-2852
ORCID for Age Chapman: orcid.org/0000-0002-3814-2587

Catalogue record

Date deposited: 07 May 2024 16:53
Last modified: 12 Oct 2024 02:12

Contributors

Author: Mohamed Ragab
Author: Yury Savateev
Author: Helen Oliver
Author: Thanassis Tiropanis
Author: Alexandra Poulovassilis
Author: Age Chapman
Author: George Russos
