Decentralized search over personal online datastores: architecture and performance evaluation
Decentralized search over personal online datastores: architecture and performance evaluation
Data privacy and sovereignty are open challenges in today’s Web, which the Solid 4 ecosystem aims to meet by providing personal online datastores (pods) where individuals can control access to their data. Solid allows developers to deploy applications with access to data stored in pods, subject to users’ permission. For the decentralised Web to succeed, the problem of search over pods with varying access permissions must be solved. The ESPRESSO framework takes the first step in exploring such a search architecture, enabling large-scale keyword search across Solid pods with varying access rights. This paper provides a comprehensive experimental evaluation of the performance and scalability of decentralised keyword search across pods on the current ESPRESSO prototype. The experiments specifically investigate how controllable experimental parameters influence search performance across a range of decentralised settings. This includes examining the impact of different text dataset sizes (0.5MB to 50MB per pod, divided into 1 to 10,000 files), different access control levels (10%, 25%, 50%, or 100% file access), and a range of configurations for Solid servers and pods (from 1 to 100 pods across 1 to 50 servers). The experimental results confirm the feasibility of deploying a decentralised search system to conduct keyword search at scale in a decentralised environment.
Ragab, Mohamed
70b66274-31dc-474c-82a1-f838ad062a14
Savateev, Yury
92685970-9170-450a-ae7d-a580fdd854a4
Oliver, Helen
a8c3c44b-4cd8-40e9-9e65-280f8669e56f
Tiropanis, Thanassis
d06654bd-5513-407b-9acd-6f9b9c5009d8
Poulovassilis, Alexandra
3b1668fd-3d66-4ea4-aacd-ea75a78fc064
Chapman, Age
721b7321-8904-4be2-9b01-876c430743f1
Russos, George
6bb21245-3b51-488f-ae36-8787ed80ff55
Ragab, Mohamed
70b66274-31dc-474c-82a1-f838ad062a14
Savateev, Yury
92685970-9170-450a-ae7d-a580fdd854a4
Oliver, Helen
a8c3c44b-4cd8-40e9-9e65-280f8669e56f
Tiropanis, Thanassis
d06654bd-5513-407b-9acd-6f9b9c5009d8
Poulovassilis, Alexandra
3b1668fd-3d66-4ea4-aacd-ea75a78fc064
Chapman, Age
721b7321-8904-4be2-9b01-876c430743f1
Russos, George
6bb21245-3b51-488f-ae36-8787ed80ff55
Ragab, Mohamed, Savateev, Yury, Oliver, Helen, Tiropanis, Thanassis, Poulovassilis, Alexandra, Chapman, Age and Russos, George
(2024)
Decentralized search over personal online datastores: architecture and performance evaluation.
24th International Conference on Web Engineering, Finland.
17 - 20 Jun 2024.
(In Press)
Record type:
Conference or Workshop Item
(Paper)
Abstract
Data privacy and sovereignty are open challenges in today’s Web, which the Solid 4 ecosystem aims to meet by providing personal online datastores (pods) where individuals can control access to their data. Solid allows developers to deploy applications with access to data stored in pods, subject to users’ permission. For the decentralised Web to succeed, the problem of search over pods with varying access permissions must be solved. The ESPRESSO framework takes the first step in exploring such a search architecture, enabling large-scale keyword search across Solid pods with varying access rights. This paper provides a comprehensive experimental evaluation of the performance and scalability of decentralised keyword search across pods on the current ESPRESSO prototype. The experiments specifically investigate how controllable experimental parameters influence search performance across a range of decentralised settings. This includes examining the impact of different text dataset sizes (0.5MB to 50MB per pod, divided into 1 to 10,000 files), different access control levels (10%, 25%, 50%, or 100% file access), and a range of configurations for Solid servers and pods (from 1 to 100 pods across 1 to 50 servers). The experimental results confirm the feasibility of deploying a decentralised search system to conduct keyword search at scale in a decentralised environment.
This record has no associated files available for download.
More information
Accepted/In Press date: 27 March 2024
Venue - Dates:
24th International Conference on Web Engineering, Finland, 2024-06-17 - 2024-06-20
Identifiers
Local EPrints ID: 489925
URI: http://eprints.soton.ac.uk/id/eprint/489925
PURE UUID: b2e64f63-31d1-4a62-aa8d-709869e49d37
Catalogue record
Date deposited: 07 May 2024 16:53
Last modified: 12 Oct 2024 02:12
Export record
Contributors
Author:
Mohamed Ragab
Author:
Yury Savateev
Author:
Helen Oliver
Author:
Thanassis Tiropanis
Author:
Alexandra Poulovassilis
Author:
George Russos
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics