How to measure scalability of distributed stream processing engines?
How to measure scalability of distributed stream processing engines?
Scalability is promoted as a key quality feature of modern big data stream processing engines. However, even though research made huge efforts to provide precise definitions and corresponding metrics for the term scalability, experimental scalability evaluations or benchmarks of stream processing engines apply different and inconsistent metrics. With this paper, we aim to establish general metrics for scalability of stream processing engines. Derived from common definitions of scalability in cloud computing, we propose two metrics: a load capacity function and a resource demand function. Both metrics relate provisioned resources and load intensities, while requiring specific service level objectives to be fulfilled. We show how these metrics can be employed for scalability benchmarking and discuss their advantages in comparison to other metrics, used for stream processing engines and other software systems.
Cloud computing, Metrics, Scalability, Stream processing
85-88
Association for Computing Machinery
Henning, Sören
e09ef4ea-8a2f-4d11-903b-db51d6371fcb
Hasselbring, Wilhelm
ee89c5c9-a900-40b1-82c1-552268cd01bd
19 April 2021
Henning, Sören
e09ef4ea-8a2f-4d11-903b-db51d6371fcb
Hasselbring, Wilhelm
ee89c5c9-a900-40b1-82c1-552268cd01bd
Henning, Sören and Hasselbring, Wilhelm
(2021)
How to measure scalability of distributed stream processing engines?
In ICPE 2021 - Companion of the ACM/SPEC International Conference on Performance Engineering.
Association for Computing Machinery.
.
(doi:10.1145/3447545.3451190).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Scalability is promoted as a key quality feature of modern big data stream processing engines. However, even though research made huge efforts to provide precise definitions and corresponding metrics for the term scalability, experimental scalability evaluations or benchmarks of stream processing engines apply different and inconsistent metrics. With this paper, we aim to establish general metrics for scalability of stream processing engines. Derived from common definitions of scalability in cloud computing, we propose two metrics: a load capacity function and a resource demand function. Both metrics relate provisioned resources and load intensities, while requiring specific service level objectives to be fulfilled. We show how these metrics can be employed for scalability benchmarking and discuss their advantages in comparison to other metrics, used for stream processing engines and other software systems.
This record has no associated files available for download.
More information
Published date: 19 April 2021
Additional Information:
Publisher Copyright:
© 2021 Association for Computing Machinery.
Venue - Dates:
2021 ACM/SPEC International Conference on Performance Engineering, ICPE 2021, , Virtual, Online, France, 2021-04-19 - 2021-04-21
Keywords:
Cloud computing, Metrics, Scalability, Stream processing
Identifiers
Local EPrints ID: 488882
URI: http://eprints.soton.ac.uk/id/eprint/488882
PURE UUID: b7f97deb-6eea-4306-a7a7-37f360748efb
Catalogue record
Date deposited: 09 Apr 2024 10:02
Last modified: 10 Apr 2024 02:15
Export record
Altmetrics
Contributors
Author:
Sören Henning
Author:
Wilhelm Hasselbring
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics