Characterising a grid site's traffic
Characterising a grid site's traffic
Grid computing has been widely adopted for intensive high performance computing. Since grid resources are distributed over complex large-scale infrastructures, understanding grid site data traffic behaviour is important for efficient resource utilisation, ...
Grid computing has been widely adopted for intensive high performance computing. Since grid resources are distributed over complex large-scale infrastructures, understanding grid site data traffic behaviour is important for efficient resource utilisation, performance optimisation, and the design of future grid sites as well as traffic-aware grid applications. In this paper, we study and analyse the traffic generated at a grid site in the Large Hadron Collider (LHC) Computing Grid (LCG). We find that most of the generated traffic is TCP-based and that a small set of grid applications generate significant amounts of the data. Upon analysing the different traffic metrics, we also find that the traffic exhibits long-range dependence and self-similarity. We also investigate packet-level metrics such as throughput, packet rate, round trip time (RTT) and packet loss. Our study establishes that these metrics can be well represented by Gaussian mixture models. The findings we present in this paper will enable accurate grid site traffic monitoring and potentially on-the-fly traffic modelling and prediction. It will also lead to a better understanding of grid site's traffic behaviour and contribute to more efficient grid site planning, traffic management, data transmission protocol optimisation, and data-aware grid application design.
grid computing, network performance, traffic modelling
707-716
Ma, Tiejun
1f591849-f17c-4209-9f42-e6587b499bae
El-khatib, Y.
49c0e92c-3ca2-40f5-ac0e-692990d7da85
Mackay, M.
f93eaf5d-ba4f-4b23-a48e-55bc5db25c7f
Edwards, C.
d2633d87-4f9f-42a4-ab6d-162894a5f9b8
2010
Ma, Tiejun
1f591849-f17c-4209-9f42-e6587b499bae
El-khatib, Y.
49c0e92c-3ca2-40f5-ac0e-692990d7da85
Mackay, M.
f93eaf5d-ba4f-4b23-a48e-55bc5db25c7f
Edwards, C.
d2633d87-4f9f-42a4-ab6d-162894a5f9b8
Ma, Tiejun, El-khatib, Y., Mackay, M. and Edwards, C.
(2010)
Characterising a grid site's traffic.
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (DIDC), Chicago Ridge, United States.
21 - 25 Jun 2010.
.
(doi:10.1145/1851476.1851581).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Grid computing has been widely adopted for intensive high performance computing. Since grid resources are distributed over complex large-scale infrastructures, understanding grid site data traffic behaviour is important for efficient resource utilisation, ...
Grid computing has been widely adopted for intensive high performance computing. Since grid resources are distributed over complex large-scale infrastructures, understanding grid site data traffic behaviour is important for efficient resource utilisation, performance optimisation, and the design of future grid sites as well as traffic-aware grid applications. In this paper, we study and analyse the traffic generated at a grid site in the Large Hadron Collider (LHC) Computing Grid (LCG). We find that most of the generated traffic is TCP-based and that a small set of grid applications generate significant amounts of the data. Upon analysing the different traffic metrics, we also find that the traffic exhibits long-range dependence and self-similarity. We also investigate packet-level metrics such as throughput, packet rate, round trip time (RTT) and packet loss. Our study establishes that these metrics can be well represented by Gaussian mixture models. The findings we present in this paper will enable accurate grid site traffic monitoring and potentially on-the-fly traffic modelling and prediction. It will also lead to a better understanding of grid site's traffic behaviour and contribute to more efficient grid site planning, traffic management, data transmission protocol optimisation, and data-aware grid application design.
This record has no associated files available for download.
More information
Published date: 2010
Venue - Dates:
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (DIDC), Chicago Ridge, United States, 2010-06-21 - 2010-06-25
Keywords:
grid computing, network performance, traffic modelling
Organisations:
Southampton Business School
Identifiers
Local EPrints ID: 204605
URI: http://eprints.soton.ac.uk/id/eprint/204605
PURE UUID: f01ab895-eb5b-4a97-a8da-a0f749c92415
Catalogue record
Date deposited: 30 Nov 2011 14:28
Last modified: 14 Mar 2024 04:31
Export record
Altmetrics
Contributors
Author:
Y. El-khatib
Author:
M. Mackay
Author:
C. Edwards
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics