The University of Southampton
University of Southampton Institutional Repository

Characterising a grid site's traffic


Ma, Tiejun, El-khatib, Y., Mackay, M. and Edwards, C. (2010) Characterising a grid site's traffic. Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (DIDC), Chicago Ridge, United States. 21-25 Jun 2010. pp. 707-716. (doi:10.1145/1851476.1851581).

Record type: Conference or Workshop Item (Paper)

Abstract

Grid computing has been widely adopted for intensive high performance computing. Since grid resources are distributed over complex large-scale infrastructures, understanding grid site data traffic behaviour is important for efficient resource utilisation, performance optimisation, and the design of future grid sites as well as traffic-aware grid applications. In this paper, we study and analyse the traffic generated at a grid site in the Large Hadron Collider (LHC) Computing Grid (LCG). We find that most of the generated traffic is TCP-based and that a small set of grid applications generates significant amounts of the data. Upon analysing the different traffic metrics, we also find that the traffic exhibits long-range dependence and self-similarity. We further investigate packet-level metrics such as throughput, packet rate, round trip time (RTT) and packet loss. Our study establishes that these metrics can be well represented by Gaussian mixture models. The findings we present in this paper will enable accurate grid site traffic monitoring and potentially on-the-fly traffic modelling and prediction. They will also lead to a better understanding of grid site traffic behaviour and contribute to more efficient grid site planning, traffic management, data transmission protocol optimisation, and data-aware grid application design.

More information

Published date: 2010
Venue - Dates: Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (DIDC), Chicago Ridge, United States, 2010-06-21 - 2010-06-25
Keywords: grid computing, network performance, traffic modelling
Organisations: Southampton Business School

Identifiers

Local EPrints ID: 204605
URI: http://eprints.soton.ac.uk/id/eprint/204605
PURE UUID: f01ab895-eb5b-4a97-a8da-a0f749c92415

Catalogue record

Date deposited: 30 Nov 2011 14:28
Last modified: 14 Mar 2024 04:31

Contributors

Author: Tiejun Ma
Author: Y. El-khatib
Author: M. Mackay
Author: C. Edwards
