The University of Southampton
University of Southampton Institutional Repository

Knowledge representation of a multi-centre adolescent and young adult (AYA) cancer infrastructure; development of the STRONG AYA knowledge graph

Knowledge representation of a multi-centre adolescent and young adult (AYA) cancer infrastructure; development of the STRONG AYA knowledge graph
Knowledge representation of a multi-centre adolescent and young adult (AYA) cancer infrastructure; development of the STRONG AYA knowledge graph
Purpose: rare diseases are difficult to fully capture, and regularly call for large, geographically dispersed initiatives. Such initiatives are often met with data harmonization challenges. These challenges render data incompatible and impede successful realization. The STRONG AYA project is such an initiative, specifically focusing on adolescent and young adult (AYAs) with cancer. STRONG AYA is setting up a federated data infrastructure containing data of varying format. Here, we elaborate on how we used health care–agnostic semantic web technologies to overcome such challenges.

Methods: we structured the STRONG AYA case-mix and core outcome measures concepts and their properties as knowledge graphs. Having identified the corresponding standard terminologies, we developed a semantic map on the basis of the knowledge graphs and the here introduced annotation helper plugin for Flyover. Flyover is a tool that converts structured data into resource description framework (RDF) triples and enables semantic interoperability. As a demonstration, we mapped data that are to be included in the STRONG AYA infrastructure.

Results: the knowledge graphs provided a comprehensive overview of the large number of STRONG AYA concepts. The semantic terminology mapping and annotation helper allowed us to query data with incomprehensible terminologies, without changing them. Both the knowledge graphs and semantic map were made available on a Hugo webpage for increased transparency and understanding.

Conclusion: the use of semantic web technologies, such as RDF and knowledge graphs, is a viable solution to overcome challenges regarding data interoperability and reusability for a federated AYA cancer data infrastructure without being bound to rigid standardized schemas. The linkage of semantically meaningful concepts to otherwise incomprehensible data elements demonstrates how by using these domain-agnostic technologies we made nonstandardized health care data interoperable.
e2500177
Hoggenboom, Joshi
Gouthamchand, Varsha
Cairns, Charlotte
f3dae8b8-657f-420c-ad2a-ad9c0a22fecb
Janssen, Silvie H.M.
958e8186-8f35-4d40-8926-3e283f35acb9
Way, Kirsty
f2ae4376-1fc6-443b-be33-7a642e744776
Dekker, Andre L.A.J.
van der Graaf, Winette T.A.
ca6c9186-6bb1-4774-88cd-0fc01d6e351e
Darlington, Anne-Sophie
472fcfc9-160b-4344-8113-8dd8760ff962
Husson, Olga
7a3df44f-fbe8-4115-a20b-ed2e96b84eed
Wee, Leonard Y.L.
van Soest, Johan
Gomes, Aiara Lobo
Hoggenboom, Joshi
Gouthamchand, Varsha
Cairns, Charlotte
f3dae8b8-657f-420c-ad2a-ad9c0a22fecb
Janssen, Silvie H.M.
958e8186-8f35-4d40-8926-3e283f35acb9
Way, Kirsty
f2ae4376-1fc6-443b-be33-7a642e744776
Dekker, Andre L.A.J.
van der Graaf, Winette T.A.
ca6c9186-6bb1-4774-88cd-0fc01d6e351e
Darlington, Anne-Sophie
472fcfc9-160b-4344-8113-8dd8760ff962
Husson, Olga
7a3df44f-fbe8-4115-a20b-ed2e96b84eed
Wee, Leonard Y.L.
van Soest, Johan
Gomes, Aiara Lobo

Hoggenboom, Joshi, Gouthamchand, Varsha, Cairns, Charlotte, Janssen, Silvie H.M., Way, Kirsty, Dekker, Andre L.A.J., van der Graaf, Winette T.A., Darlington, Anne-Sophie, Husson, Olga, Wee, Leonard Y.L., van Soest, Johan and Gomes, Aiara Lobo (2026) Knowledge representation of a multi-centre adolescent and young adult (AYA) cancer infrastructure; development of the STRONG AYA knowledge graph. Journal of Clinical Oncology Clinical Cancer Informatics, 10, e2500177, [e2500177]. (doi:10.1200/CCI-25-00177).

Record type: Article

Abstract

Purpose: rare diseases are difficult to fully capture, and regularly call for large, geographically dispersed initiatives. Such initiatives are often met with data harmonization challenges. These challenges render data incompatible and impede successful realization. The STRONG AYA project is such an initiative, specifically focusing on adolescent and young adult (AYAs) with cancer. STRONG AYA is setting up a federated data infrastructure containing data of varying format. Here, we elaborate on how we used health care–agnostic semantic web technologies to overcome such challenges.

Methods: we structured the STRONG AYA case-mix and core outcome measures concepts and their properties as knowledge graphs. Having identified the corresponding standard terminologies, we developed a semantic map on the basis of the knowledge graphs and the here introduced annotation helper plugin for Flyover. Flyover is a tool that converts structured data into resource description framework (RDF) triples and enables semantic interoperability. As a demonstration, we mapped data that are to be included in the STRONG AYA infrastructure.

Results: the knowledge graphs provided a comprehensive overview of the large number of STRONG AYA concepts. The semantic terminology mapping and annotation helper allowed us to query data with incomprehensible terminologies, without changing them. Both the knowledge graphs and semantic map were made available on a Hugo webpage for increased transparency and understanding.

Conclusion: the use of semantic web technologies, such as RDF and knowledge graphs, is a viable solution to overcome challenges regarding data interoperability and reusability for a federated AYA cancer data infrastructure without being bound to rigid standardized schemas. The linkage of semantically meaningful concepts to otherwise incomprehensible data elements demonstrates how by using these domain-agnostic technologies we made nonstandardized health care data interoperable.

Text
CCI-25-00177 - REVISED - FOR PRODUCTION - Accepted Manuscript
Download (41kB)
Text
hogenboom-et-al-2026-knowledge-representation-of-a-multicenter-adolescent-and-young-adult-cancer-infrastructure - Version of Record
Download (2MB)

More information

Accepted/In Press date: 21 November 2025
Published date: 14 January 2026

Identifiers

Local EPrints ID: 508733
URI: http://eprints.soton.ac.uk/id/eprint/508733
PURE UUID: 46e0811e-502a-4928-81ae-da29b5de135e
ORCID for Anne-Sophie Darlington: ORCID iD orcid.org/0000-0003-4387-7278

Catalogue record

Date deposited: 02 Feb 2026 17:47
Last modified: 03 Feb 2026 02:44

Export record

Altmetrics

Contributors

Author: Joshi Hoggenboom
Author: Varsha Gouthamchand
Author: Charlotte Cairns
Author: Silvie H.M. Janssen
Author: Kirsty Way
Author: Andre L.A.J. Dekker
Author: Winette T.A. van der Graaf
Author: Olga Husson
Author: Leonard Y.L. Wee
Author: Johan van Soest
Author: Aiara Lobo Gomes

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×