Knowledge representation of a multi-centre adolescent and young adult (AYA) cancer infrastructure; development of the STRONG AYA knowledge graph
Knowledge representation of a multi-centre adolescent and young adult (AYA) cancer infrastructure; development of the STRONG AYA knowledge graph
Purpose: rare diseases are difficult to fully capture, and regularly call for large, geographically dispersed initiatives. Such initiatives are often met with data harmonization challenges. These challenges render data incompatible and impede successful realization. The STRONG AYA project is such an initiative, specifically focusing on adolescent and young adult (AYAs) with cancer. STRONG AYA is setting up a federated data infrastructure containing data of varying format. Here, we elaborate on how we used health care–agnostic semantic web technologies to overcome such challenges.
Methods: we structured the STRONG AYA case-mix and core outcome measures concepts and their properties as knowledge graphs. Having identified the corresponding standard terminologies, we developed a semantic map on the basis of the knowledge graphs and the here introduced annotation helper plugin for Flyover. Flyover is a tool that converts structured data into resource description framework (RDF) triples and enables semantic interoperability. As a demonstration, we mapped data that are to be included in the STRONG AYA infrastructure.
Results: the knowledge graphs provided a comprehensive overview of the large number of STRONG AYA concepts. The semantic terminology mapping and annotation helper allowed us to query data with incomprehensible terminologies, without changing them. Both the knowledge graphs and semantic map were made available on a Hugo webpage for increased transparency and understanding.
Conclusion: the use of semantic web technologies, such as RDF and knowledge graphs, is a viable solution to overcome challenges regarding data interoperability and reusability for a federated AYA cancer data infrastructure without being bound to rigid standardized schemas. The linkage of semantically meaningful concepts to otherwise incomprehensible data elements demonstrates how by using these domain-agnostic technologies we made nonstandardized health care data interoperable.
e2500177
Cairns, Charlotte
f3dae8b8-657f-420c-ad2a-ad9c0a22fecb
Janssen, Silvie H.M.
958e8186-8f35-4d40-8926-3e283f35acb9
Way, Kirsty
f2ae4376-1fc6-443b-be33-7a642e744776
van der Graaf, Winette T.A.
ca6c9186-6bb1-4774-88cd-0fc01d6e351e
Darlington, Anne-Sophie
472fcfc9-160b-4344-8113-8dd8760ff962
Husson, Olga
7a3df44f-fbe8-4115-a20b-ed2e96b84eed
14 January 2026
Cairns, Charlotte
f3dae8b8-657f-420c-ad2a-ad9c0a22fecb
Janssen, Silvie H.M.
958e8186-8f35-4d40-8926-3e283f35acb9
Way, Kirsty
f2ae4376-1fc6-443b-be33-7a642e744776
van der Graaf, Winette T.A.
ca6c9186-6bb1-4774-88cd-0fc01d6e351e
Darlington, Anne-Sophie
472fcfc9-160b-4344-8113-8dd8760ff962
Husson, Olga
7a3df44f-fbe8-4115-a20b-ed2e96b84eed
Hoggenboom, Joshi, Gouthamchand, Varsha, Cairns, Charlotte, Janssen, Silvie H.M., Way, Kirsty, Dekker, Andre L.A.J., van der Graaf, Winette T.A., Darlington, Anne-Sophie, Husson, Olga, Wee, Leonard Y.L., van Soest, Johan and Gomes, Aiara Lobo
(2026)
Knowledge representation of a multi-centre adolescent and young adult (AYA) cancer infrastructure; development of the STRONG AYA knowledge graph.
Journal of Clinical Oncology Clinical Cancer Informatics, 10, , [e2500177].
(doi:10.1200/CCI-25-00177).
Abstract
Purpose: rare diseases are difficult to fully capture, and regularly call for large, geographically dispersed initiatives. Such initiatives are often met with data harmonization challenges. These challenges render data incompatible and impede successful realization. The STRONG AYA project is such an initiative, specifically focusing on adolescent and young adult (AYAs) with cancer. STRONG AYA is setting up a federated data infrastructure containing data of varying format. Here, we elaborate on how we used health care–agnostic semantic web technologies to overcome such challenges.
Methods: we structured the STRONG AYA case-mix and core outcome measures concepts and their properties as knowledge graphs. Having identified the corresponding standard terminologies, we developed a semantic map on the basis of the knowledge graphs and the here introduced annotation helper plugin for Flyover. Flyover is a tool that converts structured data into resource description framework (RDF) triples and enables semantic interoperability. As a demonstration, we mapped data that are to be included in the STRONG AYA infrastructure.
Results: the knowledge graphs provided a comprehensive overview of the large number of STRONG AYA concepts. The semantic terminology mapping and annotation helper allowed us to query data with incomprehensible terminologies, without changing them. Both the knowledge graphs and semantic map were made available on a Hugo webpage for increased transparency and understanding.
Conclusion: the use of semantic web technologies, such as RDF and knowledge graphs, is a viable solution to overcome challenges regarding data interoperability and reusability for a federated AYA cancer data infrastructure without being bound to rigid standardized schemas. The linkage of semantically meaningful concepts to otherwise incomprehensible data elements demonstrates how by using these domain-agnostic technologies we made nonstandardized health care data interoperable.
Text
CCI-25-00177 - REVISED - FOR PRODUCTION
- Accepted Manuscript
Text
hogenboom-et-al-2026-knowledge-representation-of-a-multicenter-adolescent-and-young-adult-cancer-infrastructure
- Version of Record
More information
Accepted/In Press date: 21 November 2025
Published date: 14 January 2026
Identifiers
Local EPrints ID: 508733
URI: http://eprints.soton.ac.uk/id/eprint/508733
PURE UUID: 46e0811e-502a-4928-81ae-da29b5de135e
Catalogue record
Date deposited: 02 Feb 2026 17:47
Last modified: 03 Feb 2026 02:44
Export record
Altmetrics
Contributors
Author:
Joshi Hoggenboom
Author:
Varsha Gouthamchand
Author:
Charlotte Cairns
Author:
Silvie H.M. Janssen
Author:
Kirsty Way
Author:
Andre L.A.J. Dekker
Author:
Winette T.A. van der Graaf
Author:
Olga Husson
Author:
Leonard Y.L. Wee
Author:
Johan van Soest
Author:
Aiara Lobo Gomes
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics