From GPT to Mistral: cross-domain ontology learning with NeOn-GPT
From GPT to Mistral: cross-domain ontology learning with NeOn-GPT
We present the extended NeOn-GPT pipeline, an LLM-powered, domain-agnostic ontology learning framework grounded in the NeOn methodology. The pipeline comprises two components: (i) ontology draft generation through multi-step prompting — covering requirement specification, competency questions, conceptualization, formal modeling, population, and documentation — and(ii) automated verification and repair through orchestrated calls to third-party tools complemented by LLM-suggested fixes. The extended pipeline introduces an explicit ontology reuse step to guide LLMs toward more consistent modeling decisions. We evaluate NeOn-GPT across four domains (Wine, Cheminformatics, Environmental Microbiology, and Sewer Networks) using both proprietary (GPT-4o)and open-source (Mistral, Llama-4, Deep Seek) models. Gold-standard alignment is assessed via structural metrics (class, property, and axiom profiles), lexical metrics, and semantic metrics based on sentence embeddings. Results show that LLMs consistently generate ontologies with rich relational structures and meaningful semantic alignment, with most entity and triple similarities falling in the 0.5–0.8 range. This study provides a comprehensive, cross-domain evaluation of NeOn-guided LLM ontology learning, clarifying its capabilities and limitations.
Fatallah, Nadeen
b2933cbb-b1be-45c2-933f-fa3bcddf9b4b
Das, Arunav
baaaf6ad-0797-491f-ab59-84ebe80abce4
De Giorgis, Stefano
bf5d676e-9d3a-4906-82d5-a4a87a9428da
Poltronieri, Andrea
ce05d09f-ae1f-4e6e-950b-7839b97d82cb
Haase, Peter
bbe0f4c0-06cc-4114-a837-3fa8759c0da5
Kovriguina, Liubov
15bb4836-be77-4e9f-bfc4-5425c49508f9
Simperl, Elena
68e2d4e7-e1f7-414b-b478-f8b3f7eb085e
Meroño-Peñuela, Albert
73e1e180-b51f-4c20-9186-78bc6d389f3b
Staab, Steffen
bf48d51b-bd11-4d58-8e1c-4e6e03b30c49
Algergawy, Alsayed
7eb592e4-bbc8-45e8-81f3-9ae0b8026adb
Fatallah, Nadeen
b2933cbb-b1be-45c2-933f-fa3bcddf9b4b
Das, Arunav
baaaf6ad-0797-491f-ab59-84ebe80abce4
De Giorgis, Stefano
bf5d676e-9d3a-4906-82d5-a4a87a9428da
Poltronieri, Andrea
ce05d09f-ae1f-4e6e-950b-7839b97d82cb
Haase, Peter
bbe0f4c0-06cc-4114-a837-3fa8759c0da5
Kovriguina, Liubov
15bb4836-be77-4e9f-bfc4-5425c49508f9
Simperl, Elena
68e2d4e7-e1f7-414b-b478-f8b3f7eb085e
Meroño-Peñuela, Albert
73e1e180-b51f-4c20-9186-78bc6d389f3b
Staab, Steffen
bf48d51b-bd11-4d58-8e1c-4e6e03b30c49
Algergawy, Alsayed
7eb592e4-bbc8-45e8-81f3-9ae0b8026adb
Fatallah, Nadeen, Das, Arunav, De Giorgis, Stefano, Poltronieri, Andrea, Haase, Peter, Kovriguina, Liubov, Simperl, Elena, Meroño-Peñuela, Albert, Staab, Steffen and Algergawy, Alsayed
(2026)
From GPT to Mistral: cross-domain ontology learning with NeOn-GPT.
Semantic Web.
(In Press)
Abstract
We present the extended NeOn-GPT pipeline, an LLM-powered, domain-agnostic ontology learning framework grounded in the NeOn methodology. The pipeline comprises two components: (i) ontology draft generation through multi-step prompting — covering requirement specification, competency questions, conceptualization, formal modeling, population, and documentation — and(ii) automated verification and repair through orchestrated calls to third-party tools complemented by LLM-suggested fixes. The extended pipeline introduces an explicit ontology reuse step to guide LLMs toward more consistent modeling decisions. We evaluate NeOn-GPT across four domains (Wine, Cheminformatics, Environmental Microbiology, and Sewer Networks) using both proprietary (GPT-4o)and open-source (Mistral, Llama-4, Deep Seek) models. Gold-standard alignment is assessed via structural metrics (class, property, and axiom profiles), lexical metrics, and semantic metrics based on sentence embeddings. Results show that LLMs consistently generate ontologies with rich relational structures and meaningful semantic alignment, with most entity and triple similarities falling in the 0.5–0.8 range. This study provides a comprehensive, cross-domain evaluation of NeOn-guided LLM ontology learning, clarifying its capabilities and limitations.
Text
neon_gpt (2) final
- Accepted Manuscript
More information
Accepted/In Press date: 30 April 2026
Identifiers
Local EPrints ID: 511839
URI: http://eprints.soton.ac.uk/id/eprint/511839
ISSN: 1570-0844
PURE UUID: fe8a145a-dbe5-4645-9bf7-fd1d28e0f8b0
Catalogue record
Date deposited: 08 Jun 2026 16:32
Last modified: 09 Jun 2026 01:47
Export record
Contributors
Author:
Nadeen Fatallah
Author:
Arunav Das
Author:
Stefano De Giorgis
Author:
Andrea Poltronieri
Author:
Peter Haase
Author:
Liubov Kovriguina
Author:
Elena Simperl
Author:
Albert Meroño-Peñuela
Author:
Steffen Staab
Author:
Alsayed Algergawy
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics