A glimpse into Babel: an analysis of multilinguality in Wikidata
Association for Computing Machinery
Kaffee, Lucie-Aimée
8975c12f-9033-47ed-a2eb-b674b707c2ac
Piscopo, Alessandro
0cf9852e-96f2-4658-be4d-c7a5ac330c0d
Vougiouklis, Pavlos
4cd0a8f1-c5e2-4ba2-8dcd-753db616b215
Simperl, Elena
40261ae4-c58c-48e4-b78b-5187b10e4f67
Carr, Leslie
0572b10e-039d-46c6-bf05-57cce71d3936
Pintscher, Lydia
2d332cce-ad17-4824-bc03-d3925b898ab5
Kaffee, Lucie-Aimée, Piscopo, Alessandro, Vougiouklis, Pavlos, Simperl, Elena, Carr, Leslie and Pintscher, Lydia (2017) A glimpse into Babel: an analysis of multilinguality in Wikidata. In OpenSym '17 Proceedings of the 13th International Symposium on Open Collaboration. Association for Computing Machinery. 5 pp. (doi:10.1145/3125433.3125465).
Record type: Conference or Workshop Item (Paper)
Abstract
Multilinguality is an important topic for knowledge bases, especially Wikidata, which was built to serve the multilingual requirements of an international community. Its labels are the way for humans to interact with the data. In this paper, we explore the current state of languages in Wikidata, especially with regard to its ontology, and its relationship to Wikipedia. Furthermore, we set the multilinguality of Wikidata in the context of the real world by comparing it to the distribution of native speakers. We find an existing language maldistribution, which is less urgent in the ontology, and promising results for future improvements.
Text
Open Sym Short Paper Wikidata Multilingual
- Version of Record
More information
Accepted/In Press date: 6 July 2017
e-pub ahead of print date: 23 August 2017
Venue - Dates:
OpenSym '17: The International Symposium on Open Collaboration, Galway, Ireland, 2017-08-23 - 2017-08-25
Keywords:
Multilinguality, Wikidata, Community-driven knowledge base, Linked Data
Identifiers
Local EPrints ID: 413433
URI: http://eprints.soton.ac.uk/id/eprint/413433
PURE UUID: 2d275c59-e3fc-4958-ae2b-3701398095ac
Catalogue record
Date deposited: 24 Aug 2017 16:30
Last modified: 16 Mar 2024 02:33
Contributors
Author:
Lucie-Aimée Kaffee
Author:
Alessandro Piscopo
Author:
Pavlos Vougiouklis
Author:
Lydia Pintscher