The University of Southampton
University of Southampton Institutional Repository

A glimpse into Babel: an analysis of multilinguality in Wikidata

A glimpse into Babel: an analysis of multilinguality in Wikidata
A glimpse into Babel: an analysis of multilinguality in Wikidata
Multilinguality is an important topic for knowledge bases, especially Wikidata, that was build to serve the multilingual requirements of an international community. Its labels are the way for humans to interact with the data. In this paper, we explore the state of languages in Wikidata as of now, especially in regard to its ontology, and the relationship to Wikipedia. Furthermore, we set the multilinguality of Wikidata in the context of the real world by comparing it to the distribution of native speakers. We find an existing language maldistribution, which is less urgent in the ontology, and promising results for future improvements.
Multilinguality, Wikidata, Community-driven knowledge base, Linked Data
Association for Computing Machinery
Kaffee, Lucie-Aimée
8975c12f-9033-47ed-a2eb-b674b707c2ac
Piscopo, Alessandro
0cf9852e-96f2-4658-be4d-c7a5ac330c0d
Vougiouklis, Pavlos
4cd0a8f1-c5e2-4ba2-8dcd-753db616b215
Simperl, Elena
40261ae4-c58c-48e4-b78b-5187b10e4f67
Carr, Leslie
0572b10e-039d-46c6-bf05-57cce71d3936
Pintscher, Lydia
2d332cce-ad17-4824-bc03-d3925b898ab5
Kaffee, Lucie-Aimée
8975c12f-9033-47ed-a2eb-b674b707c2ac
Piscopo, Alessandro
0cf9852e-96f2-4658-be4d-c7a5ac330c0d
Vougiouklis, Pavlos
4cd0a8f1-c5e2-4ba2-8dcd-753db616b215
Simperl, Elena
40261ae4-c58c-48e4-b78b-5187b10e4f67
Carr, Leslie
0572b10e-039d-46c6-bf05-57cce71d3936
Pintscher, Lydia
2d332cce-ad17-4824-bc03-d3925b898ab5

Kaffee, Lucie-Aimée, Piscopo, Alessandro, Vougiouklis, Pavlos, Simperl, Elena, Carr, Leslie and Pintscher, Lydia (2017) A glimpse into Babel: an analysis of multilinguality in Wikidata. In OpenSym '17 Proceedings of the 13th International Symposium on Open Collaboration. Association for Computing Machinery. 5 pp . (doi:10.1145/3125433.3125465).

Record type: Conference or Workshop Item (Paper)

Abstract

Multilinguality is an important topic for knowledge bases, especially Wikidata, that was build to serve the multilingual requirements of an international community. Its labels are the way for humans to interact with the data. In this paper, we explore the state of languages in Wikidata as of now, especially in regard to its ontology, and the relationship to Wikipedia. Furthermore, we set the multilinguality of Wikidata in the context of the real world by comparing it to the distribution of native speakers. We find an existing language maldistribution, which is less urgent in the ontology, and promising results for future improvements.

Text
Open Sym Short Paper Wikidata Multilingual - Version of Record
Available under License Creative Commons Attribution Share Alike.
Download (699kB)

More information

Accepted/In Press date: 6 July 2017
e-pub ahead of print date: 23 August 2017
Venue - Dates: OpenSym '17: The International Symposium on Open Collaboration Galway, Ireland, , Galway, Ireland, 2017-08-23 - 2017-08-25
Keywords: Multilinguality, Wikidata, Community-driven knowledge base, Linked Data

Identifiers

Local EPrints ID: 413433
URI: http://eprints.soton.ac.uk/id/eprint/413433
PURE UUID: 2d275c59-e3fc-4958-ae2b-3701398095ac
ORCID for Lucie-Aimée Kaffee: ORCID iD orcid.org/0000-0002-1514-8505
ORCID for Alessandro Piscopo: ORCID iD orcid.org/0000-0002-0362-4826
ORCID for Elena Simperl: ORCID iD orcid.org/0000-0003-1722-947X
ORCID for Leslie Carr: ORCID iD orcid.org/0000-0002-2113-9680

Catalogue record

Date deposited: 24 Aug 2017 16:30
Last modified: 16 Mar 2024 02:33

Export record

Altmetrics

Contributors

Author: Lucie-Aimée Kaffee ORCID iD
Author: Alessandro Piscopo ORCID iD
Author: Pavlos Vougiouklis
Author: Elena Simperl ORCID iD
Author: Leslie Carr ORCID iD
Author: Lydia Pintscher

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×