An Empirical Study of Programming Languages in Open-Source Software Projects based on Mining Software Repositories
An Empirical Study of Programming Languages in Open-Source Software Projects based on Mining Software Repositories
There are dozens of programming languages in use today, and new languages and language features are being introduced frequently. However, there are only a few empirical studies on the usage and practice of programming languages. In this research we explored languages from an empirical/pragmatic perspective to address their association
with open-source software (OSS) projects and practices. The research was conducted in a comparative setting to investigate whether a significant association exists. That is, a comparison was made between languages both individually and in groups to understand similarities and examine differences, if any, in popularity and user adoption, feature usage, and OSS project attributes. The methodology was based on mining software repositories, and the results obtained from an analysis of possibly the largest open-source
dataset (a sample of 5,350 projects from a total of 15,000 projects), where a main language was identified. The investigation revealed that a considerable association exists; however, the effect size of such association was modest. When accounting for confounding factors such as project size and type, the findings held only in a small number of the tested cases. Thus, the choice of language has a limited effect on OSS development.
University of Southampton
Altherwi, Muna
03ff3298-0612-42d3-9658-545617776d13
Altherwi, Muna
03ff3298-0612-42d3-9658-545617776d13
Gravell, Andrew
f3a261c5-f057-4b5f-b6ac-c1ca37d72749
Altherwi, Muna
(2021)
An Empirical Study of Programming Languages in Open-Source Software Projects based on Mining Software Repositories.
University of Southampton, Doctoral Thesis, 139pp.
Record type:
Thesis
(Doctoral)
Abstract
There are dozens of programming languages in use today, and new languages and language features are being introduced frequently. However, there are only a few empirical studies on the usage and practice of programming languages. In this research we explored languages from an empirical/pragmatic perspective to address their association
with open-source software (OSS) projects and practices. The research was conducted in a comparative setting to investigate whether a significant association exists. That is, a comparison was made between languages both individually and in groups to understand similarities and examine differences, if any, in popularity and user adoption, feature usage, and OSS project attributes. The methodology was based on mining software repositories, and the results obtained from an analysis of possibly the largest open-source
dataset (a sample of 5,350 projects from a total of 15,000 projects), where a main language was identified. The investigation revealed that a considerable association exists; however, the effect size of such association was modest. When accounting for confounding factors such as project size and type, the findings held only in a small number of the tested cases. Thus, the choice of language has a limited effect on OSS development.
Text
MunaAltherwi_PhD_CyberPhysicalSystemsECS_051021 (1)
- Version of Record
Text
Muna Altherwi Permission To Deposit Thesis
Restricted to Repository staff only
More information
Submitted date: 27 August 2021
Identifiers
Local EPrints ID: 456713
URI: http://eprints.soton.ac.uk/id/eprint/456713
PURE UUID: dd884808-c94e-4c4b-b3d3-874dc6780ab1
Catalogue record
Date deposited: 10 May 2022 16:30
Last modified: 17 Mar 2024 07:18
Export record
Contributors
Author:
Muna Altherwi
Thesis advisor:
Andrew Gravell
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics