The University of Southampton
University of Southampton Institutional Repository

The impact of forum content on data science open innovation performance: a system dynamics-based causal machine learning approach

The impact of forum content on data science open innovation performance: a system dynamics-based causal machine learning approach
The impact of forum content on data science open innovation performance: a system dynamics-based causal machine learning approach
Open innovation in data science generally takes the form of public competitions where teams exchange messages and solutions by competing and collaborating simultaneously. Team behaviours are widely heterogeneous in terms of the performance of their solutions and the participation in knowledge creation. We present a novel research framework for open innovation by integrating system dynamics and structural topic modelling to extract open factors and adopting a machine learning-based difference-in-differences estimator to understand the impact of team behaviour on their performance using data from Kaggle's competition. Our results identify four team behaviour categories—active, learner, lurker, and passive— in data science open innovation competitions which depend on the performance of their solutions and actions related to posting and reading messages in the forum. Furthermore, the activities of model evaluation, community support, and business understanding are the top three most positive and significant factors affecting team performance. Our research contributes to the literature by highlighting the value of forum feedback and exploring the data science activities in the forum discussion, in relation to innovation performance, to enrich the empirical understanding of open innovation. Research implications for researchers and practitioners participating in, organising, and supporting data science open innovation activities are provided.
Causal machine learning, Open innovation, System dynamics, Topic modelling
0040-1625
Li, Libo
838dda30-da62-41ad-b57c-bee6ad59acd3
Yu, Huan
071c97e4-f277-4fdf-a6f8-e3fe25f98769
Kunc, Martin
0b254052-f9f5-49f9-ac0b-148c257ba412
Li, Libo
838dda30-da62-41ad-b57c-bee6ad59acd3
Yu, Huan
071c97e4-f277-4fdf-a6f8-e3fe25f98769
Kunc, Martin
0b254052-f9f5-49f9-ac0b-148c257ba412

Li, Libo, Yu, Huan and Kunc, Martin (2024) The impact of forum content on data science open innovation performance: a system dynamics-based causal machine learning approach. Technological Forecasting and Social Change, 198, [122936]. (doi:10.1016/j.techfore.2023.122936).

Record type: Article

Abstract

Open innovation in data science generally takes the form of public competitions where teams exchange messages and solutions by competing and collaborating simultaneously. Team behaviours are widely heterogeneous in terms of the performance of their solutions and the participation in knowledge creation. We present a novel research framework for open innovation by integrating system dynamics and structural topic modelling to extract open factors and adopting a machine learning-based difference-in-differences estimator to understand the impact of team behaviour on their performance using data from Kaggle's competition. Our results identify four team behaviour categories—active, learner, lurker, and passive— in data science open innovation competitions which depend on the performance of their solutions and actions related to posting and reading messages in the forum. Furthermore, the activities of model evaluation, community support, and business understanding are the top three most positive and significant factors affecting team performance. Our research contributes to the literature by highlighting the value of forum feedback and exploring the data science activities in the forum discussion, in relation to innovation performance, to enrich the empirical understanding of open innovation. Research implications for researchers and practitioners participating in, organising, and supporting data science open innovation activities are provided.

Text
1-s2.0-S0040162523006212-main - Version of Record
Available under License Creative Commons Attribution.
Download (7MB)

More information

Accepted/In Press date: 17 October 2023
e-pub ahead of print date: 16 November 2023
Published date: January 2024
Additional Information: Funding Information: We thank editors for handling our manuscript. We also thank peer reviewers for their invaluable suggestions. Publisher Copyright: © 2023 The Authors
Keywords: Causal machine learning, Open innovation, System dynamics, Topic modelling

Identifiers

Local EPrints ID: 484587
URI: http://eprints.soton.ac.uk/id/eprint/484587
ISSN: 0040-1625
PURE UUID: b23c12c6-1f59-4879-9f0c-b9661cad5824
ORCID for Libo Li: ORCID iD orcid.org/0000-0003-1658-5157
ORCID for Huan Yu: ORCID iD orcid.org/0000-0003-1214-8478
ORCID for Martin Kunc: ORCID iD orcid.org/0000-0002-3411-4052

Catalogue record

Date deposited: 17 Nov 2023 17:53
Last modified: 18 Mar 2024 03:56

Export record

Altmetrics

Contributors

Author: Libo Li ORCID iD
Author: Huan Yu ORCID iD
Author: Martin Kunc ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×