The impact of forum content on data science open innovation performance: a system dynamics-based causal machine learning approach
The impact of forum content on data science open innovation performance: a system dynamics-based causal machine learning approach
Open innovation in data science generally takes the form of public competitions where teams exchange messages and solutions by competing and collaborating simultaneously. Team behaviours are widely heterogeneous in terms of the performance of their solutions and the participation in knowledge creation. We present a novel research framework for open innovation by integrating system dynamics and structural topic modelling to extract open factors and adopting a machine learning-based difference-in-differences estimator to understand the impact of team behaviour on their performance using data from Kaggle's competition. Our results identify four team behaviour categories—active, learner, lurker, and passive— in data science open innovation competitions which depend on the performance of their solutions and actions related to posting and reading messages in the forum. Furthermore, the activities of model evaluation, community support, and business understanding are the top three most positive and significant factors affecting team performance. Our research contributes to the literature by highlighting the value of forum feedback and exploring the data science activities in the forum discussion, in relation to innovation performance, to enrich the empirical understanding of open innovation. Research implications for researchers and practitioners participating in, organising, and supporting data science open innovation activities are provided.
Causal machine learning, Open innovation, System dynamics, Topic modelling
Li, Libo
838dda30-da62-41ad-b57c-bee6ad59acd3
Yu, Huan
071c97e4-f277-4fdf-a6f8-e3fe25f98769
Kunc, Martin
0b254052-f9f5-49f9-ac0b-148c257ba412
January 2024
Li, Libo
838dda30-da62-41ad-b57c-bee6ad59acd3
Yu, Huan
071c97e4-f277-4fdf-a6f8-e3fe25f98769
Kunc, Martin
0b254052-f9f5-49f9-ac0b-148c257ba412
Li, Libo, Yu, Huan and Kunc, Martin
(2024)
The impact of forum content on data science open innovation performance: a system dynamics-based causal machine learning approach.
Technological Forecasting and Social Change, 198, [122936].
(doi:10.1016/j.techfore.2023.122936).
Abstract
Open innovation in data science generally takes the form of public competitions where teams exchange messages and solutions by competing and collaborating simultaneously. Team behaviours are widely heterogeneous in terms of the performance of their solutions and the participation in knowledge creation. We present a novel research framework for open innovation by integrating system dynamics and structural topic modelling to extract open factors and adopting a machine learning-based difference-in-differences estimator to understand the impact of team behaviour on their performance using data from Kaggle's competition. Our results identify four team behaviour categories—active, learner, lurker, and passive— in data science open innovation competitions which depend on the performance of their solutions and actions related to posting and reading messages in the forum. Furthermore, the activities of model evaluation, community support, and business understanding are the top three most positive and significant factors affecting team performance. Our research contributes to the literature by highlighting the value of forum feedback and exploring the data science activities in the forum discussion, in relation to innovation performance, to enrich the empirical understanding of open innovation. Research implications for researchers and practitioners participating in, organising, and supporting data science open innovation activities are provided.
Text
1-s2.0-S0040162523006212-main
- Version of Record
More information
Accepted/In Press date: 17 October 2023
e-pub ahead of print date: 16 November 2023
Published date: January 2024
Additional Information:
Funding Information:
We thank editors for handling our manuscript. We also thank peer reviewers for their invaluable suggestions.
Publisher Copyright:
© 2023 The Authors
Keywords:
Causal machine learning, Open innovation, System dynamics, Topic modelling
Identifiers
Local EPrints ID: 484587
URI: http://eprints.soton.ac.uk/id/eprint/484587
ISSN: 0040-1625
PURE UUID: b23c12c6-1f59-4879-9f0c-b9661cad5824
Catalogue record
Date deposited: 17 Nov 2023 17:53
Last modified: 18 Mar 2024 03:56
Export record
Altmetrics
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics