The University of Southampton
University of Southampton Institutional Repository

Improvement of Biomedical Dataset Search Through the Integration of Provenance (Dataset)

Improvement of Biomedical Dataset Search Through the Integration of Provenance (Dataset)
Improvement of Biomedical Dataset Search Through the Integration of Provenance (Dataset)
This dataset supports the University of Southampton doctoral thesis “Improvement of Biomedical Dataset Search Through the Integration of Provenance.” The dataset contains two files: Prompts_experiment This file contains the data presented in Chapter 6. It includes the proposed prompts for extracting provenance information from publications, along with the results of testing these prompts. Scalability_experiment This file contains the data presented in Chapter 6 relating to the scalability of the extractor. The experiment assessed scalability by evaluating the extractor’s performance as the dataset size (i.e., number of files) increased. Several experiments were conducted to measure two performance metrics: cost and response time, based on dataset size.
University of Southampton
Almuntashiri, Abdullah
aa118cfa-3b60-4717-9855-2816bbbb28d0
Almuntashiri, Abdullah
aa118cfa-3b60-4717-9855-2816bbbb28d0

Almuntashiri, Abdullah (2025) Improvement of Biomedical Dataset Search Through the Integration of Provenance (Dataset). University of Southampton doi:10.5258/SOTON/D3656 [Dataset]

Record type: Dataset

Abstract

This dataset supports the University of Southampton doctoral thesis “Improvement of Biomedical Dataset Search Through the Integration of Provenance.” The dataset contains two files: Prompts_experiment This file contains the data presented in Chapter 6. It includes the proposed prompts for extracting provenance information from publications, along with the results of testing these prompts. Scalability_experiment This file contains the data presented in Chapter 6 relating to the scalability of the extractor. The experiment assessed scalability by evaluating the extractor’s performance as the dataset size (i.e., number of files) increased. Several experiments were conducted to measure two performance metrics: cost and response time, based on dataset size.

Archive
Dataset_25.zip - Dataset
Available under License Creative Commons Attribution.
Download (459kB)
Text
ReadMe_Almuntashiri_Thesis_Dataset.txt - Dataset
Available under License Creative Commons Attribution.
Download (1kB)

More information

Published date: 2025

Identifiers

Local EPrints ID: 504390
URI: http://eprints.soton.ac.uk/id/eprint/504390
PURE UUID: b6bdfecf-3d53-42ae-9a70-e17247e17f71
ORCID for Abdullah Almuntashiri: ORCID iD orcid.org/0000-0002-7343-6468

Catalogue record

Date deposited: 08 Sep 2025 17:03
Last modified: 10 Sep 2025 10:51

Export record

Altmetrics

Contributors

Creator: Abdullah Almuntashiri ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×