The University of Southampton
University of Southampton Institutional Repository

Data mining techniques in health informatics: A case study from breast cancer research

Data mining techniques in health informatics: A case study from breast cancer research
Data mining techniques in health informatics: A case study from breast cancer research

This paper presents a case study of using data mining techniques in the analysis of diagnosis and treatment events related to Breast Cancer disease. Data from over 16,000 patients has been pre-processed and several data mining techniques have been implemented by using Weka (Waikato Environment for Knowledge Analysis). In particular, Generalized Sequential Patterns mining has been used to discover frequent patterns from disease event sequence profiles based on groups of living and deceased patients. Furthermore, five models have been evaluated in Classification with the objective to classify the patients based on selected attributes. This research showcases the data mining process and techniques to transform large amounts of patient data into useful information and potentially valuable patterns to help understand cancer outcomes.

Breast cancer datasets, Clinical data environment, Data mining techniques, Database technology, Electronic patient records, Health informatics, Knowledge discovery
0302-9743
56-70
Springer
Lu, Jing
51addc48-28e4-4a31-b68b-62d4d77c4c32
Hales, Alan
66a20906-7b0e-4d23-b65a-08932f23900b
Rew, David
36dcc3ad-2379-4b61-a468-5c623d796887
Keech, Malcolm
6aa13471-c162-40b8-9363-c85a8356331d
Fröhlingsdorf, Christian
fa759d74-4fa4-470f-a4d6-a30af3295574
Mills-Mullett, Alex
69b919b9-c8cb-4b7f-9b1c-40716cac88fa
Wette, Christian
5f041dcd-cfae-4f9d-ae7f-4ae1d37b6d2f
Elena Renda, M.
Holzinger, Andreas
Khuri, Sami
Bursa, Miroslav
Lu, Jing
51addc48-28e4-4a31-b68b-62d4d77c4c32
Hales, Alan
66a20906-7b0e-4d23-b65a-08932f23900b
Rew, David
36dcc3ad-2379-4b61-a468-5c623d796887
Keech, Malcolm
6aa13471-c162-40b8-9363-c85a8356331d
Fröhlingsdorf, Christian
fa759d74-4fa4-470f-a4d6-a30af3295574
Mills-Mullett, Alex
69b919b9-c8cb-4b7f-9b1c-40716cac88fa
Wette, Christian
5f041dcd-cfae-4f9d-ae7f-4ae1d37b6d2f
Elena Renda, M.
Holzinger, Andreas
Khuri, Sami
Bursa, Miroslav

Lu, Jing, Hales, Alan, Rew, David, Keech, Malcolm, Fröhlingsdorf, Christian, Mills-Mullett, Alex and Wette, Christian (2015) Data mining techniques in health informatics: A case study from breast cancer research. Elena Renda, M., Holzinger, Andreas, Khuri, Sami and Bursa, Miroslav (eds.) In Information Technology in Bio- and Medical Informatics - 6th International Conference, ITBAM 2015, Proceedings. vol. 9267, Springer. pp. 56-70 . (doi:10.1007/978-3-319-22741-2_6).

Record type: Conference or Workshop Item (Paper)

Abstract

This paper presents a case study of using data mining techniques in the analysis of diagnosis and treatment events related to Breast Cancer disease. Data from over 16,000 patients has been pre-processed and several data mining techniques have been implemented by using Weka (Waikato Environment for Knowledge Analysis). In particular, Generalized Sequential Patterns mining has been used to discover frequent patterns from disease event sequence profiles based on groups of living and deceased patients. Furthermore, five models have been evaluated in Classification with the objective to classify the patients based on selected attributes. This research showcases the data mining process and techniques to transform large amounts of patient data into useful information and potentially valuable patterns to help understand cancer outcomes.

This record has no associated files available for download.

More information

Published date: 2015
Additional Information: Publisher Copyright: © Springer International Publishing Switzerland 2015. Copyright: Copyright 2015 Elsevier B.V., All rights reserved.
Venue - Dates: 6th International Conference on Information Technology in Bio- and Medical Informatics, ITBAM 2015, , Valencia, Spain, 2015-09-03 - 2015-09-04
Keywords: Breast cancer datasets, Clinical data environment, Data mining techniques, Database technology, Electronic patient records, Health informatics, Knowledge discovery

Identifiers

Local EPrints ID: 447994
URI: http://eprints.soton.ac.uk/id/eprint/447994
ISSN: 0302-9743
PURE UUID: cbe29e4a-3187-4932-bb3d-2beee4af1acf
ORCID for David Rew: ORCID iD orcid.org/0000-0002-4518-2667

Catalogue record

Date deposited: 29 Mar 2021 16:38
Last modified: 17 Mar 2024 03:56

Export record

Altmetrics

Contributors

Author: Jing Lu
Author: Alan Hales
Author: David Rew ORCID iD
Author: Malcolm Keech
Author: Christian Fröhlingsdorf
Author: Alex Mills-Mullett
Author: Christian Wette
Editor: M. Elena Renda
Editor: Andreas Holzinger
Editor: Sami Khuri
Editor: Miroslav Bursa

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×