The University of Southampton
University of Southampton Institutional Repository

Novel centroid selection approaches for KMeans-clustering based recommender systems


Zahra, Sobia, Ghazanfar, Mustansar Ali, Khalid, Asra, Azam, Muhammad Awais, Naeem, Usman and Prugel-Bennett, Adam (2015) Novel centroid selection approaches for KMeans-clustering based recommender systems. Information Sciences, 320, 156-189. (doi:10.1016/j.ins.2015.03.062).

Record type: Article

Abstract

Recommender systems filter unseen information to predict whether a particular user would prefer a given item. Over the years, this process has depended on data mining and machine learning techniques, which are known to have scalability issues when applied to recommender systems. In this paper, we propose a k-means clustering-based recommendation algorithm that addresses the scalability issues associated with traditional recommender systems. An issue with traditional k-means clustering algorithms is that they choose the initial k centroids randomly, which leads to inaccurate recommendations and increased cost for offline training of clusters. The work in this paper highlights how centroid selection in k-means based recommender systems can improve performance and reduce cost. The proposed centroid selection method exploits underlying data correlation structures and has been shown to achieve better accuracy and performance than traditional centroid selection strategies, which choose centroids randomly. The proposed approach has been validated with an extensive set of experiments on five datasets (from the movie, book, and music domains). These experiments show that the proposed approach produces better-quality clusters and converges faster than existing approaches, which in turn improves the accuracy of the recommendations provided.
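
The effect of seed choice described in the abstract can be illustrated with a minimal sketch. It is not the centroid selection method proposed in the paper, which the abstract does not specify. It compares uniformly random seeding with a k-means++-style distance-weighted seeding, used here only as a well-known stand-in. All names (`random_seeds`, `spread_seeds`, `kmeans`) and the toy rating profiles are hypothetical.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def random_seeds(points, k, rng):
    """Baseline: pick k distinct points uniformly at random."""
    return [list(p) for p in rng.sample(points, k)]


def spread_seeds(points, k, rng):
    """k-means++-style stand-in (assumption, not the paper's method):
    each new seed is drawn with probability proportional to its squared
    distance from the nearest seed chosen so far."""
    seeds = [list(rng.choice(points))]
    while len(seeds) < k:
        weights = [min(squared_distance(p, s) for s in seeds) for p in points]
        if sum(weights) == 0:  # every point coincides with a seed
            seeds.append(list(rng.choice(points)))
        else:
            seeds.append(list(rng.choices(points, weights=weights, k=1)[0]))
    return seeds


def kmeans(points, seeds, max_iter=100):
    """Lloyd's algorithm; returns centroids, labels, iterations used, and SSE."""
    centroids = [list(s) for s in seeds]
    labels = None
    for iteration in range(1, max_iter + 1):
        # Assignment step: attach every point to its nearest centroid.
        new_labels = [
            min(range(len(centroids)),
                key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:  # assignments stable, so converged
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    sse = sum(squared_distance(p, centroids[lab])
              for p, lab in zip(points, labels))
    return centroids, labels, iteration, sse


# Toy user-by-item rating vectors: three taste profiles plus small noise.
noise = random.Random(0)
profiles = [[5, 5, 1, 1], [1, 1, 5, 5], [3, 3, 3, 3]]
users = [[r + noise.uniform(-0.5, 0.5) for r in profile]
         for profile in profiles for _ in range(20)]

for name, seeder in [("random", random_seeds), ("spread", spread_seeds)]:
    seeds = seeder(users, 3, random.Random(1))
    centroids, labels, iterations, sse = kmeans(users, seeds)
    print(f"{name:>6}: iterations={iterations}, SSE={sse:.2f}")
```

The iteration count and the sum of squared errors (SSE) printed for each strategy are rough proxies for the convergence speed and cluster quality that the abstract refers to. A single toy run does not reproduce the paper's experiments.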

Text
Zahra_Novel.pdf - Accepted Manuscript
Download (1MB)

More information

Accepted/In Press date: 27 March 2015
e-pub ahead of print date: 8 May 2015
Published date: 1 November 2015
Keywords: recommender systems, collaborative filtering, k-means clustering, centroid (seed) selection in k-means clustering
Organisations: Electronics & Computer Science

Identifiers

Local EPrints ID: 376972
URI: http://eprints.soton.ac.uk/id/eprint/376972
ISSN: 0020-0255
PURE UUID: 070c6980-e17d-49f4-b7c2-d07a79bec592

Catalogue record

Date deposited: 13 May 2015 10:58
Last modified: 15 Mar 2024 05:16

Contributors

Author: Sobia Zahra
Author: Mustansar Ali Ghazanfar
Author: Asra Khalid
Author: Muhammad Awais Azam
Author: Usman Naeem
Author: Adam Prugel-Bennett
