Scalable Recommendations Using Clustering Based Collaborative Filtering

descriptionPublicationkeyboard_double_arrow_right Article 01 Dec 2019Publisher:IEEEJournal:2019 International Conference on Information Technology (ICIT)

Authors: Joydeep Das; Madhurima Banerjee; Kalyani Mali; Subhashis Majumder;

doi: 10.1109/icit48102.2019.00056

Scalable Recommendations Using Clustering Based Collaborative Filtering

- Summary
- Metrics

Abstract

The development and growth in recommender systems address the issue of information overload faced by the online users while searching for products or services. However, recommender systems typically face challenges like data sparsity and scalability as they often handle large datasets. The most widely used recommendation technique is Collaborative Filtering (CF) that pins down the recommendations on the opinions of the most similar users. The core of a CF algorithm is the similarity computations among the users or items, which becomes extremely expensive when new users and items join the system at a very rapid rate. The proposed work deals with this scalability problem by implementing a clustering based CF approach. Typically in a recommendation problem there exists a set of users, a set of items and a rating matrix, that records the ratings assigned by the users to the items. In this work, we first partition the set of users using a CURE (Clustering using representatives) based method and then leverage the resultant clusters to formulate recommendations for the target user. In the proposed method, the CF algorithm is not applied to the entire user-item database, rather the algorithm is applied separately to each of the clusters resulting in reduced recommendation time. Moreover, Clustering also helps to improve the sparsity problem by reducing the dimension of the rating matrix and filtering out noisy data. The results of the experiments conducted on MovieLens-10M and MovieLens-20M datasets indicate that our method significantly reduces the runtime and at the same time preserves good recommendation quality.

Related Organizations

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now