Movie Recommendation KNN
Movie Recommendation KNN
net/publication/344627182
CITATIONS READS
9 5,560
5 authors, including:
Gaurav Srivastav
Datta Meghe Institute of Medical Sciences (Deemed University)
10 PUBLICATIONS 17 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Gaurav Srivastav on 13 October 2020.
Abstract—Over the past years, the internet has broadened the Recommendation systems are primarily using three
horizon of various domains to interact and share meaningful approaches [6]. In content-based filtering, we do profiling
information. As it is said that everything has its pros and cons based on what type of content any user is interested in and
therefore, along with the expansion of domain comes
using the collected information, it recommends items.
information overload and difficulty in extraction of data. To
Another one is collaborative filtering, where we make
overcome this problem the recommendation system plays a vital
role. It is used to enhance the user experience by giving fast and clusters of similar users and use that information to make
coherent suggestions. This paper describes an approach which recommendations. Hybrid systems are the one which takes
offers generalized recommendations to every user, based on into account both above stated approaches to deal with
movie popularity and/or genre. Content-Based Recommender operational data more concisely [7]. Our goal is to provide
System is implemented using various deep learning approaches. accurate recommendations with less computational
This paper also gives an insight into problems which are faced in complexity.
content-based recommendation system and we have made an
effort to rectify them.
II. RELATED WORK
Keywords: Recommendation System, Content-Based
Recommender System, Deep learning Some of the common approaches of recommender
system are:
I. INTRODUCTION
1. Content-based filtering
Advancement in technology is reaching new heights every 2. Collaborative filtering
day and due to which we can see enormous growth in 3. Hybrid filtering
information. To deal with such large data we use machine A. Content Based Filtering
learning that automates analytical model building [1]. The
early classification of machine learning is divided into three This approach filters the items based on the likings of the
broad categories: Supervised learning, Unsupervised user. It gives result based on what the user has rated earlier.
learning and Reinforcement learning [2]. We use computers The method to model this approach is the Vector Space
to make predictions to help us achieve better results using Model (VSM). It derives the similarity of the item from its
various computational statistics. Tasks can be performed description and introduces the concept of TF-IDF (Term
without being explicitly programmed to do so [3]. It Frequency-Inverse Document Frequency) [28].
becomes a tedious task to extract the relevant information.
frequency occurrence of term t in document
Search engines solve the problem to some extent but it does Tf(t)= totalnumberoftermsindocumen 𝑡
not solve the personalization problem. Recommendation
System framework plays a vital role in today‟s internet totalnumberofdocument
surfing, be it buying a product from an e-commerce site or If(t)= 𝑙𝑜𝑔10numberofdocumentscontainingtermt
watching a movie on some video-on-demand service [4]. In
our everyday life, we depend on recommendations given by
other people either by word of mouth or reviews of general The similarity between item vectors can be computed by
surveys. People often use recommender systems over the three methods:
1. Cosine similarity
web to make decisions for the items related to their choice.
2. Euclidian distance
Recommendation systems are software tools and techniques
3. Pearson‟s correlation
whose goal is to make useful and sensible recommendations COSINE SIMILARITY
to a collection of users for items or products that might
interest them [5]. In other words, the recommender system Cosine similarity among two objects measures the angle of
or recommendation systems belongs to a class of cosine between the two objects. It compares two documents
information filtering system that aims at predicting the on a normalized scale. It can be done by finding the dot
„preference‟ or „rating‟ given to an item. product between the two identities.
Published By:
Retrieval Number: E9666069520/2020©BEIESP Blue Eyes Intelligence Engineering
DOI: 10.35940/ijeat.E9666.069520 556
& Sciences Publication
Movie Recommendation System using Cosine Similarity and KNN
IV. CONCLUSION
We have illustrated the modelling of a movie
recommendation system by making the use of content-based
filtering in the movie recommendation system. The KNN
algorithm is implemented in this model along with the
principle of cosine similarity as it gives more accuracy than
the other distance metrics and the complexity is
comparatively low too
Recommendations systems have become the most essential
fount of a relevant and reliable source of information in the
world of internet. Simple ones consider one or a few
Fig: To implement Cosine similarity we take an example parameters while the more complex ones make use of more
of 2 movies of different genre adventure and comedy parameters to filter the results and make it more user
friendly. With the inclusion of advanced deep learning and
other filtering techniques like collaborative filtering and
hybrid filtering a strong movie recommendation system can
be built. This can be a major step towards the further
development of this model as it will not only become more
efficient to use but also increase the business value even
further.
REFERENCES
1. Gediminas Adomavicius and Alexander Tuzhilin. Toward the next
generation of recommender systems: A survey of the state-of-the-art
Fig: Cosine similarity and possible extensions. Knowledge and Data Engineering, IEEE
The angle theta between the two movies will determine the Transactions on, 17(6):734–749, 2005.
similarity between the two movies. The theta ranges from 0- 2. Ricardo Baeza-Yates, Berthier Ribeiro-Neto, et al. Modern
information retrieval, volume 463. ACM Press New York, 1999
1. If the value of the theta is near 1 then it is most similar 3. ShumeetBaluja, Rohan Seth, D Sivakumar, Yushi Jing, Jay Yagnik,
and if it's near to 0 then it is least similar. The movie will be Shankar Kumar, Deepak Ravichandran, and Mohamed Aly. Video
recommended if it is close to 1 otherwise there would be no suggestion and discovery for youtube: taking random walks through
the view graph. In Proceedings of the 17th international conference on
similarity between them. It will recommend the best movies World Wide Web, pages 895–904. ACM, 2008.
to the user according to the Cosine similarity. After the 4. Xu Hailing, Wu Xiao, Li Xiaodong, and Yan Baoping.
cosine similarity, we have used a normalised popular score Comparison study of internet recommendation system. Journal of
Software, 20(2):350–362, 2009.
through which we get our function of computing distance. 5. T. E. D. Mining, “Enhancing teaching and learning through
Then by using the KNN functionality, we have found the educational data mining and learning analytics: An issue brief,” in
nearest neighbour which will be recommended to the user. Proceedings of a conference on advanced technology for
education, 2012.
6. Nakagawa and T. Ito, “An implementation of a knowledge
recommendation system based on similarity among users‟
profiles,” in Sice 2002. roceedings of the Sice Conference, 2002,
pp. 326–327 vol.1.
7. T. K. Quan, I. Fuyuki, and H. Shinichi, “Improving the accuracy
of recommender system by clustering items based on the stability
of user similarity,” in International Conference on Computational
Intelligence for Modelling Control and Automation, 2006, p. 61.
Published By:
Retrieval Number: E9666069520/2020©BEIESP Blue Eyes Intelligence Engineering
DOI: 10.35940/ijeat.E9666.069520 558
& Sciences Publication
Movie Recommendation System using Cosine Similarity and KNN