Project files: index.html, styles.css
TECHNOLOGY USED:
The code uses various machine learning, natural language processing (NLP), and data handling
techniques to build a content-based recommendation system. Here's a detailed breakdown of the
technologies and concepts used:
1. Data Handling with Pandas
 - Library Used: pandas
 - Purpose:
  - Load and manipulate structured data (CSV file) using a DataFrame.
   - Create a new feature (`combined_features`) by concatenating different columns like `genres`,
`director`, and `cast`.
 - Significance:
  - Pandas is essential for preprocessing and managing datasets in a tabular format.
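A minimal sketch of this step (the file name `movies.csv` is an assumption; the column names follow the features listed above):

```python
import pandas as pd

# Load the movie metadata into a DataFrame (file name assumed here).
movies_df = pd.read_csv("movies.csv")

# Fill missing values so string concatenation never fails, then combine
# the textual features into a single string per movie.
for col in ["genres", "director", "cast"]:
    movies_df[col] = movies_df[col].fillna("")

movies_df["combined_features"] = (
    movies_df["genres"] + " " + movies_df["director"] + " " + movies_df["cast"]
)
```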
2. Text Vectorization with TF-IDF
 - Library Used: `sklearn.feature_extraction.text.TfidfVectorizer`
 - Technique:
  - TF-IDF (Term Frequency-Inverse Document Frequency):
   - Term Frequency (TF): Measures how often a word appears in a document.
   - Inverse Document Frequency (IDF): Reduces the weight of common terms across all documents,
emphasizing unique terms.
  - Converts text data (`combined_features`) into numerical vectors.
 - Stop Words:
   - Commonly used words (e.g., "the," "and") are ignored (`stop_words='english'`) as they add no
significant value for content similarity.
 - Significance:
  - Converts raw text into a numerical format that machine learning models can process.
  - Helps capture the semantic meaning of the movie features.
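A short sketch of the vectorization step, assuming the `movies_df` DataFrame built in the previous snippet:

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# Build TF-IDF vectors from the combined text features; English stop words
# ("the", "and", ...) are dropped before weighting.
tfidf = TfidfVectorizer(stop_words="english")
tfidf_matrix = tfidf.fit_transform(movies_df["combined_features"])

# Result: a sparse matrix with one row per movie and one column per term.
print(tfidf_matrix.shape)
```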
3. Similarity Computation Using Cosine Similarity
 - Library Used: `sklearn.metrics.pairwise.linear_kernel`
 - Technique:
  - Cosine Similarity:
   - Measures the cosine of the angle between two vectors (in general it ranges from `-1` to `1`; for non-negative TF-IDF vectors it falls between `0` and `1`).
   - Higher similarity means the angle is closer to `0°` (i.e., the vectors point in the same direction).
   - Formula: `cos(A, B) = (A · B) / (||A|| * ||B||)`
   - Because `TfidfVectorizer` L2-normalizes its rows by default, the plain dot product computed by `linear_kernel` equals cosine similarity while being faster to compute.
  - Used here to calculate similarity scores between the TF-IDF vectors of all movies.
 - Significance:
  - Identifies movies that are most similar in terms of content.
  - Efficient and widely used in NLP and recommendation systems.
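Continuing the sketch above, the pairwise similarity matrix can be computed in one call (`tfidf_matrix` comes from the previous snippet):

```python
from sklearn.metrics.pairwise import linear_kernel

# Since the TF-IDF rows are L2-normalized, this dot product is the
# cosine similarity between every pair of movies.
cosine_sim = linear_kernel(tfidf_matrix, tfidf_matrix)

# cosine_sim[i, j] is the similarity between movie i and movie j;
# the diagonal entries are 1.0 (every movie is identical to itself).
print(cosine_sim.shape)
```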
4. Content-Based Recommendation System
 - Technique:
  - A content-based filtering approach is implemented:
   - Uses metadata (features like `genres`, `director`, `cast`) to find similar items.
   - No need for user interaction or feedback data (e.g., ratings or viewing history).
 - Implementation Steps:
  1. Find the index of the input movie title in the dataset.
  2. Compute similarity scores between the input movie and all others.
  3. Sort the scores and keep the top 5 most similar movies, excluding the input movie itself (see the sketch after this section).
 - Significance:
  - Provides tailored recommendations based on the movie's attributes.
  - Transparent and explainable since recommendations are based on content.
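A sketch of the recommendation function following the three steps above. It assumes the `movies_df` and `cosine_sim` objects built in the earlier snippets and that the DataFrame keeps its default RangeIndex; the name `get_recommendations` mirrors the function referenced in the next section:

```python
def get_recommendations(title, top_n=5):
    """Return the titles of the top_n movies most similar to `title`."""
    # Step 1: locate the input movie (case-insensitive title match).
    # With the default RangeIndex, index labels double as positions.
    matches = movies_df[movies_df["title"].str.lower() == title.lower()].index
    if len(matches) == 0:
        return []  # title not found in the dataset
    idx = matches[0]

    # Step 2: pair every movie index with its similarity to the input movie.
    sim_scores = list(enumerate(cosine_sim[idx]))

    # Step 3: sort by score (highest first), drop the input movie itself,
    # and keep the top_n matches.
    sim_scores.sort(key=lambda pair: pair[1], reverse=True)
    top_indices = [i for i, score in sim_scores if i != idx][:top_n]
    return movies_df["title"].iloc[top_indices].tolist()
```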
5. Python Programming
 - Concepts Used:
  - Indexing: Locate the movie's index using `movies_df[movies_df['title'].str.lower() == title.lower()].index[0]`.
  - List Comprehensions: Simplify operations like extracting movie indices.
  - Functions: Encapsulate logic in a reusable `get_recommendations` function.
 - Significance:
  - Demonstrates efficient programming practices and modular code design.
6. Scikit-learn (ML Library)
 - Library Used: `scikit-learn`
 - Components:
  - `TfidfVectorizer`: Text feature extraction.
  - `linear_kernel`: Efficient computation of cosine similarity.
 - Significance:
  - Scikit-learn provides robust tools for preprocessing, feature extraction, and similarity computation.
7. Natural Language Processing (NLP)
 - Technique:
  - Preprocessing text data by removing stop words and converting it into numerical vectors (see the short example below).
  - Leveraging TF-IDF to extract meaningful textual information.
 - Significance:
  - NLP techniques make the system capable of understanding and processing textual movie metadata.
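A toy illustration (the two example strings are made up) showing that stop words never enter the learned vocabulary:

```python
from sklearn.feature_extraction.text import TfidfVectorizer

docs = ["the action and adventure film", "the romantic comedy film"]
vectorizer = TfidfVectorizer(stop_words="english")
vectorizer.fit(docs)

# "the" and "and" are filtered out before vectorization (scikit-learn >= 1.0).
print(vectorizer.get_feature_names_out())
# ['action' 'adventure' 'comedy' 'film' 'romantic']
```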
8. Algorithm Design
 - Recommendation Logic:
  - Retrieves the top 5 most similar movies by ranking the cosine similarity scores.
  - Excludes the input movie itself from the recommendations (a usage example follows below).
 - Significance:
  - Implements a practical application of machine learning and NLP for real-world tasks.
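Putting it together, a hypothetical call to the `get_recommendations` sketch above might look like this (the title is only an example; the output depends entirely on the dataset):

```python
recommendations = get_recommendations("Inception", top_n=5)
for rank, movie in enumerate(recommendations, start=1):
    print(f"{rank}. {movie}")
```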
Why These Techniques?
- Efficiency: TF-IDF and cosine similarity are computationally efficient, even for large datasets.
- Explainability: Recommendations are based on explicit content, making the system transparent.
- Scalability: Works even when detailed user behavior data (ratings, viewing history) is unavailable, since only item metadata is required.