Build software better, together

ContextLab / hypertools

A Python toolbox for gaining geometric insights into high-dimensional data

visualization python time-series data-visualization high-dimensional-data topic-modeling data-wrangling text-vectorization

Updated Jul 10, 2025
Python

kulwinderkk / recipe_recommender_nlp

Star

This project is an unsupervised NLP-based recipe recommender system designed to provide personalized recipe suggestions. The system employs content-based filtering techniques, utilizing cosine similarity to measure the resemblance between user inputs and a database of recipes.

nlp word-embeddings nltk cosine-similarity lda text-vectorization gensim-word2vec tfidf-vectorizer gensim-topic-modeling

Updated Sep 7, 2023
Jupyter Notebook

mansipatel2508 / Yelp-Review-Stars-Prediction-with-Machine-Learning

Star

The project has text vectorization, handling big data with merging and cleaning the text and getting the required columns while boosting the performance by feature extraction and parameter tuning for NN, compares the Performances through applied different models treating the problem as classification and regression both.

Updated Aug 9, 2019
Jupyter Notebook

amansrivastava17 / bns-short-text-similarity

Star

📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.

nlp text-classification text-similarity term-frequency tf-idf cosine-similarity bns text-vectorization short-text-semantic-similarity bns-vectorizer

Updated Aug 21, 2018
Python

Rishabbh-Sahu / information_retrieval

Star

Given a document, identifying the closest documents within the list of documents using tf-idf matrix and cosine similarity

information-retrieval matrix-multiplication similarity-search text-vectorization root-cause-analysis tfidf-vectorizer similar-patterns lookalike-queries

Updated Aug 24, 2022
Python

Ganesh2409 / Course-Recommendation-System

Star

🚀 Course Recommendation System is a machine learning-powered web application designed to recommend similar courses from Coursera's vast dataset of over 3,000 courses. Built using Python, Scikit-learn, and Streamlit, the app preprocesses course data, applies text vectorization, and leverages cosine similarity to offer personalized recommendations.

python nlp docker data-science machine-learning recommendation-system cosine-similarity text-vectorization streamlit-webapp

Updated Oct 10, 2024
Jupyter Notebook

Minku-Koo / Comment-Sentiment-Analysis

Star

Comment Sentiment Analysis using Deep Learning

python deep-learning sentiment-analysis keras selenium religion text-vectorization covid-19

Updated Sep 14, 2021
Python

mkearney / wactor

Star

Word Factor Vectors

r text-classification text word2vec word-embeddings rstats text-processing r-package text-vectorization word-vectors

Updated Dec 13, 2019
R

rid17pawar / Sentiment-Analysis-Model-Experiments

Star

Experiments in the field of Sentiment Analysis using ML Algorithms namely Logistic Regression, Naive Bayes along with tfidf, one hot encoding, bag of words vectorization. Different MLP and RNN models viz. LSTM, GRU, Bidirectional LSTM. Lastly, state of the art BERT model

sentiment-analysis naive-bayes lstm gru neural-networks rnn bag-of-words logistic-regression twitter-sentiment-analysis tfidf bert sentiment-classification bidirectional-lstm text-vectorization ml-algorithms tfidf-vectorizer transformer-architecture one-hot-encoding

Updated Jul 2, 2023
Jupyter Notebook

IanCarmona / Recommendation-Songs-Taylor-Swift

Star

This program is a project carried out in the Natural Language Processing course, which is a Taylor Swift song recommender. It utilizes topics such as sentiment analysis in texts, text vectorization, and the removal of stopwords.

natural-language-processing sentiment-analysis stopwords sentiment-classification text-vectorization

Updated Feb 12, 2024
Python

andreytsimbalov / News_Classification_and_Vectorization

Star

Evaluation of the accuracy of vectorization and text classification methods

nlp text-classification transformers text-vectorization

Updated Aug 5, 2021
Jupyter Notebook

SarangGami / Topic-modeling-on-News-Articles-Unsupervised-Learning

Star

In this project, task involves analyzing the content of the articles to extract key concepts and themes that are discussed across the articles to identify major themes/topics across a collection of BBC news articles.

spacy nltk topic-modeling gensim bag-of-words tf-idf latent-dirichlet-allocation nlp-machine-learning latent-semantic-analysis text-vectorization text-preprocessing

Updated Mar 31, 2023
Jupyter Notebook

rosette-api-community / visualize-embeddings

Star

A simple Python script for transforming a corpus of documents into text vectors suitable for visualization

visualization python nlp tsv machine-learning natural-language-processing text-embedding text-vectorization

Updated Mar 16, 2017
Python

ni3choudhary / Toxic-Comment-Classifier

Star

A DL project that helps in classifying Toxic Comment weather it is positive or not.

python flask deep-neural-networks deep-learning tensorflow text model cnn text-vectorization toxic-comment-classification toxic-comments mcauc

Updated Jul 25, 2022
Jupyter Notebook

KaavyaRekanar / Master_Thesis

Star

Text Classification of Legitimate and Rogue Online Privacy Policies: A manual analysis and an experimental procedure

Updated Oct 14, 2024
Java

nikhil1209ui / movie_recommender

Star

Movie Recommender based on Content based filtering.

python exploratory-data-analysis deployments data-collection web-hosting model-building text-vectorization feature-selection-and-engineering

Updated Nov 9, 2024
Jupyter Notebook

MaryvilleUniversity-AI / job-matcher

Star

Resume Matcher: A Streamlit app that compares resumes with job descriptions, highlights matched and missing skills, and calculates text similarity and skill coverage.

python data-science machine-learning natural-language-processing text-similarity cosine-similarity resume-parser text-vectorization job-matcher streamlit career-tools skills-extraction

Updated Dec 16, 2025
Python

vladimiralbrekhtccr / topic_modeling_top2vec_scientific-texts

Star

A diploma project focused on vectorizing scientific texts using the Top2Vec algorithm, with the aim of analyzing thematic groups, identifying trends, and visualizing the dynamics of interest in various topics in the field of computer science.

computer-science topic-modeling text-vectorization science-article

Updated Jun 12, 2024
Jupyter Notebook

vlada-pv / Prediction-Sociolinguistic-Data-Based-on-the-Diaries-Texts-of-the-Prozhito-Project

Star

The repository contains notebooks created for collecting and preprocessing the corpus of diary entries and for experiments on creating models for predicting gender, age groups of authors and the time period of text creation.

deep-learning word-embeddings recurrent-neural-networks naive-bayes-classifier neural-networks bag-of-words logistic-regression convolutional-neural-networks diary-entries sociolinguistics text-vectorization bilstm tf-idf-vectorizer text-preprocessing convol author-profiling

Updated Jun 12, 2024
Jupyter Notebook

alla-g / infosearch_hw

Star

Homeworks and final project for Infosearch course

tf-idf bm25 text-vectorization streamlit

Updated Aug 23, 2022
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

text-vectorization

Here are 55 public repositories matching this topic...

ContextLab / hypertools

kulwinderkk / recipe_recommender_nlp

mansipatel2508 / Yelp-Review-Stars-Prediction-with-Machine-Learning

amansrivastava17 / bns-short-text-similarity

Rishabbh-Sahu / information_retrieval

Ganesh2409 / Course-Recommendation-System

Minku-Koo / Comment-Sentiment-Analysis

mkearney / wactor

rid17pawar / Sentiment-Analysis-Model-Experiments

IanCarmona / Recommendation-Songs-Taylor-Swift

andreytsimbalov / News_Classification_and_Vectorization

SarangGami / Topic-modeling-on-News-Articles-Unsupervised-Learning

rosette-api-community / visualize-embeddings

ni3choudhary / Toxic-Comment-Classifier

KaavyaRekanar / Master_Thesis

nikhil1209ui / movie_recommender

MaryvilleUniversity-AI / job-matcher

vladimiralbrekhtccr / topic_modeling_top2vec_scientific-texts

vlada-pv / Prediction-Sociolinguistic-Data-Based-on-the-Diaries-Texts-of-the-Prozhito-Project

alla-g / infosearch_hw

Improve this page

Add this topic to your repo