text-clustering

Here are 111 public repositories matching this topic...

mituskillologies / pgdai-nlpcv-dec25

Programs from the "Natural Language Processing and Computer Vision" subject of the PGDAI course at CDAC Pune, November-December 2025

python natural-language-processing deep-neural-networks computer-vision sentiment-analysis text-classification image-processing recurrent-neural-networks artificial-intelligence image-classification convolutional-neural-networks object-detection transfer-learning text-clustering

Updated Dec 16, 2025
Jupyter Notebook

ThorstenDoherr / searchengine

Heuristic matching of large databases by fuzzy criteria such as addresses

machine-learning entity-resolution foxpro matching-engine address-matching text-clustering foxpro-database-files

Updated Dec 15, 2025
xBase

jameswniu / research_doc_extraction_rag_agent

Turn messy survey responses into clean research insights. Dual-model pipeline: Claude Opus 4.5 extracts themes and assigns participants, GPT-5.1 writes executive summaries. Tuned temperatures for precision where it matters.

nlp text-analysis survey-analysis text-clustering qualitative-research openai-api llm thematic-analysis research-automation claude-api llm-pipeline dual-model

Updated Dec 3, 2025
Python

RodolfoLSS / wine_analysis

Data analysis of a wine dataset.

python nlp machine-learning data-analysis data-preprocessing nlp-machine-learning text-clustering

Updated Dec 16, 2025
Jupyter Notebook

TranTungDuong1611 / CTAI_MachineLearning_Project

A comprehensive news aggregation and text analysis system that leverages advanced machine learning techniques to process Vietnamese news articles.

machine-learning mvc deep-learning text-classification text-summarization system-design text-clustering mlops stacking-ensemble

Updated Sep 29, 2025
Python

vickvey / modern-tabasco

A modern tool for detecting intra-domain textual ambiguities using word sense disambiguation techniques. Built with FastAPI (backend) and Next.js (frontend) for a modular and modern developer experience.

word-sense-disambiguation text-clustering fastapi nextjs15

Updated Sep 27, 2025
Shell

Digioref / NLP-Project

Repository for the Natural Language Processing project at the Polytechnic of Milan. Generative chatbots with audio, images, and RAG.

Updated Sep 19, 2025
Jupyter Notebook

ArikReuter / TopicGPT

TopicGPT integrates the benefits of LLMs into topic modelling

nlp natural-language-processing text-mining topic-modeling gpt text-clustering topic-modeling-analysis gpt-3 openai-api gpt-4 chatgpt

Updated Sep 19, 2025
Python

tiansztiansz / python-data-science

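Bilibili channel AI日日新: irregularly updated examples of machine learning, deep learning, and data science tasks completed with Python frameworks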

text-classification image-classification embedding video-classification text-clustering token-classification

Updated Jul 19, 2025
Jupyter Notebook

JuanLara18 / Text-Classification-System

Modular pipeline for text clustering, classification, and evaluation using TF-IDF and unsupervised ML techniques

nlp unsupervised-learning tfidf text-clustering

Updated Jun 30, 2025
Python
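
As a rough illustration of what a TF-IDF plus unsupervised clustering pipeline involves, here is a minimal pure-Python sketch. It is not taken from the repository above: the function names (tfidf_vectors, dot, kmeans), the whitespace tokeniser, the smoothed IDF formula, and the caller-supplied seed indices are all assumptions made for this example, and the clustering step is a simple spherical k-means chosen for brevity.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one L2-normalised sparse TF-IDF vector (term -> weight) per document."""
    tokenised = [doc.lower().split() for doc in docs]  # assumption: whitespace tokens
    n_docs = len(tokenised)
    # Document frequency: number of documents in which each term appears.
    df = Counter(term for tokens in tokenised for term in set(tokens))
    vectors = []
    for tokens in tokenised:
        tf = Counter(tokens)
        # Smoothed IDF (an assumed variant): log((1 + N) / (1 + df)) + 1.
        vec = {t: c * (math.log((1 + n_docs) / (1 + df[t])) + 1) for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def dot(a, b):
    """Dot product of two sparse vectors; equals cosine similarity for unit vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def kmeans(vectors, seeds, n_iter=10):
    """Spherical k-means: assign each vector to the most similar centroid.

    `seeds` lists the indices of the documents used as initial centroids.
    """
    centroids = [dict(vectors[i]) for i in seeds]
    labels = [0] * len(vectors)
    for _ in range(n_iter):
        labels = [
            max(range(len(centroids)), key=lambda k: dot(v, centroids[k]))
            for v in vectors
        ]
        for k in range(len(centroids)):
            members = [v for v, lab in zip(vectors, labels) if lab == k]
            if not members:
                continue  # keep the previous centroid if a cluster becomes empty
            total = Counter()
            for v in members:
                total.update(v)  # adds the weights term by term
            norm = math.sqrt(sum(w * w for w in total.values())) or 1.0
            centroids[k] = {t: w / norm for t, w in total.items()}
    return labels


docs = [
    "red wine tasting notes",
    "white wine tasting notes",
    "football match report",
    "football league match results",
]
labels = kmeans(tfidf_vectors(docs), seeds=[0, 2])
print(labels)
```

Passing explicit seed indices keeps the sketch deterministic; a real pipeline would more likely use a library vectoriser and a randomised or k-means++ initialisation.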

yuuusha / topic-modeling

This repository contains the files (notebooks, data) for the second-year coursework "Topic modeling for text document analysis".

data-science machine-learning topic-modeling data-analysis nlp-machine-learning lda-model text-clustering nmf-matrix-factorization lsa-model

Updated Jun 13, 2025
Python

Navy10021 / SLS

SLS: Neural Information Retrieval (IR)-based semantic search model

nlp embeddings topic-modeling text-clustering pre-trained-language-models semantic-search-algorithm information-retrieval-system

Updated Mar 21, 2025
Jupyter Notebook

Navy10021 / Parallel_Clustering_based_TM

Parallel clustering-based Topic Modeling

nlp topic-modeling keyword-extraction text-clustering bert-model bert-embeddings

Updated Mar 18, 2025
Python

till-tietz / gsdmm

GSDMM Short Text Clustering via Dirichlet Mixture Models

r rcpp cpp text-analytics text-clustering

Updated Mar 15, 2025
C++

SkywardAI / hackathon-leaderboard

Automated Leaderboard System for Hackathon Evaluation Using Large Language Models

ai text-classification hackathon bedrock text-clustering llms

Updated Feb 19, 2025
JavaScript

pngo1997 / Document-Clustering-using-K-Means

Performs unsupervised clustering on text documents.

python clustering wordcloud unsupervised-learning sparse-matrix kmeans-clustering text-clustering wordcloud-visualization

Updated Jan 31, 2025
Jupyter Notebook

xlang-ai / instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

information-retrieval text-classification embeddings language-model text-embedding text-clustering text-semantic-similarity text-evaluation prompt-retrieval text-reranking

Updated Jan 15, 2025
Python

michellemashutian / clusteringText

The repository provides a pipeline for preprocessing text data, extracting features, and applying clustering algorithms like K-means, DBSCAN, or hierarchical clustering.

python lda kmeans-clustering dbscan-clustering text-clustering lsi-model sklearn-library sklearn-clustering

Updated Dec 26, 2024
Python

ScottishFold007 / FastThresholdClustering

FastThresholdClustering is an efficient vector clustering algorithm based on FAISS, particularly suitable for large-scale vector data clustering tasks. The algorithm features intuitive and easy-to-select hyperparameters, uses cosine similarity as its distance metric, and supports GPU acceleration.

clustering-algorithm text-clustering

Updated Dec 17, 2024
Python
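
To make the idea of threshold-based clustering under cosine similarity concrete, here is a small pure-Python sketch. It is not the FastThresholdClustering algorithm and it does not use FAISS or a GPU; it is a naive greedy single-pass variant, and the names normalise and threshold_cluster as well as the default threshold of 0.8 are assumptions made for this example.

```python
import math


def normalise(v):
    """Scale a dense vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def threshold_cluster(vectors, threshold=0.8):
    """Greedy single-pass clustering by cosine similarity.

    Each vector joins the most similar existing cluster if the cosine
    similarity to that cluster's centroid is at least `threshold`;
    otherwise it starts a new cluster. Returns one label per vector.
    """
    unit = [normalise(v) for v in vectors]
    sums = []    # running sum of member vectors for each cluster
    labels = []
    for v in unit:
        best, best_sim = -1, threshold
        for k, s in enumerate(sums):
            # Cosine similarity between v and the normalised cluster centroid.
            sim = sum(a * b for a, b in zip(v, normalise(s)))
            if sim >= best_sim:
                best, best_sim = k, sim
        if best == -1:
            sums.append(list(v))          # no cluster is close enough: open a new one
            best = len(sums) - 1
        else:
            sums[best] = [a + b for a, b in zip(sums[best], v)]
        labels.append(best)
    return labels


points = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.95]]
print(threshold_cluster(points, threshold=0.8))
```

The single threshold is the only hyperparameter here, and the number of clusters falls out of the data rather than being fixed in advance. The result of this naive version depends on input order, and the inner loop over clusters is what an approximate nearest-neighbour index would replace at scale.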

michabirklbauer / hgb_dse_text_mining

Contents for the practical part of the lecture Text Mining

python nlp machine-learning text-mining deep-learning text-classification tensorflow keras spacy how-to educational text-clustering

Updated Nov 7, 2024
Jupyter Notebook
