PrithivirajDamodaran

🏠

Working from home

Prithivida PrithivirajDamodaran

🏠

Working from home

Dense, Sparse and Hybrid Embeddings for LLMs, Multimodal Modelling & Data Engineering. Checkout my (YouTube series on V+L, linked below)

660 followers · 7 following

Bangkok
18:52 (UTC +07:00)
https://www.youtube.com/@prithivida

Achievements

x2 x3

Achievements

x2 x3

Lists (23)

Sort

Stars

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,460 1,999 Updated Nov 1, 2025

PrithivirajDamodaran / Route0x

Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da

Python 119 9 Updated Mar 31, 2025

pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 8,717 678 Updated Dec 18, 2025

CubiCasa / CubiCasa5k

CubiCasa5k floor plan dataset

Jupyter Notebook 438 129 Updated Dec 5, 2025

JulianJuaner / DeepFloorPlan_Pytorch

A PyTorch implementation for the floor plan segmentation on the r2v dataset. As well as a simple 3D mesh modeling script with the ModelNet dataset.

Python 48 4 Updated Apr 19, 2021

PrithivirajDamodaran / flashembed

Lightweight Python library to add low-footprint (all-MiniLM-* equivalent) multilingual retrievers to your RAG and Search & Retrieval pipelines.

Python 7 1 Updated Jun 8, 2024

PrithivirajDamodaran / FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…

Python 905 64 Updated Sep 15, 2025

qdrant / qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 27,784 1,947 Updated Dec 23, 2025

Social-AI-Studio / HatReD

Dataset and code implementation for the paper "Decoding the Underlying Meaning of Multimodal Hateful Memes" (IJCAI'23).

Python 19 4 Updated Jun 15, 2023

pietrobarbiero / pytorch_explain

PyTorch Explain: Interpretable Deep Learning in Python.

Jupyter Notebook 165 15 Updated May 16, 2024

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,833 1,348 Updated Oct 6, 2025

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript 122,500 19,053 Updated Dec 23, 2025

EvolvingLMMs-Lab / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,283 209 Updated Mar 5, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,771 2,928 Updated Sep 2, 2024

run-llama / llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 45,980 6,657 Updated Dec 22, 2025

vibrantlabsai / nemesis

Reward Model framework for LLM RLHF

Python 61 6 Updated Jun 7, 2023

rsaryev / auto-copilot-cli

TypeScript 368 17 Updated Oct 29, 2024

zilliztech / GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,883 567 Updated Jul 11, 2025

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 4,055 320 Updated Aug 31, 2024

0hq / WebGPT

Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~1500 lines of vanilla Javascript.

JavaScript 3,776 223 Updated Jan 12, 2024

nomic-ai / pygpt4all

Official supported Python bindings for llama.cpp + gpt4all

C++ 1,015 157 Updated May 12, 2023

ConiferLabsWA / flan-ul2-dolly

Python 34 3 Updated Apr 23, 2023

ultramsg / python-whatsApp-bot

Make WhatsApp ChatBot and use WhatsApp API to send the WhatsApp messages in python .

Python 124 86 Updated Jan 11, 2023

langchain-ai / langchain

🦜🔗 The platform for reliable agents.

Python 122,520 20,193 Updated Dec 23, 2025

nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 76,983 8,303 Updated May 27, 2025

google-research-datasets / presto

A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs

115 6 Updated Mar 17, 2023

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,322 2,131 Updated Dec 18, 2025

Lightning-AI / lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,092 527 Updated Jul 1, 2025

rackerlabs / lambda-uploader

Helps package and upload Python lambda functions to AWS

Python 273 56 Updated May 9, 2023

lambci / yumda

Yum for AWS Lambda

Dockerfile 286 21 Updated Oct 30, 2020

Prithivida PrithivirajDamodaran

Lists (23)

4 CLIP

Attention Viz

Awesome Metrics

Awesome tools

ChatGPT

Compiler

GoEmotions

Gradient Tricks

Image Captioning

Image Viz

Multimodal Datasets

Multimodal models

Portfolio pages

Recommendation

Summary

Superb datasets

Text2Image

Topic modelling

Topic Models

UI

VideosObjTracking

VQA

Z datasets

Stars