Skip to content
View Hritikbansal's full-sized avatar

Block or report Hritikbansal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Data recipes and robust infrastructure for training AI agents

Python 69 5 Updated Dec 24, 2025

Official Repo for HoneyBee Paper

Python 9 1 Updated Oct 22, 2025
Python 3 Updated Aug 23, 2025

This is a pytorch implementation of k-means clustering algorithm

Python 336 42 Updated Mar 4, 2025
Python 108 9 Updated Sep 13, 2025

😎 Finding duplicate images made easy!

Python 5,553 475 Updated Aug 15, 2025

A Python Perceptual Image Hashing Module

Python 3,771 339 Updated Apr 17, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,763 2,888 Updated Dec 24, 2025

Scalable toolkit for efficient model reinforcement

Python 1,169 201 Updated Dec 24, 2025

Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding

Python 186 9 Updated Dec 17, 2025

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 280 13 Updated Sep 25, 2025

The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]

Python 170 7 Updated Jun 5, 2025

Dream 7B, a large diffusion language model

Python 1,119 72 Updated Nov 21, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,426 230 Updated Nov 12, 2025

get things from one computer to another, safely

Python 22,114 718 Updated Dec 16, 2025

An open-source implementation for training LLaVA-NeXT.

Python 428 22 Updated Oct 23, 2024

[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning"

Python 14 1 Updated Oct 31, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,177 53 Updated Aug 27, 2025

OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement

Python 123 6 Updated Jul 24, 2025

A light-weight tool for evaluating LLMs in rule-based ways.

Python 79 7 Updated Jun 19, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,976 2,223 Updated Dec 15, 2025
Python 108 8 Updated May 7, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,479 295 Updated Dec 19, 2025

Efficient Triton Kernels for LLM Training

Python 5,975 454 Updated Dec 23, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,650 839 Updated Dec 18, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,857 228 Updated Dec 24, 2025

Fully open data curation for reasoning models

Python 2,174 182 Updated Dec 2, 2025

A library for advanced large language model reasoning

Python 2,320 204 Updated Jun 10, 2025

Synthetic data curation for post-training and structured data extraction

Python 1,586 126 Updated Jul 29, 2025
Next