Skip to content
View udit-rawat's full-sized avatar
🌊
Building
🌊
Building

Block or report udit-rawat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"

Python 182 21 Updated Oct 22, 2025

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,225 71 Updated Mar 9, 2025

ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution

Python 749 128 Updated Dec 14, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

70,838 8,105 Updated Dec 21, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,769 1,891 Updated Dec 11, 2025

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 883 72 Updated Nov 26, 2025

2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.

Jupyter Notebook 2,360 152 Updated Nov 19, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,423 12,175 Updated Dec 21, 2025

A Tree Search Library with Flexible API for LLM Inference-Time Scaling

Python 504 65 Updated Dec 9, 2025

Universal memory layer for AI Agents

Python 44,522 4,836 Updated Dec 17, 2025

Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)

Python 380 22 Updated Dec 10, 2025
C++ 717 41 Updated Aug 15, 2025
Python 235 23 Updated Nov 27, 2025

A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

TypeScript 10,130 824 Updated Dec 18, 2025
Python 40 2 Updated May 15, 2025

Research papers and blogs to transition to AI Engineering

2,001 272 Updated Nov 19, 2025

Official inference framework for 1-bit LLMs

Python 24,461 1,914 Updated Jun 3, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 2,218 153 Updated Oct 14, 2025

Synthetic Data Quality Assurance 🔎

HTML 65 12 Updated Dec 9, 2025

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

Rust 10,720 746 Updated Dec 21, 2025
Python 213 20 Updated Dec 16, 2025

🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.

Python 1,881 219 Updated May 19, 2025

Code release for DynamicTanh (DyT)

Python 1,028 86 Updated Mar 30, 2025

Adding guardrails to large language models.

Python 6,173 493 Updated Dec 18, 2025

Flock is a workflow-based low-code platform for rapidly building chatbots, RAG, and coordinating multi-agent teams, powered by LangGraph, Langchain, FastAPI, and NextJS.(Flock 是一个基于workflow工作流的低代码平…

TypeScript 1,057 131 Updated Aug 20, 2025

Delivery infrastructure for agents. Arch is a models-native proxy and data plane for agents that handles plumbing work in AI - like agent routing and orchestration, guardrails, zero-code logs and t…

Rust 4,644 261 Updated Dec 20, 2025
Jupyter Notebook 2 Updated Feb 28, 2025

Just like the beloved character Doraemon who pulls out gadgets from his pocket, this agent can dynamically create, save, and utilize its own tools when needed.

Python 17 4 Updated Jan 27, 2025

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 68,036 7,213 Updated Dec 20, 2025
Next