- Waterloo, Canada
- https://ddhruvkr.github.io/
Stars
AI-powered citation search & paper review for Overleaf β Chrome extension. Think Google Scholar but inside Overleaf. Also works with OpenAI Prism.
Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.
Post-training with Tinker
Large Language Model based Multi-Agents: A Survey of Progress and Challenges (In IJCAI 2024)
Paper list of multi-agent reinforcement learning (MARL)
Modular and structured prompt caching for low-latency LLM inference
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
A bibliography and survey of the papers surrounding o1
A high-throughput and memory-efficient inference and serving engine for LLMs
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
Learn how to develop, deploy and iterate on production-grade ML applications.
π§βπ« 60+ Implementations/tutorials of deep learning papers with side-by-side notes π; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gaβ¦
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
π Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Implementation of popular ML algorithms from scratch
Reference implementation of Mistral AI 7B v0.1 model.
Tutorial on neural theorem proving
π A list of open LLMs available for commercial use.
A playbook for systematically maximizing the performance of deep learning models.
π Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)
Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)