Skip to content
View AakashKumarNain's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@tensorflow @ml-gde @magic-with-latents

Block or report AakashKumarNain

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Jupyter Notebook 32 7 Updated Jun 3, 2024
Jupyter Notebook 6 Updated Apr 20, 2026

A Python DSL to write Nvidia PTX for Hopper and Blackwell in JAX and PyTorch

Python 311 26 Updated May 8, 2026

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 2,443 164 Updated Apr 6, 2026

Implement a reasoning LLM in PyTorch from scratch, step by step

Jupyter Notebook 4,515 664 Updated Jun 12, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,404 700 Updated May 17, 2026

The best ChatGPT that $100 can buy.

Python 55,082 7,511 Updated May 5, 2026

Supplementary code for https://astralord.github.io/posts/exploring-parallel-strategies-with-jax/

Python 9 2 Updated May 2, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,751 201 Updated Jun 25, 2024

Turn expensive prompts into cheap fine-tuned models

TypeScript 2,813 174 Updated May 25, 2024

MTEB: Massive Text Embedding Benchmark

Python 3,304 625 Updated Jun 15, 2026

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 14,936 1,254 Updated Jun 15, 2026

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,632 361 Updated May 13, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 4,164 510 Updated Mar 23, 2026

Textbook on reinforcement learning from human feedback

Python 1,995 207 Updated Jun 15, 2026

Distributed Compiler based on Triton for Parallel Systems

Python 1,461 151 Updated Apr 22, 2026

A mixed-curvature approach to deal with transformer representation anisotropy

Python 4 Updated Mar 25, 2026

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,863 95 Updated Apr 18, 2025

An example starter repo for Python projects

Python 313 62 Updated Jun 16, 2025

Multi-backend recommender systems with Keras 3

Python 173 18 Updated Jun 1, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

8,004 287 Updated May 15, 2025

Code for studying the super weight in LLM

Jupyter Notebook 124 16 Updated Dec 3, 2024

Notes from the Latent Space paper club. Follow along or start your own!

250 13 Updated Jul 31, 2024

supporting pytorch FSDP for optimizers

Python 84 4 Updated Dec 8, 2024

Claude skills for Synalinks OSS

Python 900 85 Updated May 8, 2026

A monitor of resources

C++ 32,868 1,055 Updated Jun 6, 2026

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,761 273 Updated Jul 18, 2025
Next