gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,049 2,067 Updated Mar 27, 2026

Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality

HTML 340 20 Updated Jan 5, 2026

Temporal Neural Networks

Python 29 2 Updated Mar 2, 2026

An AI agent system for solving International Mathematical Olympiad (IMO) problems using Google's Gemini, OpenAI, and xAI APIs.

Python 928 121 Updated Oct 1, 2025

Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?

Jupyter Notebook 43 2 Updated Jul 26, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,527 115 Updated Jan 19, 2026

Official Implementation of weights2weights

Jupyter Notebook 156 8 Updated Mar 7, 2025

Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025)

Python 46 2 Updated May 1, 2025

Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?

Python 149 8 Updated Feb 11, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 19,023 2,423 Updated Apr 7, 2026

Utilities for efficient fine-tuning, inference and evaluation of code generation models

Python 21 3 Updated Oct 3, 2023

Schedule-Free Optimization in PyTorch

Python 2,276 75 Updated May 21, 2025

Gaia Physics Engine

Raku 853 63 Updated Oct 4, 2025

Synthesizing Moving People with 3D Control

Python 140 4 Updated Sep 2, 2025

Run PyTorch in JAX. 🤝

Python 315 17 Updated Oct 13, 2025

CUDA accelerated rasterization of gaussian splatting

Python 4,927 810 Updated Apr 24, 2026

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,430 224 Updated May 19, 2025

Official inference library for Mistral models

Jupyter Notebook 10,782 1,043 Updated Apr 20, 2026

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"

Python 2,029 252 Updated Dec 25, 2025

PyTorch Implementation for Neural Point Characters (NPC)

Python 27 3 Updated Apr 1, 2024

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 7,182 797 Updated Apr 24, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,203 4,807 Updated Apr 24, 2026

A minimal PyTorch Lightning OpenAI GPT with DeepSpeed training!

Python 112 16 Updated May 11, 2023

Nerfbusters project

Jupyter Notebook 232 16 Updated Sep 20, 2025

FAIR Sequence Modeling Toolkit 2

Python 1,128 140 Updated Apr 27, 2026

Generative Agents: Interactive Simulacra of Human Behavior

21,196 2,977 Updated Aug 5, 2024

Inference code for Llama models

Python 59,370 9,818 Updated Jan 26, 2025

Code repository for the paper "On the Benefits of 3D Pose and Tracking for Human Action Recognition", (CVPR 2023)

Jupyter Notebook 285 31 Updated Jan 19, 2024

Easily create large video datasets from video URLs

Python 656 76 Updated Jul 30, 2024

An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers

Shell 57 5 Updated Jul 11, 2023