- Meta Reality Labs Research
- WA, USA
- http://people.csail.mit.edu/jstraub/
- https://orcid.org/0000-0003-2339-1262
- @jstraub6
Stars
A project that optimizes Whisper for low-latency inference using NVIDIA TensorRT
AI agents running research on single-GPU nanochat training automatically
A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future …
PyTorch native quantization and sparsity for training and inference
Code release for "Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild"
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
PyTorch implementation of VAE-Gumbel-Softmax
Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024
Bayesian entropy estimation in Python via the Nemenman-Shafee-Bialek algorithm
EDGE: Scalable and optimum mutual information estimator for high-dimensional applications including deep learning
A toolkit for developing and comparing reinforcement learning algorithms.
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
[CVPR'25 Highlight] Official repository of Sonata: Self-Supervised Learning of Reliable Point Representations
A generative world for general-purpose robotics & embodied AI learning.
Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.
Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end-to-end problems using Llama mode…
Implementation of MagViT2 Tokenizer in PyTorch
🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
Differentiable Gaussian rasterization with depth, alpha, normal map, and extra per-Gaussian attributes; also supports camera pose gradients
This is the official release for the paper "EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https://arxiv.org/abs/2406.10224).
This is the official repository for "EgoLifter: Open-world 3D Segmentation for Egocentric Perception", ECCV 2024
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral
Python code to fuse multiple RGB-D images into a TSDF voxel volume.