[CVPR 2026 Oral] VGGT Omega

Python 441 8 Updated May 17, 2026

[RSS 2026] Causal video-action world model for generalist robot control

Python 1,181 93 Updated Apr 29, 2026

[ICLR 2026] Efficient Agent Training for Computer Use

Python 142 8 Updated Sep 5, 2025

Use DINOv3’s powerful, self-supervised visual features + YOLOv12’s blazing-fast detection, all in one repo. Whether you have only a few hundred labeled images or a medium-sized dataset, DINOV3-YOLO…

Python 283 43 Updated Nov 27, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,346 4,831 Updated May 15, 2026

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

Python 1,178 129 Updated May 14, 2026

Hy3 preview (295B A21B), a leading reasoning and agent model for its size, with strong cost efficiency

Python 331 13 Updated Apr 23, 2026

Helios: Real Real-Time Long Video Generation Model

Python 1,811 142 Updated Apr 16, 2026

[CVPR 2026] Pixio: a capable vision encoder dedicated to dense prediction, simply by pixel reconstruction

Python 433 10 Updated Jan 22, 2026

[NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO

Python 159 1 Updated Nov 26, 2025

Python 1,564 229 Updated Mar 25, 2026

MS-Agent: a lightweight framework to empower agentic execution of complex tasks

Python 4,258 497 Updated Apr 15, 2026

The official project website of "Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models" (CoM-PT for short, accepted to CVPR 2026)

Python 7 Updated Apr 15, 2026

A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Speech-Language Models (SLMs), and more.

107 7 Updated May 16, 2026

Official PyTorch implementation of GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers (NeurIPS 2025)

Python 8 1 Updated Apr 13, 2026

[ICLR 2026] StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

Python 120 9 Updated Mar 11, 2026

HY-Embodied: Embodied Foundation Models for Real-World Agents

Python 727 14 Updated Apr 14, 2026

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 31,142 3,722 Updated May 14, 2026

Official code for "LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)

Python 350 17 Updated May 15, 2026

Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration

C++ 86 14 Updated May 26, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,918 2,220 Updated Jul 24, 2024

[NeurIPS 2025] LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS

Python 224 14 Updated Oct 17, 2025

AI agents running research on single-GPU nanochat training automatically

Python 81,426 11,839 Updated Mar 26, 2026

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,916 349 Updated Feb 19, 2026

FASTER: Rethinking Real-Time Flow VLAs

Python 111 5 Updated May 14, 2026

Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).

Python 208 12 Updated Mar 20, 2026

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 3,915 478 Updated Mar 23, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,160 281 Updated May 17, 2026

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,899 180 Updated Feb 27, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 124,214 20,458 Updated May 15, 2026