Skip to content
View g-jing's full-sized avatar

Highlights

  • Pro

Block or report g-jing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Skills for Real Engineers. Straight from my .claude directory.

Shell 136,444 11,824 Updated Jun 18, 2026

The ultimate training toolkit for finetuning diffusion models

Python 10,915 1,361 Updated Jun 19, 2026
Python 15 1 Updated Jul 22, 2025

Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)

Python 31 Updated Oct 28, 2025

Pioneering Automated GUI Interaction with Native Agents

Python 11,006 829 Updated Jan 27, 2026

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,426 8 Updated Feb 25, 2026

[CVPR 2025] WildAvatar: Learning In-the-wild 3D Avatars from the Web

Python 130 6 Updated Mar 11, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,288 2,880 Updated Mar 5, 2026

Scalable and memory-optimized training of diffusion models

Python 1,359 140 Updated May 26, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,333 18,231 Updated Jun 19, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,244 6,612 Updated Jun 19, 2026

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 24,556 2,810 Updated May 25, 2026

[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 856 42 Updated Dec 17, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 56,597 9,844 Updated Feb 11, 2026

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,931 161 Updated Mar 3, 2026

The official implementation of RealisDance

Python 611 28 Updated Jun 20, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,688 309 Updated Mar 10, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,239 1,107 Updated Jun 2, 2026

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 21,315 1,833 Updated Mar 5, 2026

Official repository for LTX-Video

Python 10,529 1,045 Updated Jan 5, 2026

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 12,223 1,255 Updated Nov 21, 2025

NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

Jupyter Notebook 10,363 686 Updated Jun 17, 2026

A simple pip-installable Python tool to generate your HTML citation world map from your Google Scholar ID.

Python 713 65 Updated Jun 15, 2026

FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024

Python 22 1 Updated Dec 9, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,296 154 Updated Jun 11, 2026

The best OSS video generation models, created by Genmo

Python 3,670 485 Updated Nov 14, 2025

Agent S: an open agentic framework that uses computers like a human

Python 11,887 1,400 Updated May 13, 2026

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,797 1,307 Updated Nov 4, 2025
Next