Skip to content
View g-jing's full-sized avatar

Highlights

  • Pro

Block or report g-jing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 8 Updated Jul 22, 2025

Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)

Python 27 Updated Oct 28, 2025

Pioneering Automated GUI Interaction with Native Agents

Python 8,673 612 Updated Dec 26, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,324 192 Updated Jun 5, 2025

[CVPR 2025] WildAvatar: Learning In-the-wild 3D Avatars from the Web

Python 125 6 Updated Mar 11, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,997 2,227 Updated Dec 15, 2025

Scalable and memory-optimized training of diffusion models

Python 1,312 143 Updated Jun 4, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,250 12,208 Updated Dec 26, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,998 3,876 Updated Dec 26, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 22,515 2,634 Updated Dec 24, 2025

[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 810 39 Updated Dec 17, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,480 8,987 Updated Nov 17, 2025

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,882 152 Updated Oct 4, 2025

The official implementation of RealisDance

Python 607 28 Updated Jun 20, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,610 296 Updated Mar 10, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,085 1,090 Updated Nov 18, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 20,028 1,675 Updated Nov 26, 2025

Official repository for LTX-Video

Python 8,941 838 Updated Oct 25, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,507 1,159 Updated Nov 21, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,058 523 Updated Jun 9, 2025

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 644 59 Updated Dec 15, 2025

FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024

Python 22 1 Updated Dec 9, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,145 135 Updated Dec 15, 2025

The best OSS video generation models, created by Genmo

Python 3,540 469 Updated Nov 14, 2025

Agent S: an open agentic framework that uses computers like a human

Python 9,121 1,029 Updated Dec 16, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,274 1,237 Updated Nov 4, 2025
Python 17 4 Updated Oct 22, 2024

[CVPR 2024] On the Content Bias in Fréchet Video Distance

Python 136 9 Updated Sep 28, 2024
Next