Skip to content
View Neil-HZC's full-sized avatar

Block or report Neil-HZC

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Python 410 12 Updated Dec 5, 2025

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,986 334 Updated Jun 12, 2024

[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Python 733 35 Updated Aug 2, 2025

OmniGen2: Exploration to Advanced Multimodal Generation.

Jupyter Notebook 3,973 12 Updated Dec 2, 2025

This is the dataset and code release of the OpenRooms Dataset. For more information, please refer to our webpage below. Thanks a lot for your interest in our research!

158 8 Updated Mar 26, 2024

Go ahead and axolotl questions

Python 10,978 1,223 Updated Dec 19, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,760 1,071 Updated Dec 21, 2025

适用于中山大学(SYSU)课程/实验报告的一个简单的 LaTeX 小模板

TeX 37 3 Updated Sep 26, 2024

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

Python 1,937 146 Updated Oct 1, 2025

[ICCV'25] Official PyTorch Implementation of "JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers"

Python 23 1 Updated Nov 27, 2025

A framework for few-shot evaluation of language models.

Python 10,990 2,915 Updated Dec 18, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,444 705 Updated Dec 17, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,662 2,861 Updated Dec 21, 2025

[COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Python 291 16 Updated Aug 25, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,637 746 Updated Sep 22, 2025

Code and data for Shading Annotations in the Wild

Python 30 8 Updated Apr 8, 2017

[ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)

Python 36 5 Updated Sep 8, 2025

A lightweight implementation of the Qwen-Image-Edit model for inference and LoRA fine-tuning on 8×V100 GPUs

Python 73 1 Updated Dec 3, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,183 120 Updated Nov 9, 2025

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering (nvidia)

Python 15 3 Updated Sep 24, 2024

[CVPR2025] Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Python 186 3 Updated Apr 3, 2025

STGAN-based framework for material appearance editing.

Python 9 2 Updated Oct 30, 2024
Python 4,461 435 Updated Sep 14, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,879 12,103 Updated Dec 21, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,868 3,823 Updated Dec 21, 2025

Official Repository for Ouroboros - ICCV 2025

Python 15 1 Updated Nov 12, 2025

Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation

Python 22 Updated Jul 30, 2025

The collection of awesome papers on alignment of diffusion models.

381 16 Updated Oct 27, 2025

Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).

Python 40 Updated May 9, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,282 7,790 Updated Dec 21, 2025
Next