- Tsinghua University
- Beijing
- https://menghaoguo.github.io/
- @MenghaoGuo1
Stars
JittorInfer is a high-performance C++ inference framework designed for large language models on Huawei's Ascend AI processor.
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
Evaluation code for RBench-V. Based on https://github.com/open-compass/VLMEvalKit.
Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents
[SIGGRAPH 2025] One Model to Rig Them All: Diverse Skeleton Rigging with UniRig
JittorGeometric is a Jittor-based graph machine learning library.
Jittor implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
[NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs
PyTorch implementation of the paper "Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation".
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
JDiffusion is a diffusion model library for generating images or videos based on Diffusers and Jittor.
The official implementation of Self-Play Fine-Tuning (SPIN)
A collection of papers on diffusion models for 3D generation.
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
Official implementation of ICCV 2023 VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links
OpenLLaMA-Chinese, permissively licensed open-source instruction-following models based on OpenLLaMA