Skip to content
View zeyofu's full-sized avatar

Highlights

  • Pro

Block or report zeyofu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,449 1,688 Updated Sep 24, 2025

Official implementation of BLIP3o-Series

Python 1,614 74 Updated Nov 29, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,464 125 Updated Dec 25, 2025

Contexts Optical Compression

Python 21,574 1,929 Updated Oct 25, 2025

Fully Open Framework for Democratized Multimodal Training

Python 663 53 Updated Dec 15, 2025

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 783 51 Updated Oct 15, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,654 55 Updated Nov 15, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,295 204 Updated May 19, 2025

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 13,051 1,386 Updated Dec 18, 2025

Minimalistic large language model 3D-parallelism training

Python 2,382 262 Updated Dec 11, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,412 153 Updated Aug 12, 2025

Official codebase for the paper Latent Visual Reasoning

Python 66 5 Updated Oct 22, 2025
Python 65 3 Updated Nov 5, 2025

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

Python 88 3 Updated Aug 8, 2025

[NeurIPS 2024] Visual Perception by Large Language Model’s Weights

Python 55 1 Updated Mar 31, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,733 1,360 Updated Dec 24, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,392 1,457 Updated Nov 28, 2025

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,904 8,720 Updated Oct 11, 2024

Muon is Scalable for LLM Training

1,387 78 Updated Aug 3, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,974 3,867 Updated Dec 26, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,656 840 Updated Dec 18, 2025

Open-source unified multimodal model

Python 5,509 481 Updated Oct 27, 2025

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,972 80 Updated Aug 24, 2025

This is the della guide for Zhuang's group at Princeton University.

Python 13 Updated Dec 14, 2025

[ICCV 2025] Video-T1: Test-Time Scaling for Video Generation

Python 303 17 Updated Jun 29, 2025

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 644 59 Updated Dec 15, 2025

EB1A DIY Collection

15 5 Updated Nov 17, 2025

DIY for NIW/EB1A

28 11 Updated Jan 19, 2024
TeX 99 44 Updated Jan 29, 2025
Next