Skip to content
View ZhaiYanbo's full-sized avatar
  • Xi'an Jiaotong University
  • No.28 Xianning West Road, Xi'an, Shaanxi 710049, P.R. China

Highlights

  • Pro

Block or report ZhaiYanbo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,677 309 Updated Nov 13, 2025

A framework for few-shot evaluation of language models.

Python 10,989 2,915 Updated Dec 18, 2025
Python 15 2 Updated Jun 25, 2025

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 532 66 Updated Jan 17, 2025

[ICML 2025] I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts.

Python 55 11 Updated May 31, 2025

This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reas…

Python 744 20 Updated Sep 10, 2025

The official repos of "Knowledge Bridger: Towards Training-Free Missing Modality Completion"

Python 19 Updated Jun 30, 2025

DeepConf: Deep Think with Confidence

Python 334 50 Updated Sep 18, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,298 1,445 Updated Nov 28, 2025

ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement learning.

Python 43 2 Updated Aug 6, 2025

My learning notes for ML SYS.

Python 4,724 299 Updated Dec 19, 2025

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,472 526 Updated Oct 8, 2025

本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。

628 44 Updated Apr 22, 2024

We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理

588 43 Updated Dec 15, 2025

This repo is the official implementation of "Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning" accepted by AAAI 2025.

Python 55 3 Updated Dec 6, 2025

HumanOmni

Python 209 12 Updated Mar 10, 2025
Python 986 69 Updated Mar 24, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,678 76 Updated May 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,660 2,860 Updated Dec 21, 2025

[NeurIPS 2024 Spotlight] Code for the paper "Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts"

Python 68 8 Updated Jun 9, 2025
Python 60 8 Updated Oct 12, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 1,080 79 Updated Aug 14, 2025

[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model

Python 341 18 Updated Nov 4, 2024

[CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

Python 81 4 Updated Jul 4, 2025
Python 9 1 Updated Sep 28, 2024

Missing Modality Imagination Network for Emotion Recognition with Uncertain Missing Modalities

71 14 Updated Nov 9, 2022

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,290 327 Updated Dec 15, 2025

[TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

468 23 Updated Jul 23, 2025

A fast MoE impl for PyTorch

Python 1,825 197 Updated Feb 10, 2025

MoH: Multi-Head Attention as Mixture-of-Head Attention

Python 297 15 Updated Oct 29, 2024
Next