ZhaiYanbo

Yanbo Zhai ZhaiYanbo

I am Yanbo Zhai, a student in Xi'an Jiaotong University, I am eager to learn on GitHub and contribute my part to this community.

3 followers · 7 following

Xi'an Jiaotong University
No.28 Xianning West Road, Xi'an, Shaanxi 710049, P.R. China

Highlights

Stars

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,677 309 Updated Nov 13, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 10,989 2,915 Updated Dec 18, 2025

THU-KEG / LRM-FactEval

Python 15 2 Updated Jun 25, 2025

voidism / DoLa

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 532 66 Updated Jan 17, 2025

Raina-Xin / I2MoE

[ICML 2025] I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts.

Python 55 11 Updated May 31, 2025

Osilly / Vision-R1

This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reas…

Python 744 20 Updated Sep 10, 2025

Guanzhou-Ke / Knowledge-Bridger

The official repos of "Knowledge Bridger: Towards Training-Free Missing Modality Completion"

Python 19 Updated Jun 30, 2025

facebookresearch / deepconf

DeepConf: Deep Think with Confidence

Python 334 50 Updated Sep 18, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,298 1,445 Updated Nov 28, 2025

wizard-III / ArcherCodeR

ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement learning.

Python 43 2 Updated Aug 6, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 4,724 299 Updated Dec 19, 2025

OpenBMB / MiniCPM

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,472 526 Updated Oct 8, 2025

SpursGoZmy / Tabular-LLM

本项目旨在收集开源的表格智能任务数据集（比如表格问答、表格-文本生成等），将原始数据整理为指令微调格式的数据并微调LLM，进而增强LLM对于表格数据的理解，最终构建出专门面向表格智能任务的大型语言模型。

628 44 Updated Apr 22, 2024

SpursGoZmy / Awesome-Tabular-LLMs

We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理

588 43 Updated Dec 15, 2025

Jian-Lang / RAGPT

This repo is the official implementation of "Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning" accepted by AAAI 2025.

Python 55 3 Updated Dec 6, 2025

HumanMLLM / HumanOmni

HumanOmni

Python 209 12 Updated Mar 10, 2025

HumanMLLM / R1-Omni

Python 986 69 Updated Mar 24, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,678 76 Updated May 11, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,660 2,860 Updated Dec 21, 2025

UNITES-Lab / Flex-MoE

[NeurIPS 2024 Spotlight] Code for the paper "Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts"

Python 68 8 Updated Jun 9, 2025

Mr-Righter / xjtu-sport-bot

Python 60 8 Updated Oct 12, 2025

DAMO-NLP-SG / VideoLLaMA3

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 1,080 79 Updated Aug 14, 2025

lzw-lzw / GroundingGPT

[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model

Python 341 18 Updated Nov 4, 2024

appletea233 / LLaVA-ST

[CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

Python 81 4 Updated Jul 4, 2025

ZixianGao / EUAR

Python 9 1 Updated Sep 28, 2024

AIM3-RUC / MMIN

Missing Modality Imagination Network for Emotion Recognition with Uncertain Missing Modalities

71 14 Updated Nov 9, 2022

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,290 327 Updated Dec 15, 2025

withinmiaov / A-Survey-on-Mixture-of-Experts-in-LLMs

[TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

468 23 Updated Jul 23, 2025

laekov / fastmoe

A fast MoE impl for PyTorch

Python 1,825 197 Updated Feb 10, 2025

SkyworkAI / MoH

MoH: Multi-Head Attention as Mixture-of-Head Attention

Python 297 15 Updated Oct 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly