Starred repositories

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 16,998 2,883 Updated Sep 10, 2025

Text-audio foundation model from Boson AI

Python 7,415 536 Updated Sep 15, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 820 53 Updated May 14, 2025

Official Repo for Open-Reasoner-Zero

Python 2,045 117 Updated Jun 2, 2025

LLM Arena by KCORES team

HTML 947 39 Updated Apr 29, 2025

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.

Python 2,941 270 Updated Aug 2, 2025

AIMO2 2nd place solution

Python 66 12 Updated May 28, 2025

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

635 30 Updated Sep 16, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 737 28 Updated Sep 7, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,431 416 Updated Oct 9, 2025

Fully open data curation for reasoning models

Python 2,113 175 Updated Sep 3, 2025

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face.

Python 343 81 Updated May 23, 2023

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,572 68 Updated May 11, 2025

Fully open reproduction of DeepSeek-R1

Python 25,520 2,396 Updated Sep 8, 2025
Python 744 49 Updated Sep 3, 2025

Explore the Multimodal “Aha Moment” on 2B Model

Python 610 22 Updated Mar 18, 2025

A brief and partial summary of RLHF algorithms.

132 3 Updated Mar 4, 2025

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,807 133 Updated Jul 5, 2024

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,719 281 Updated Oct 6, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, a large language model and a vision-language model based on Linear Attention

Python 3,156 294 Updated Jul 7, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,048 1,649 Updated Sep 24, 2025

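This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications in practice)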

HTML 21,194 2,490 Updated Aug 3, 2025

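Free downloads of English-language magazines including The Economist (with audio), The New Yorker, The Guardian, Wired, and The Atlantic; supports epub, mobi, and pdf formats; updated weekly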

CSS 26,008 2,095 Updated Oct 3, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 18,702 3,096 Updated Oct 9, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,463 262 Updated Oct 8, 2025

ChatYuan: Large Language Model for Dialogue in Chinese and English

Python 1,884 180 Updated Jun 16, 2023

PromptCLUE: a zero-shot learning model supporting all Chinese-language tasks

Jupyter Notebook 664 67 Updated Jun 16, 2023

A Self-Learning Guide to Computer Science

HTML 67,882 7,631 Updated Oct 9, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,419 1,064 Updated Sep 24, 2025