Skip to content
View rulixiang's full-sized avatar
😅
I may be slow to respond.
😅
I may be slow to respond.

Block or report rulixiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 73 3 Updated Jul 18, 2025

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 680 27 Updated Dec 21, 2025

🎯 告别信息过载,AI 助你看懂新闻资讯热点,简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台(抖音、知乎、B站、华尔街见闻、财联社等),智能筛选+自动推送+AI对话分析(用自然语言深度挖掘新闻:趋势追踪、情感分析、相似检索等13种工具)。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送,1分钟手机通知,无需…

Python 39,963 20,789 Updated Dec 20, 2025
Python 48 12 Updated Apr 8, 2025

Post-training with Tinker

Python 2,594 253 Updated Dec 20, 2025

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 444 24 Updated Dec 15, 2025

Native Multimodal Models are World Learners

Python 1,367 52 Updated Nov 28, 2025

[NeurIPS 2025] Official code for paper: Latent Chain-of-Thought for Visual Reasoning

Python 21 Updated Oct 16, 2025

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,473 526 Updated Oct 8, 2025

MiniMax-M2, a model built for Max coding & agentic workflows.

2,053 156 Updated Nov 13, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 368 18 Updated Aug 26, 2025

Contexts Optical Compression

Python 21,520 1,925 Updated Oct 25, 2025

Fully open data curation for reasoning models

Python 2,171 182 Updated Dec 2, 2025

A lightweight, powerful framework for multi-agent workflows

Python 17,906 3,000 Updated Dec 22, 2025

The SAIL-VL2 series model developed by the BytedanceDouyinContent Group

75 6 Updated Sep 18, 2025
Python 8,615 607 Updated Nov 12, 2025

Paper collections of multi-modal LLM for Math/STEM/Code.

132 7 Updated Nov 17, 2025

MiroThinker is a series of open-source agentic models trained for deep research and complex tool use scenarios.

Python 1,346 94 Updated Dec 20, 2025

The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"

129 3 Updated Sep 4, 2025

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,425 85 Updated Sep 22, 2025

🦜🔗 The platform for reliable agents.

Python 122,395 20,173 Updated Dec 22, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,880 655 Updated Nov 20, 2025
Python 24 2 Updated Jul 18, 2025

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,064 140 Updated Dec 18, 2025
Python 55 3 Updated Aug 21, 2025

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 513 20 Updated Nov 5, 2025

The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.

Dockerfile 276 7 Updated Sep 26, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,679 76 Updated May 11, 2025
Next