Highlights
- Pro
Stars
MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE
OpenAI-compatible API server for Apple on-device models
A Conversational Speech Generation Model
Fully open reproduction of DeepSeek-R1
My learning notes for ML SYS.
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
Align Anything: Training All-modality Model with Feedback
Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。
A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone
Examples and guides for using the Gemini API
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
An implementation for MLLM oversensitivity evaluation
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
✨✨Latest Advances on Multimodal Large Language Models
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Fight the forgetting curve by reviewing flashcards & entire notes on Obsidian
A plugin to edit and view Excalidraw drawings in Obsidian
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
RuLES: a benchmark for evaluating rule-following in language models
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.