Highlights
- Pro
Stars
Production-ready platform for agentic workflow development.
The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usable locally.
(ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators
Self-Guided Function Calling in Large Language Models via Stepwise Experience Recall
“AI-Compass”将为社区指引在 AI 技术海洋中航行的方向,无论你是初学者还是进阶开发者,都能在这里找到通往 AI 各大方向的路径。旨在帮助开发者系统性地了解 AI 的核心概念、主流技术、前沿趋势,并通过实践掌握从理论到落地的全过程。
[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".
⏩ Ship faster with Continuous AI. Build and run custom agents across your IDE, terminal, and CI
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
verl: Volcano Engine Reinforcement Learning for LLMs
Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents
Open-source implementation of AlphaEvolve
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Benchmarking LLMs' Gaming Ability in Multi-Agent Environments
A framework for few-shot evaluation of language models.
This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"
A curated list of Diffusion Model in RL resources (continually updated)
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.