Lists (8)
Sort Name ascending (A-Z)
Stars
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Tongyi Deep Research, the Leading Open-source Deep Research Agent
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)
A fork to add multimodal model training to open-r1
从无到有构建一个电影知识图谱,并基于该KG,开发一个简易的KBQA程序。
[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"
This is an open-source toolkit for Heterogeneous Graph Neural Network(OpenHGNN) based on DGL.