WxxShirley
166 stars written in Python

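Provides a practical interaction interface for LLMs such as GPT/GLM, specifically optimized for paper reading, polishing, and writing. Modular design; supports custom shortcut buttons and function plugins, project analysis and self-interpretation for Python, C++, and other projects, PDF/LaTeX paper translation and summarization, parallel queries to multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…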

Python 69,620 8,389 Updated Sep 20, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,114 3,945 Updated Nov 10, 2025

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 27,697 3,481 Updated Sep 23, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,306 1,764 Updated Oct 13, 2025

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 24,440 2,053 Updated Jul 29, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,595 2,211 Updated Mar 11, 2025

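Use ChatGPT to summarize arXiv papers. Accelerates the full research workflow: uses ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.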

Python 19,106 1,950 Updated Apr 4, 2024

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 18,309 2,123 Updated Sep 24, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,058 1,298 Updated Nov 10, 2025

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Python 15,430 2,268 Updated Aug 15, 2025

Sina Weibo crawler: scrape Sina Weibo data with Python

Python 9,276 2,053 Updated Sep 21, 2025

[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | DouDizhu AI

Python 4,434 630 Updated Jun 26, 2024

Simple RL training for reasoning

Python 3,783 279 Updated Aug 3, 2025

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,579 359 Updated May 13, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,269 421 Updated Nov 10, 2025

Review site for open courses from top universities

Python 3,218 291 Updated Jun 17, 2022

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients. Published in Nature.

Python 3,070 254 Updated Jul 25, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,390 185 Updated Nov 10, 2025

A library for graph deep learning research

Python 1,996 289 Updated Jul 15, 2024

CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)

Python 1,807 310 Updated Feb 1, 2024

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Python 1,676 293 Updated Sep 8, 2022

QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)

Python 1,632 408 Updated Dec 26, 2023

A fork to add multimodal model training to open-r1

Python 1,416 70 Updated Feb 8, 2025
Python 1,334 120 Updated Sep 12, 2025

Build a movie knowledge graph from scratch and, based on that KG, develop a simple KBQA program.

Python 1,325 424 Updated Aug 6, 2022

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,224 110 Updated Sep 19, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,184 111 Updated Aug 16, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,156 97 Updated Oct 20, 2025

PyTorch implementation of Barlow Twins.

Python 995 128 Updated Mar 3, 2022

Code for "WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"

Python 948 108 Updated Mar 4, 2024