Lists (1)
Sort Name ascending (A-Z)
Stars
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Fully open data curation for reasoning models
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
A Model Context Protocol server for searching and analyzing arXiv papers
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. …
A quickstart and benchmark for pytorch distributed training.
Train your Agent model via our easy and efficient framework
Bringing BERT into modernity via both architecture changes and scaling
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Crime assistant including crime type prediction and crime consult service based on nlp methods and crime kg,罪名法务智能项目,内容包括856项罪名知识图谱, 基于280万罪名训练库的罪名预测,基于20W法务问答对的13类问题分类与法律资讯问答功能.
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
自然语言处理领域下的相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
小红书 (xiaohongshu, rednote) ai运营助手,包括小红书风格内容(包含图片)的生成和自动发布两部分,其中自动发布利用selenium实现RPA模拟点击,将生成内容和封面图和内容图自动发布
Pytorch-Named-Entity-Recognition-with-BERT
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer
Build, evaluate and train General Multi-Agent Assistance with ease
Unified Structure Generation for Universal Information Extraction
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward