Stars
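A hands-on code repository covering several mainstream fine-tuning approaches for large language models, based on the Qwen3 model series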
Code and Data for Paper "AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning"
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
llama3 implementation one matrix multiplication at a time
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
LongQLoRA: Extend Context Length of LLMs Efficiently
Firefly: a training tool for large models, supporting training of Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Inpaint anything using Segment Anything and inpainting models.
A repository compiled by the MLNLP community to help authors avoid minor mistakes in paper submissions. Paper Writing Tips
Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)
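Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English approximation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, comprehensive company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …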
A project for summarization in tf2 and pytorch