Skip to content
View suyuyiS's full-sized avatar

Block or report suyuyiS

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Efficient "Factory" to Build Multiple LoRA Adapters

Python 375 66 Updated Feb 13, 2025

大模型文本分类

Python 96 11 Updated Aug 15, 2024

[SIGIR'24] The official implementation code of MOELoRA.

Python 192 23 Updated Jul 22, 2024

X-LoRA: Mixture of LoRA Experts

Python 270 21 Updated Aug 4, 2024

Costrict - strict AI coder for enterprises, quality first, including AI Agent, AI CodeReview, AI Completion.

TypeScript 3,901 151 Updated Apr 13, 2026

Implementation of some unbalanced loss like focal_loss, dice_loss, DSC Loss, GHM Loss et.al

Python 268 45 Updated Mar 5, 2023

Minimal reproduction of DeepSeek R1-Zero

Python 13,052 1,582 Updated Feb 27, 2026

Fully open reproduction of DeepSeek-R1

Python 25,984 2,413 Updated Apr 2, 2026

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 13,843 1,360 Updated Apr 30, 2025

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)

Jupyter Notebook 420 20 Updated Jun 30, 2025

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Python 3,051 229 Updated Apr 14, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,059 8,564 Updated Apr 12, 2026

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,154 568 Updated Jul 15, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,934 606 Updated May 3, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,675 85 Updated Mar 8, 2024

A framework for few-shot evaluation of language models.

Python 12,163 3,180 Updated Apr 8, 2026

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,558 227 Updated Dec 15, 2025

SecGPT网络安全大模型

Python 3,006 359 Updated Jun 25, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,947 1,858 Updated Jul 15, 2025

An Open-sourced Knowledgable Large Language Model Framework.

Python 1,384 133 Updated Jan 11, 2025

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,286 767 Updated Oct 16, 2024

Repo for fine-tuning Casual LLMs

Python 460 86 Updated Mar 27, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,079 520 Updated Jul 1, 2025

User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.

Python 337 56 Updated Mar 24, 2023

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 73,260 7,898 Updated Mar 11, 2026

Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction

Python 219 36 Updated Jul 12, 2022

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 79,963 15,162 Updated May 10, 2024

一个简单的中文事件抽取模型,触发词和实体联合标注识别,同时判定实体角色。

Python 74 13 Updated Dec 19, 2020

An experiment and demo-level tool for text information extraction (event-triples extraction), which can be a route to the event chain and topic graph, 基于依存句法与语义角色标注的事件三元组抽取,可用于文本理解如文档主题链,事件线等应用。

Python 932 215 Updated Nov 26, 2022
Next