An Efficient "Factory" to Build Multiple LoRA Adapters

Python 374 66 Updated Feb 13, 2025

Text classification with large language models

Python 95 11 Updated Aug 15, 2024

[SIGIR'24] The official implementation code of MOELoRA.

Python 191 23 Updated Jul 22, 2024

X-LoRA: Mixture of LoRA Experts

Python 267 21 Updated Aug 4, 2024

Costrict - strict AI coder for enterprises, quality first, including AI Agent, AI CodeReview, AI Completion.

TypeScript 3,848 149 Updated Apr 2, 2026

Implementation of several losses for imbalanced data, such as focal_loss, dice_loss, DSC Loss, and GHM Loss.

Python 267 45 Updated Mar 5, 2023

Minimal reproduction of DeepSeek R1-Zero

Python 13,014 1,585 Updated Feb 27, 2026

Fully open reproduction of DeepSeek-R1

Python 25,966 2,410 Updated Apr 2, 2026

Notes on knowledge and interview questions for large language model (LLM) algorithm (application) engineers.

HTML 13,653 1,347 Updated Apr 30, 2025

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight)

Jupyter Notebook 417 20 Updated Jun 30, 2025

Chinese-LLaMA 1 & 2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets

Python 3,052 229 Updated Apr 14, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,398 8,444 Updated Apr 1, 2026

Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models

Python 7,158 569 Updated Jul 15, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,930 605 Updated May 3, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,672 85 Updated Mar 8, 2024

A framework for few-shot evaluation of language models.

Python 11,981 3,147 Updated Apr 1, 2026

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,551 228 Updated Dec 15, 2025

SecGPT: a large language model for cybersecurity

Python 2,984 360 Updated Jun 25, 2025

Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment

Python 18,962 1,864 Updated Jul 15, 2025

An Open-sourced Knowledgeable Large Language Model Framework.

Python 1,383 133 Updated Jan 11, 2025

BELLE: Be Everyone's Large Language model Engine (open-source Chinese dialogue LLM)

HTML 8,288 768 Updated Oct 16, 2024

Repo for fine-tuning Causal LLMs

Python 460 86 Updated Mar 27, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,082 520 Updated Jul 1, 2025

User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.

Python 337 56 Updated Mar 24, 2023

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 72,685 7,792 Updated Mar 11, 2026

Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction

Python 219 36 Updated Jul 12, 2022

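Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, phone number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment scores, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of run-together English text, various Chinese word embeddings, company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …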

Python 79,766 15,149 Updated May 10, 2024

A simple Chinese event extraction model that jointly tags and recognizes triggers and entities while also determining entity roles.

Python 75 13 Updated Dec 19, 2020

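An experimental, demo-level tool for text information extraction (event-triple extraction) based on dependency parsing and semantic role labeling. It is a possible route to event chains and topic graphs, and can support text-understanding applications such as document topic chains and event timelines.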

Python 932 216 Updated Nov 26, 2022