Stars
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical LLMs, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.
CNN-RNN Chinese text classification, based on TensorFlow
Chinese Language Understanding Evaluation Benchmark (CLUE): datasets, baselines, pre-trained models, corpus, and leaderboard
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
Fengshenbang-LM is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at IDEA, serving as infrastructure for Chinese AIGC and cognitive intelligence.
A series of large language models developed by Baichuan Intelligent Technology
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use, and extensible toolkit for large-scale models.
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)
A 13B large language model developed by Baichuan Intelligent Technology
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Chinese LangChain project | Xiaobiying, Q.Talk, QiangLiao, QiangTalk
Fine-tuning ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B for specific downstream tasks, covering Freeze, LoRA, P-tuning, and full-parameter fine-tuning
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram
Tacotron-2 TensorFlow implementation
A simple prompt-chatting AI based on wechaty and a fine-tuned NLP model
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Corpora for training Chinese and English chatbot systems
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models