Skip to content
View jiangliqin's full-sized avatar

Block or report jiangliqin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
300 stars written in Python
Clear filter

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。

Python 4,284 629 Updated Aug 30, 2025

CNN-RNN中文文本分类,基于TensorFlow

Python 4,263 1,464 Updated Mar 31, 2024

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,201 547 Updated Sep 8, 2025

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 4,180 730 Updated Jul 19, 2025

中文公开聊天语料库

Python 4,160 793 Updated Apr 23, 2024

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,149 383 Updated Aug 13, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,120 294 Updated Nov 8, 2024

Minimal keyword extraction with BERT

Python 4,039 373 Updated Oct 23, 2025

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,978 331 Updated Jun 12, 2024

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Python 3,879 419 Updated Sep 10, 2025

基于ChatGLM-6B + LoRA的Fintune方案

Python 3,768 443 Updated Nov 25, 2023

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Python 3,723 476 Updated Oct 12, 2023

中文分词

Python 3,203 803 Updated Jan 16, 2025

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Python 3,143 642 Updated Jan 22, 2024

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Python 3,096 526 Updated May 9, 2024

GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)

Python 3,016 677 Updated Oct 30, 2023

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,959 237 Updated Sep 6, 2023

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,899 175 Updated May 26, 2025

中文langchain项目|小必应,Q.Talk,强聊,QiangTalk

Python 2,813 333 Updated Jun 20, 2023

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等

Python 2,773 315 Updated Dec 12, 2023

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,538 249 Updated Apr 24, 2024

💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram

Python 2,446 255 Updated Nov 17, 2023

百亿参数的中英文双语基座大模型

Python 2,420 179 Updated Jul 28, 2023

Large-scale pretraining for dialogue

Python 2,413 347 Updated Oct 17, 2022

DeepMind's Tacotron-2 Tensorflow implementation

Python 2,314 909 Updated Jul 6, 2023

A simple prompt-chatting AI based on wechaty and fintuned NLP model

Python 2,234 431 Updated Feb 16, 2023

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 2,102 230 Updated May 20, 2024

用于训练中英文对话系统的语料库 Datasets for Training Chatbot System

Python 2,051 494 Updated Sep 23, 2020

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Python 2,041 294 Updated Mar 19, 2024

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Python 1,904 264 Updated Jun 12, 2023