Skip to content
View cnbeining's full-sized avatar

Block or report cnbeining

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

高性能的真·Coze API

JavaScript 133 28 Updated Apr 12, 2024

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,133 211 Updated Oct 8, 2024

Gradm (Gradle dependencies manager)

Kotlin 37 2 Updated Sep 17, 2024

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,610 279 Updated Oct 20, 2024

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 583 35 Updated Jul 22, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 33,196 4,082 Updated Oct 29, 2024

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,245 716 Updated Aug 5, 2024

Run inference on replit-3B code instruct model using CPU

Python 155 27 Updated Jul 5, 2023

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,443 476 Updated Sep 28, 2024

Tune MPTs

Python 84 16 Updated Jun 17, 2023
Python 526 43 Updated Jan 16, 2024

Official Code for Paper: RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text

Python 970 155 Updated May 15, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,012 821 Updated Jun 10, 2024

C++ implementation for 💫StarCoder

C 446 36 Updated Sep 9, 2023

⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops

Python 905 68 Updated Jul 15, 2024

📋 A list of open LLMs available for commercial use.

11,111 718 Updated Jul 5, 2024

Inference code and configs for the ReplitLM model family

Python 927 81 Updated Oct 9, 2023

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,439 155 Updated Feb 24, 2024

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHX…

Python 2,603 206 Updated Sep 23, 2024

Port of OpenAI's Whisper model in C/C++

C++ 35,328 3,602 Updated Oct 29, 2024

Faster Whisper transcription with CTranslate2

Python 12,136 1,017 Updated Oct 29, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,730 374 Updated Mar 14, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,823 4,537 Updated Oct 24, 2024

Open Academic Research on Improving LLaMA to SOTA LLM

Python 1,605 102 Updated Aug 30, 2023

Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.

1,937 91 Updated Sep 20, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 7,893 758 Updated Oct 16, 2024

LLM as a Chatbot Service

Python 3,287 381 Updated Nov 20, 2023
Python 406 29 Updated Mar 22, 2023

Inference script for Meta's LLaMA models using Hugging Face wrapper

Python 110 5 Updated Mar 24, 2023

Quantized inference code for LLaMA models

Python 1,053 105 Updated Mar 17, 2023
Next