Starred repositories
soon14 / Linfer
Forked from l-sf/Linfer基于TensorRT的C++高性能推理库,Yolov10, YoloPv2,Yolov5/7/X/8,RT-DETR,单目标跟踪OSTrack、LightTrack。
PrithivirajDamodaran / blitz-embed
Forked from iamlemec/bert.cppC++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welcome.
iamlemec / llama.cpp
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
xyzhang626 / embeddings.cpp
Forked from skeskinen/bert.cppggml implementation of embedding models including SentenceTransformer and BGE
hadoop2xu / fastllm
Forked from ztxz16/fastllm纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
A framework for few-shot evaluation of language models.
Tonic-AI / EasyAGI
Forked from Josephrp/LablabAutogen🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks.
Physton / ChatGPT-Next-Web
Forked from ChatGPTNextWeb/NextChat基于 Yidadaa/ChatGPT-Next-Web 修改
danielcy715 / llm_search
Forked from shibing624/SearchGPTllm search: Building a quick conversation-based search engine with LLMs.
iamlemec / bert.cpp
Forked from xyzhang626/embeddings.cppGGML implementation of BERT model with Python bindings and quantization.
kingbri1 / flash-attention
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
shibing624 / SearchGPT
Forked from leptonai/search_with_leptonSearchGPT: Building a quick conversation-based search engine with LLMs.
A simplest LLM-powered search in 200 lines, based on Lepton AI's work.
lchh5 / GeminiPro-Next-Web
Forked from ChatGPTNextWeb/NextChatGoogle Gemini Pro UI (Base on ChatGPT-Next-Web). 一键拥有你自己的跨平台 Gemini 应用。
creisle / doccano
Forked from doccano/doccanoOpen source annotation tool for machine learning practitioners.
sharpHL / llama.cpp
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
hariag / chatglm.cpp
Forked from li-plus/chatglm.cppC++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs for cuda
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
vilm-ai / llm-factory
Forked from hiyouga/LlamaFactoryLite version of LLaMA-Factory. Easy-to-use LLM fine-tuning framework (LLaMA, Mistral, Qwen, etc.)
dasmy / gpt-code-ui
Forked from ricklamers/gpt-code-uiAn open source implementation of OpenAI's ChatGPT Code interpreter
Wheels for llama-cpp-python compiled with cuBLAS support
Explore what LLMs are really leanring over SFT
gagb / autogen
Forked from microsoft/autogenEnable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
sudoskys / KwaiAgents
Forked from KwaiKEG/KwaiAgentsA generalized information-seeking agent system with Large Language Models (LLMs).
Update your Ollama models to their latest versions with Bun!
v3ucn / Bert-VITS2
Forked from fishaudio/Bert-VITS2vits2 backbone with multilingual-bert
xmxoxo / Tianchi-LLM-QA
Forked from dawoshi/Tianchi-LLM-QA阿里天池: 2023全球智能汽车AI挑战赛——赛道一:AI大模型检索问答 baseline 80+
luweigen / whisper_streaming
Forked from ufal/whisper_streamingWhisper realtime streaming for long speech-to-text transcription and translation
litagin02 / Style-Bert-VITS2
Forked from fishaudio/Bert-VITS2Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.