Skip to content
View Orion-zhen's full-sized avatar
💥
CUDA Out Of Memory
💥
CUDA Out Of Memory

Block or report Orion-zhen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. our our Public

    Orion User's Repository for Arch Linux

    Python 3

  2. abliteration abliteration Public

    Make abliterated models with transformers, easy and fast

    Python 107 40

  3. turboderp-org/exllamav2 turboderp-org/exllamav2 Public

    A fast inference library for running LLMs locally on modern consumer-class GPUs

    Python 4.4k 325

  4. hiyouga/LLaMA-Factory hiyouga/LLaMA-Factory Public

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Python 64.2k 7.8k

  5. SJTU-IPADS/PowerInfer SJTU-IPADS/PowerInfer Public

    High-speed Large Language Model Serving for Local Deployment

    C++ 8.5k 461

  6. CrazyBoyM/llama3-Chinese-chat CrazyBoyM/llama3-Chinese-chat Public

    Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。

    Python 4.2k 336