-
Peking University
- Shenzhen
-
10:06
(UTC +08:00)
-
Pai-Megatron-Patch Public
Forked from alibaba/Pai-Megatron-PatchThe official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Python Apache License 2.0 UpdatedSep 29, 2025 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedSep 13, 2025 -
-
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Python Apache License 2.0 UpdatedAug 25, 2025 -
LLaVA-CoT Public
Forked from PKU-YuanGroup/LLaVA-CoTLLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Python Apache License 2.0 UpdatedJan 22, 2025 -
-
Efficient-Diffusion-Model-Survey Public
Forked from AIoT-MLSys-Lab/Efficient-Diffusion-Model-SurveyApache License 2.0 UpdatedJan 7, 2025 -
FastVideo Public
Forked from hao-ai-lab/FastVideoFastVideo is an open-source framework for accelerating large video diffusion model.
Python Apache License 2.0 UpdatedDec 23, 2024 -
-
The paper collections for the efficient diffusion models.
UpdatedOct 22, 2024 -
MM-NIAH Public
Forked from OpenGVLab/MM-NIAHThis is the official implementation of the paper "Needle In A Multimodal Haystack"
Python UpdatedJul 2, 2024 -
LOOK-M Public
Forked from SUSTechBruce/LOOK-MOfficial implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"
Python MIT License UpdatedJul 1, 2024 -
open-interpreter Public
Forked from openinterpreter/open-interpreterA natural language interface for computers
Python GNU Affero General Public License v3.0 UpdatedJun 25, 2024 -
-
Mixture-of-depths Public
Forked from astramind-ai/Mixture-of-depthsUnofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Python UpdatedApr 18, 2024 -
JetMoE Public
Forked from myshell-ai/JetMoEReaching LLaMA2 Performance with 0.1M Dollars
Python Apache License 2.0 UpdatedApr 15, 2024 -
-
-
llama_gger.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedMar 28, 2024 -
bitsandbytes Public
Forked from bitsandbytes-foundation/bitsandbytesAccessible large language models via k-bit quantization for PyTorch.
Python MIT License UpdatedMar 28, 2024 -
llm-action Public
Forked from liguodongiot/llm-action本项目旨在分享大模型相关技术原理以及实战经验。
Python Apache License 2.0 UpdatedMar 27, 2024 -
openvino Public
Forked from openvinotoolkit/openvinoOpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
C++ Apache License 2.0 UpdatedMar 27, 2024 -
Multimodal-Roadmap-for-freshman Public
Forked from inFaaa/Multimodal-Roadmap-for-freshman本项目用于Multimodal领域新手的学习路线,包括该领域的经典论文,项目及课程。旨在希望学习者在一定的时间内达到对这个领域有较为深刻的认知,能够自己进行的独立研究。
UpdatedMar 26, 2024 -
CTranslate2 Public
Forked from OpenNMT/CTranslate2Fast inference engine for Transformer models
C++ MIT License UpdatedMar 25, 2024 -
TinyLlama Public
Forked from jzhang38/TinyLlamaThe TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Python Apache License 2.0 UpdatedMar 25, 2024 -
BigDL Public
Forked from intel/ipex-llmAccelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max)…
Jupyter Notebook Apache License 2.0 UpdatedMar 22, 2024 -
Open-Sora-Plan-deploy Public
Forked from PKU-YuanGroup/Open-Sora-PlanThis project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
Python MIT License UpdatedMar 20, 2024 -
MiniCPM Public
Forked from OpenBMB/MiniCPMMiniCPM-2B: An end-side LLM outperforms Llama2-13B.
Python Apache License 2.0 UpdatedMar 16, 2024 -
Awesome-Multimodal-Large-Language-Models Public
Forked from BradyFU/Awesome-Multimodal-Large-Language-Models✨✨Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
UpdatedMar 15, 2024