-
x-algorithm Public
Forked from xai-org/x-algorithmAlgorithm powering the For You feed on X
Rust Apache License 2.0 UpdatedJan 20, 2026 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedDec 8, 2025 -
flash-attention Public
Forked from vllm-project/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 8, 2025 -
DeepEP Public
Forked from deepseek-ai/DeepEPDeepEP: an efficient expert-parallel communication library
Cuda MIT License UpdatedDec 5, 2025 -
Liger-Kernel Public
Forked from linkedin/Liger-KernelEfficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License UpdatedNov 21, 2025 -
-
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 UpdatedOct 9, 2025 -
NVSHMEM-Tutorial Public
Forked from KuangjuX/NVSHMEM-TutorialNVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
Cuda UpdatedSep 18, 2025 -
-
-
x-attention Public
Forked from mit-han-lab/x-attentionXAttention: Block Sparse Attention with Antidiagonal Scoring
Python UpdatedJun 20, 2025 -
SpargeAttn Public
Forked from thu-ml/SpargeAttnSpargeAttention: A training-free sparse attention that can accelerate any model inference.
Cuda Apache License 2.0 UpdatedJun 12, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedApr 14, 2025 -
ktransformers Public
Forked from kvcache-ai/ktransformersA Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
-
3FS Public
Forked from deepseek-ai/3FSA high-performance distributed file system designed to address the challenges of AI training and inference workloads.
C++ MIT License UpdatedFeb 28, 2025 -
KAG Public
Forked from OpenSPG/KAGKAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
Python Apache License 2.0 UpdatedFeb 21, 2025 -
VectorDBBench Public
Forked from zilliztech/VectorDBBenchA Benchmark Tool for VectorDB
Python MIT License UpdatedFeb 12, 2025 -
langchain Public
Forked from langchain-ai/langchain🦜🔗 Build context-aware reasoning applications
Jupyter Notebook MIT License UpdatedDec 18, 2024 -
DB-GPT Public
Forked from eosphoros-ai/DB-GPTAI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
-
dspy Public
Forked from stanfordnlp/dspyDSPy: The framework for programming—not prompting—language models
Python MIT License UpdatedNov 22, 2024 -
tugraph-db Public
Forked from TuGraph-family/tugraph-dbTuGraph is a high performance graph database.
C++ Apache License 2.0 UpdatedNov 19, 2024 -
mem0 Public
Forked from mem0ai/mem0The Memory layer for your AI apps
Python Apache License 2.0 UpdatedNov 15, 2024 -
-
postgres Public
Forked from postgres/postgresMirror of the official PostgreSQL GIT repository. Note that this is just a *mirror* - we don't work with pull requests on github. To contribute, please see https://wiki.postgresql.org/wiki/Submitti…
C Other UpdatedOct 31, 2024 -
AI-System Public
Forked from microsoft/AI-SystemSystem for AI Education Resource.
Python Creative Commons Attribution 4.0 International UpdatedOct 25, 2024 -
crewAI Public
Forked from crewAIInc/crewAIFramework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Python MIT License UpdatedOct 21, 2024 -
llama.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedOct 18, 2024 -
llama_index Public
Forked from run-llama/llama_indexLlamaIndex is a data framework for your LLM applications
Python MIT License UpdatedOct 17, 2024 -
-