Stars
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
CUDA Templates and Python DSLs for High-Performance Linear Algebra
FlashMLA: Efficient Multi-head Latent Attention Kernels
🐧 Linux 上完整的 Clash / Mihomo (Clash Meta) 管理工具
An extension library of tensorflow to accelerate industrial recommendation system model training
按流量计费机场推荐|SS、SSR、V2Ray、Trojan、Clash、winXray、Shadow| 布鲁克翻墙机场墙不同机场、VPN,采用机场GFW开发的翻协议,在速度和稳定性方面非常出色。需要注册通知订阅并作为软件使用,本机场供你推荐好用的地址。
Alluxio, data orchestration for analytics and machine learning in the cloud
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
An open-source C++ library developed and used at Facebook.
FlashInfer: Kernel Library for LLM Serving
AddressSanitizer, ThreadSanitizer, MemorySanitizer
定投改变命运 —— 让时间陪你慢慢变富 https://onregularinvesting.com
Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documen…
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on h…
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
This repo contains source code and materials for the TEmporally COherent GAN SIGGRAPH project.
📚Java编程书籍收集分享。Java programming books collection to share.🚀
A General-purpose Task-parallel Programming System in C++