Skip to content
View kylehh's full-sized avatar
  • Nvidia (ex-AWS/Anyscale)
  • Houston, TX

Block or report kylehh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,798 907 Updated Sep 30, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 59,709 10,583 Updated Oct 9, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,869 1,647 Updated Oct 9, 2025

Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone

Python 1,023 125 Updated Nov 13, 2024

LLMPerf is a library for validating and benchmarking LLMs

Python 1,021 191 Updated Dec 9, 2024

FireAct: Toward Language Agent Fine-tuning

Python 280 21 Updated Oct 22, 2023

RayLLM - LLMs on Ray (Archived). Read README for more info.

1,262 93 Updated Mar 13, 2025

Numbers every LLM developer should know

4,260 139 Updated Jan 16, 2024

🦜🔗 Build context-aware reasoning applications

Python 116,895 19,235 Updated Oct 9, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,259 6,864 Updated Oct 9, 2025

🌍 针对小白的算法训练 | 包括四部分:①.大厂面经 ②.力扣图解 ③.千本开源电子书 ④.百张技术思维导图(项目花了上百小时,希望可以点 star 支持,🌹感谢~)推荐免费ChatGPT使用网站

Java 35,818 6,466 Updated Jun 13, 2023

Everything you need in order to get YOLOv3 up and running in the cloud. Learn to train your custom YOLOv3 object detector in the cloud for free!

Jupyter Notebook 91 77 Updated Oct 22, 2020

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Python 10,466 3,454 Updated Oct 4, 2025

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

C 22,147 7,955 Updated Aug 28, 2025

A curated list of deep learning resources for computer vision

11,063 2,783 Updated Aug 15, 2023

A cloud-native open-source unified multi-cloud and hybrid-cloud platform. 开源、云原生的多云管理及混合云融合平台

Go 2,800 595 Updated Oct 9, 2025

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

274,514 21,044 Updated Aug 22, 2025