Skip to content
View khotyn's full-sized avatar
😌
Focusing
😌
Focusing

Organizations

@acug @sofastack

Block or report khotyn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors

TypeScript 57,808 2,719 Updated Jun 19, 2026

The API to search, scrape, and interact with the web at scale. 🔥

TypeScript 136,188 7,905 Updated Jun 21, 2026

Markdown Architectural Decision Records

Markdown 2,277 462 Updated Jun 21, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,788 79,504 Updated Jun 21, 2026

A simple, performant and scalable Jax LLM!

Python 2,331 540 Updated Jun 21, 2026

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 49,965 5,308 Updated Jun 21, 2026

Incredibly fast JavaScript runtime, bundler, test runner, and package manager – all in one

Rust 93,337 4,721 Updated Jun 21, 2026

A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.

C++ 1,349 233 Updated Jun 21, 2026

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 6,693 729 Updated Jun 20, 2026

Public repository for Agent Skills

Python 153,458 18,091 Updated Jun 9, 2026

The best ChatGPT that $100 can buy.

Python 55,287 7,589 Updated May 5, 2026

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 4,490 717 Updated Jun 21, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Python 133,586 21,601 Updated Jun 20, 2026

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 901 262 Updated Jun 18, 2026

Data driven agentic landscapes and insights. Produced by Ant Open Source and inclusionAI.

TypeScript 498 35 Updated Jun 17, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,664 972 Updated Jun 17, 2026

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,568 316 Updated Jul 17, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 35,869 3,649 Updated Jun 21, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,065 4,107 Updated Jun 18, 2026

iTerm2 is a terminal emulator for Mac OS X that does amazing things.

Objective-C 17,717 1,412 Updated Jun 21, 2026

[HPCA 2026] AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.

Python 365 130 Updated Apr 22, 2026

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,321 523 Updated Jun 19, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,398 1,056 Updated Jun 4, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,933 1,918 Updated Jun 21, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,708 1,063 Updated Apr 30, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,317 1,320 Updated Jun 21, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,491 6,637 Updated Jun 21, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,321 8,852 Updated Jun 17, 2026

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 336 22 Updated Apr 24, 2025

Module, Model, and Tensor Serialization/Deserialization

Python 313 52 Updated Apr 30, 2026
Next