- Anyang, Korea
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
The code for NeurIPS 2025 paper "A-MEM: Agentic Memory for LLM Agents"
Route, manage, and analyze your LLM requests across multiple providers with a unified API interface.
Helpful tools and examples for working with flex-attention
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.
Lifelong Learning with Dynamically Expandable Networks, ICLR 2018
RepoQA: Evaluating Long-Context Code Understanding
Kortix – build, manage and train AI Agents. Fully Open Source.
[CoLM'25] The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
Shared Middle-Layer for Triton Compilation
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
Continuous Thought Machines, because thought takes time and reasoning is a process.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
real time face swap and one-click video deepfake with only a single image
Heterogeneous AI Computing Virtualization Middleware(Project under CNCF)
[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Automatic differentiation for Triton Kernels
Official code repository for Sketch-of-Thought (SoT)