gmlwns2000
  • Anyang, Korea

Highlights

  • Pro

Organizations

@Kawaian @NeuralAction

Starred repositories

Python 14 Updated Oct 3, 2025

The code for NeurIPS 2025 paper "A-MEM: Agentic Memory for LLM Agents"

Python 658 62 Updated Nov 1, 2025

Route, manage, and analyze your LLM requests across multiple providers with a unified API interface.

TypeScript 622 68 Updated Nov 5, 2025

Helpful tools and examples for working with flex-attention

Python 1,043 63 Updated Nov 1, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 50,720 5,350 Updated Nov 5, 2025

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,230 988 Updated Jul 31, 2025

Lifelong Learning with Dynamically Expandable Networks, ICLR 2018

Python 160 51 Updated May 4, 2020

RepoQA: Evaluating Long-Context Code Understanding

Python 122 7 Updated Nov 1, 2024

Tensara's GPU programming problems

Python 6 2 Updated Oct 30, 2025

Kortix – build, manage and train AI Agents. Fully Open Source.

TypeScript 18,514 3,159 Updated Nov 5, 2025

[CoLM'25] The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>

Python 150 8 Updated Jul 11, 2025

Shared Middle-Layer for Triton Compilation

MLIR 302 79 Updated Oct 27, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,622 256 Updated Oct 28, 2025

Continuous Thought Machines, because thought takes time and reasoning is a process.

Python 1,393 201 Updated Oct 14, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 52,100 7,628 Updated Nov 5, 2025

ResearchAgent

Python 13 2 Updated Aug 24, 2025

Kernels & AI inference engine for phones

C++ 3,625 213 Updated Nov 5, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 2 1 Updated Nov 5, 2025

real time face swap and one-click video deepfake with only a single image

Python 75,244 10,946 Updated Nov 5, 2025

Heterogeneous AI Computing Virtualization Middleware (Project under CNCF)

Go 2,573 412 Updated Nov 5, 2025

[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.

Python 112 3 Updated Jul 27, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,648 1,640 Updated Sep 30, 2025

This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.

HTML 89 20 Updated Nov 5, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,255 419 Updated Nov 3, 2025

Build Anything with AI Agents

TypeScript 1,542 104 Updated Oct 27, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,930 286 Updated May 15, 2025

An AI Hedge Fund Team

Python 42,211 7,469 Updated Oct 11, 2025

Automatic differentiation for Triton Kernels

Python 28 4 Updated Aug 12, 2025

Sage attention for turning.

Cuda 25 3 Updated Sep 5, 2025