gmlwns2000

AinL gmlwns2000

Deep-learning-based A.I. code-bot. Actually, I am the ingredient of the AI (Heejun Lee)

147 followers · 234 following

Anyang, Korea

Achievements

Highlights

Organizations

gmlwns2000 Public

Updated Oct 4, 2025
RULER-hip Public
Forked from NVIDIA/RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1 Apache License 2.0 Updated Aug 19, 2025
FEA-Bench Public
Forked from microsoft/FEA-Bench

[ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation

Python MIT License Updated Aug 1, 2025
x-attention Public
Forked from mit-han-lab/x-attention

[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring

Python Updated Jul 30, 2025
InfiniteBench-hip Public
Forked from OpenBMB/InfiniteBench

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python 1 MIT License Updated Jul 15, 2025
sea-attention Public

Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)

attention linear-attention efficient-attention sea-attention

Python 11 1 Updated Jun 20, 2025
Awesome-LLM-Long-Context-Modeling Public
Forked from Xnhyacinth/Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

MIT License Updated Jun 9, 2025
election-2025 Public

Python Updated Jun 4, 2025
MInference Public
Forked from jeffwillette/MInference

[NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 whil…

Python MIT License Updated May 12, 2025
lmms-eval Public
Forked from EvolvingLMMs-Lab/lmms-eval

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python Other Updated May 1, 2025
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python Apache License 2.0 Updated Apr 14, 2025
sglang-hip12 Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models. See hip12-offload-add-offload-cache

Python Apache License 2.0 Updated Mar 27, 2025
triton_bwd Public
Forked from daniel-geon-park/triton_bwd

Automatic differentiation for Triton Kernels

Python Updated Mar 24, 2025
triton-autograd Public

Updated Mar 7, 2025
hip-ainl Public

Python 3 1 Other Updated Jan 23, 2025
llmperf-hip Public

Python Apache License 2.0 Updated Jan 22, 2025
cascading_kv_cache Public
Forked from jeffwillette/cascading_kv_cache

Python Updated Jan 18, 2025
LongBench-hip Public
Forked from THUDM/LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Python 2 MIT License Updated Jan 15, 2025
EXAONE-3.5 Public
Forked from LG-AI-EXAONE/EXAONE-3.5

Official repository for EXAONE 3.5 built by LG AI Research

Other Updated Dec 10, 2024
loft-hip Public
Forked from google-deepmind/loft

LOFT: A 1 Million+ Token Long-Context Benchmark

Python Apache License 2.0 Updated Nov 22, 2024
triton-fix-autotune Public
Forked from triton-lang/triton

Development repository for the Triton language and compiler

C++ MIT License Updated Sep 20, 2024
hpc Public

Python Updated Sep 9, 2024
InfiniGen Public
Forked from snu-comparch/InfiniGen

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Python 1 Apache License 2.0 Updated Sep 9, 2024
hip-attention Public
Forked from DeepAuto-AI/hip-attention

Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

Python Updated Jun 25, 2024
image-augmentation-server Public

Python Updated Jun 19, 2024
image-lm Public

Jupyter Notebook Updated Jun 18, 2024
gmlwns2000.github.io Public
Forked from RayeRen/acad-homepage.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS MIT License Updated Jun 10, 2024
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python MIT License Updated Jun 6, 2024
ai-fact-check-accuracy Public

Jupyter Notebook Updated May 27, 2024
vllm-timber Public archive
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated May 14, 2024

AinL gmlwns2000

Achievements

Achievements

Highlights

Organizations

gmlwns2000 Public

Uh oh!

RULER-hip Public

Uh oh!

FEA-Bench Public

Uh oh!

x-attention Public

Uh oh!

InfiniteBench-hip Public

Uh oh!

sea-attention Public

Uh oh!

Awesome-LLM-Long-Context-Modeling Public

Uh oh!

election-2025 Public

Uh oh!

MInference Public

Uh oh!

lmms-eval Public

Uh oh!

transformers Public

Uh oh!

sglang-hip12 Public

Uh oh!

triton_bwd Public

Uh oh!

triton-autograd Public

Uh oh!

hip-ainl Public

Uh oh!

llmperf-hip Public

Uh oh!

cascading_kv_cache Public

Uh oh!

LongBench-hip Public

Uh oh!

EXAONE-3.5 Public

Uh oh!

loft-hip Public

Uh oh!

triton-fix-autotune Public

Uh oh!

hpc Public

Uh oh!

InfiniGen Public

Uh oh!

hip-attention Public

Uh oh!

image-augmentation-server Public

Uh oh!

image-lm Public

Uh oh!

gmlwns2000.github.io Public

Uh oh!

lm-evaluation-harness Public

Uh oh!

ai-fact-check-accuracy Public

Uh oh!

vllm-timber Public archive

Uh oh!