- Anyang, Korea
Highlights
- Pro
-
-
RULER-hip Public
Forked from NVIDIA/RULERThis repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
-
FEA-Bench Public
Forked from microsoft/FEA-Bench[ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation
Python MIT License UpdatedAug 1, 2025 -
x-attention Public
Forked from mit-han-lab/x-attention[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring
Python UpdatedJul 30, 2025 -
InfiniteBench-hip Public
Forked from OpenBMB/InfiniteBenchCodes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
-
sea-attention Public
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
-
Awesome-LLM-Long-Context-Modeling Public
Forked from Xnhyacinth/Awesome-LLM-Long-Context-Modeling📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
MIT License UpdatedJun 9, 2025 -
-
MInference Public
Forked from jeffwillette/MInference[NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 whil…
Python MIT License UpdatedMay 12, 2025 -
lmms-eval Public
Forked from EvolvingLMMs-Lab/lmms-evalAccelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Python Other UpdatedMay 1, 2025 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedApr 14, 2025 -
sglang-hip12 Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models. See hip12-offload-add-offload-cache
Python Apache License 2.0 UpdatedMar 27, 2025 -
triton_bwd Public
Forked from daniel-geon-park/triton_bwdAutomatic differentiation for Triton Kernels
Python UpdatedMar 24, 2025 -
-
-
-
-
LongBench-hip Public
Forked from THUDM/LongBenchLongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
-
EXAONE-3.5 Public
Forked from LG-AI-EXAONE/EXAONE-3.5Official repository for EXAONE 3.5 built by LG AI Research
Other UpdatedDec 10, 2024 -
loft-hip Public
Forked from google-deepmind/loftLOFT: A 1 Million+ Token Long-Context Benchmark
Python Apache License 2.0 UpdatedNov 22, 2024 -
triton-fix-autotune Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedSep 20, 2024 -
-
InfiniGen Public
Forked from snu-comparch/InfiniGenInfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
-
hip-attention Public
Forked from DeepAuto-AI/hip-attentionTraining-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
Python UpdatedJun 25, 2024 -
-
-
gmlwns2000.github.io Public
Forked from RayeRen/acad-homepage.github.ioAcadHomepage: A Modern and Responsive Academic Personal Homepage
SCSS MIT License UpdatedJun 10, 2024 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedJun 6, 2024 -
-
vllm-timber Public archive
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedMay 14, 2024