Organizations

@callaunchpad

a collection of mini-games on mechanistic interpretability

Jupyter Notebook 2 Updated May 4, 2025

NVIDIA Linux open GPU with P2P support

C 1,344 139 Updated Jun 6, 2025

My learning notes for ML SYS.

Python 5,958 389 Updated Apr 8, 2026
Jupyter Notebook 7 Updated Dec 7, 2024

[EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering

Python 17 1 Updated Oct 31, 2024

[ICLR 2025] Palu: Compressing KV-Cache with Low-Rank Projection

Python 154 15 Updated Feb 20, 2025

Helpful tools and examples for working with flex-attention

Python 1,171 76 Updated Apr 1, 2026

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python 399 39 Updated Apr 20, 2024

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 450 23 Updated Oct 16, 2024

[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

Python 81 3 Updated Nov 25, 2024

Sudoku solving in Python packaging

Python 450 6 Updated Oct 20, 2024

3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)

Python 3,595 612 Updated Apr 9, 2026

Code for visualizing the loss landscape of neural nets

Python 3,164 438 Updated Apr 5, 2022

Materials for learning SGLang

794 60 Updated Jan 5, 2026

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 569 27 Updated Jan 4, 2025
Python 309 31 Updated Jul 10, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,208 398 Updated Jul 11, 2024

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Python 511 77 Updated Aug 1, 2024

Efficient Triton Kernels for LLM Training

Python 6,265 510 Updated Apr 8, 2026

Streamlit Component to quickly create Interactive Flow Diagrams using React Flow

JavaScript 342 28 Updated Jun 24, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,203 67 Updated Nov 9, 2025

Accompanying code for the paper Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation.

Python 6 Updated Aug 15, 2024

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

Python 1,839 129 Updated Jul 10, 2024

Everything you want to know about Google Cloud TPU

Python 567 31 Updated Jul 16, 2024

[ACM MM 2023] Official implementation of "Hierarchical Masked 3D Diffusion Model for Video Outpainting"

Python 110 8 Updated May 6, 2024

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,786 1,008 Updated Sep 20, 2025

[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"

Python 10,906 1,090 Updated Aug 29, 2025

A markup-based typesetting system that is powerful and easy to learn.

Rust 52,576 1,536 Updated Apr 9, 2026

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Python 198 24 Updated Jan 14, 2024