aflah02

Organizations

@BioBytesIIITD

2756 results for source starred repositories

Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT

Python 26 Updated Feb 5, 2026

A template for research projects in computer science/machine learning using Python and Julia

Python 84 3 Updated Jan 22, 2026

PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 271 54 Updated Feb 5, 2026

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 414 163 Updated Feb 5, 2026

For releasing code related to compression methods for transformers, accompanying our publications

Python 455 56 Updated Jan 16, 2025

[ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks

Python 39 5 Updated Feb 4, 2025

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Python 90 12 Updated Sep 13, 2024

A CLI to estimate inference memory requirements for Hugging Face models, written in Python.

Python 664 57 Updated Feb 4, 2026

Agent skills for Manim to create 3Blue1Brown style animations.

Python 559 38 Updated Jan 23, 2026

Experimental mini Python Jupyter kernel

Python 5 Updated Feb 1, 2026

A Model Agnostic function to directly remove specified layers from the LLM

Python 10 Updated May 23, 2024
Python 7 1 Updated May 30, 2025

Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"

Python 39 4 Updated May 1, 2025

Official repository for EMNLP2025 paper "IG-Pruning: Input-Guided Block Pruning for Large Language Models"

Python 6 1 Updated Nov 9, 2025

Setup guide for ML training on NVIDIA DGX Spark (GB10 Blackwell, CUDA 13, aarch64)

Shell 85 9 Updated Jan 15, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 3,561 240 Updated Jan 14, 2026

Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding

Python 200 18 Updated Jan 12, 2026

LLM checkpointing for DeepSpeed/Megatron

C++ 24 6 Updated Nov 30, 2025

nanoRLHF: from-scratch journey into how LLMs and RLHF really work.

Python 149 13 Updated Jan 23, 2026
Cuda 17 Updated Jan 25, 2026
Python 1 Updated Jan 8, 2026

Use git from python, fast

Jupyter Notebook 6 Updated Jan 29, 2026

Solar vs GLM vs Phi

Python 102 10 Updated Jan 2, 2026

🦋 An Infographic Generation and Rendering Framework, bring words to life with AI!

TypeScript 4,228 291 Updated Feb 4, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,313 413 Updated Jan 19, 2026

Data mapping framework for rust stuff

Rust 44 4 Updated Feb 5, 2026

Tooling for exact and MinHash deduplication of large-scale text datasets

Rust 66 5 Updated Feb 4, 2026
Python 44 5 Updated Jan 20, 2026