Skip to content
View kminsoo's full-sized avatar

Block or report kminsoo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Accelerating MoE with IO and Tile-aware Optimizations

Python 447 26 Updated Dec 23, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,958 358 Updated Dec 23, 2025

Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Python 129 5 Updated Dec 17, 2025

Official implement of paper "Revisiting Multimodal Positional Encoding in Vision–Language Models"

Python 44 1 Updated Dec 9, 2025
13 Updated Nov 19, 2025

Official Implementation of FedLPA (Neurips 2025)

4 Updated Oct 11, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,287 103 Updated Oct 29, 2025

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 214 36 Updated Dec 23, 2025

HuggingFace conversion and training library for Megatron-based models

Python 305 109 Updated Dec 23, 2025
Python 806 69 Updated Oct 13, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,353 1,452 Updated Nov 28, 2025

[NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs".

Python 76 3 Updated Jun 17, 2024

Perplexity GPU Kernels

C++ 542 74 Updated Nov 7, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,731 2,877 Updated Dec 23, 2025

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 142 29 Updated Dec 19, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,023 586 Updated Dec 22, 2025

SKT A.X 4.0 VL Light

7 Updated Jul 30, 2025

"Syntriever: How to Train Your Retriever with Synthetic Data from LLMs" the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), Findings, Accepted

Python 29 Updated Mar 5, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,119 99 Updated Nov 23, 2025

the LLM vulnerability scanner

Python 6,663 735 Updated Dec 22, 2025
Python 109 8 Updated Nov 19, 2025

SKT A.X LLM 3.1

11 Updated Jul 24, 2025

Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning

Python 228 8 Updated Apr 19, 2025

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,530 316 Updated Feb 18, 2025

Scalable data pre processing and curation toolkit for LLMs

Python 1,289 199 Updated Dec 22, 2025

Implementation of "Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance" (WACV 2025).

Python 2 Updated Jun 17, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,583 595 Updated Dec 23, 2025

A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.

Python 288 25 Updated Aug 15, 2025

A lightweight LMM-based Document Parsing Model

Python 6,388 441 Updated Dec 8, 2025
Next