Stars
Accelerating MoE with IO and Tile-aware Optimizations
slime is an LLM post-training framework for RL Scaling.
Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
Official implementation of the paper "Revisiting Multimodal Positional Encoding in Vision–Language Models"
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'
PyTorch Distributed native training library for LLMs/VLMs with out-of-the-box Hugging Face support
HuggingFace conversion and training library for Megatron-based models
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs".
verl: Volcano Engine Reinforcement Learning for LLMs
Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.
A library for accelerating Transformer models on NVIDIA GPUs, including support for 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
"Syntriever: How to Train Your Retriever with Synthetic Data from LLMs" the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), Findings, Accepted
Muon is an optimizer for hidden layers in neural networks
Code for "MetaMorph: Multimodal Understanding and Generation via Instruction Tuning"
Strong, open-source foundation models for image recognition.
Scalable data preprocessing and curation toolkit for LLMs
Implementation of "Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance" (WACV 2025).
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
A faster LayoutReader model based on LayoutLMv3 that sorts OCR bounding boxes into reading order.
A lightweight LMM-based Document Parsing Model