Skip to content
View wooksu's full-sized avatar

Organizations

@nota-github

Block or report wooksu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

the best agent harness

TypeScript 32,252 2,430 Updated Feb 19, 2026

A simple yet powerful agent framework that delivers with open-source models

Python 4,427 453 Updated Feb 3, 2026

ERGO (Efficient Reasoning & Guided Observation) is a large vision–language model trained with reinforcement learning on efficiency objectives.

Python 12 1 Updated Feb 19, 2026

[ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Python 75 2 Updated Jan 26, 2026
Python 38 1 Updated Jul 14, 2025

Nano vLLM

Python 11,733 1,587 Updated Nov 3, 2025

[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.

Python 87 5 Updated Sep 20, 2025

Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"

Python 173 10 Updated Jan 16, 2026
Python 1,126 69 Updated Nov 20, 2025

Open-source unified multimodal model

Python 5,677 503 Updated Oct 27, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,351 60 Updated Dec 7, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,615 355 Updated Feb 10, 2026

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,166 144 Updated Feb 11, 2026

Solve Visual Understanding with Reinforced VLMs

Python 5,844 377 Updated Oct 21, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,320 103 Updated Oct 29, 2025

Witness the aha moment of VLM with less than $3.

Python 4,033 285 Updated May 19, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 70,664 13,536 Updated Feb 19, 2026

A paper list of some recent works about Token Compress for Vit and VLM

833 39 Updated Feb 10, 2026

A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]

Python 312 19 Updated Jul 6, 2024

A 28× Compressed Wav2Lip for Efficient Talking Face Generation [ICCV'23 Demo] [MLSys'23 Workshop] [NVIDIA GTC'23]

Python 61 6 Updated Mar 8, 2024

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Python 90 12 Updated Sep 13, 2024

The official NetsPresso Python package.

Jupyter Notebook 48 1 Updated Nov 20, 2025

A library for training, compressing and deploying computer vision models (including ViT) with edge devices

Python 74 12 Updated Sep 29, 2025

Repository for 2023 AI City Challenge (Track1: Multi-Camera People Tracking)

Python 37 6 Updated Oct 7, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 156,665 32,130 Updated Feb 19, 2026

Polynomial Learning Rate Decay Scheduler for PyTorch

Python 65 13 Updated Dec 25, 2021

Learning Rate Warmup in PyTorch

Python 415 23 Updated Jun 19, 2025

An easy to use PyTorch to TensorRT converter

Python 4,856 697 Updated Aug 17, 2024

Conversion of PyTorch Models into TFLite

Python 399 52 Updated Mar 30, 2023