bokyeong1015

Organizations

@nota-github @Nota-NetsPresso


Official implementation of LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents

Python 19 2 Updated Feb 1, 2026

Code and training scripts for FlexOlmo

Python 150 24 Updated Apr 20, 2026

omo/lazycodex: The coding agent for tokenmaxxers; the one and only agent harness for complex codebases. For your Codex, for your OpenCode

TypeScript 62,478 5,058 Updated Jun 16, 2026

PaperBanana: Automating Academic Illustration For AI Scientists

Python 6,573 488 Updated May 11, 2026
Jupyter Notebook 3 2 Updated Feb 3, 2026

[NeurIPS 2025] Official repositories for "Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought".

Python 28 Updated Jan 30, 2026

MR-Pruner: Training-free Multi-resolution Visual Token Pruning for Multi-modal Large Language Models

Python 2 Updated Nov 16, 2025

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" (ECCV 2020)

Python 136 19 Updated May 26, 2024

[CVPR 2025 Highlight] Official implementation of "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity"

Python 141 8 Updated Mar 25, 2025

Scalable data preprocessing and curation toolkit for LLMs

Python 1,619 287 Updated Jun 17, 2026

Fast Multimodal Semantic Deduplication & Filtering

Python 936 57 Updated May 24, 2026

ERGO (Efficient Reasoning & Guided Observation) is a large vision-language model trained with reinforcement learning on efficiency objectives. [ICLR'26]

Python 19 1 Updated Feb 25, 2026

Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 950 83 Updated Jun 7, 2026

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,454 83 Updated Feb 11, 2026

Tools for merging pretrained large language models.

Python 7,157 738 Updated Jun 13, 2026
Python 106 6 Updated Jun 10, 2025

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 406 25 Updated Oct 7, 2024

[ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding

Python 17 3 Updated Nov 10, 2025

[ACL 2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Python 89 4 Updated May 20, 2025

Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference

Python 14 1 Updated Jun 7, 2025

[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.

Python 104 5 Updated Sep 20, 2025

Official code for the paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024 Best Paper]

Python 240 23 Updated Jan 3, 2026

Vision Language Models are Biased

Python 113 3 Updated Jan 26, 2026

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,866 67 Updated May 29, 2026

[NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3)

Python 221 13 Updated Feb 11, 2026

OpenThinkIMG is an end-to-end open-source framework that empowers Large Vision-Language Models to think with images.

Jupyter Notebook 123 7 Updated Jul 11, 2025

[ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs

Python 86 4 Updated Jan 17, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,409 545 Updated Jun 16, 2026

Pixel-Level Reasoning Model trained with RL [NeurIPS'25]

Python 299 13 Updated Jun 4, 2026