Bo-Kyeong Kim bokyeong1015

ML Researcher at Nota Inc.

23 followers · 27 following

Nota Inc.
Seoul, Korea
https://sites.google.com/view/bkkim
in/bokyeong1015
https://scholar.google.co.kr/citations?user=hIWBLUgAAAAJ&hl=en

Stars

hjeon2k / LRAgent

Official implementation of LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents

Python 19 2 Updated Feb 1, 2026

allenai / FlexOlmo

Code and training scripts for FlexOlmo

Python 150 24 Updated Apr 20, 2026

code-yeongyu / oh-my-openagent

omo/lazycodex: The coding agent for tokenmaxxers; the one and only agent harness for complex codebases. For your Codex, for your OpenCode.

TypeScript 62,478 5,058 Updated Jun 16, 2026

dwzhu-pku / PaperBanana

PaperBanana: Automating Academic Illustration For AI Scientists

Python 6,573 488 Updated May 11, 2026

safal312 / on-the-limits-of-layer-pruning

Jupyter Notebook 3 2 Updated Feb 3, 2026

wbfwonderful / Vad-R1

[NeurIPS 2025] Official repository for "Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought".

Python 28 Updated Jan 30, 2026

gooriiie / MR-Pruner

MR-Pruner: Training-free Multi-resolution Visual Token Pruning for Multi-modal Large Language Models

Python 2 Updated Nov 16, 2025

Roc-Ng / XDVioDet

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

Python 136 19 Updated May 26, 2024

pipixin321 / HolmesVAU

[CVPR 2025 Highlight] Official implementation of "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity"

Python 141 8 Updated Mar 25, 2025

NVIDIA-NeMo / Curator

Scalable data preprocessing and curation toolkit for LLMs

Python 1,619 287 Updated Jun 17, 2026

MinishLab / semhash

Fast Multimodal Semantic Deduplication & Filtering

Python 936 57 Updated May 24, 2026

nota-github / ERGO

ERGO (Efficient Reasoning & Guided Observation) is a large vision-language model trained with reinforcement learning on efficiency objectives. [ICLR'26]

Python 19 1 Updated Feb 25, 2026

nvidia-cosmos / cosmos-reason1

Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 950 83 Updated Jun 7, 2026

AIDC-AI / Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,454 83 Updated Feb 11, 2026

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 7,157 738 Updated Jun 13, 2026

si0wang / ThinkLite-VL

Python 106 6 Updated Jun 10, 2025

DAMO-NLP-SG / VCD

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 406 25 Updated Oct 7, 2024

PostMindLab / ICD

[ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding

Python 17 3 Updated Nov 10, 2025

thunlp / Migician

[ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Python 89 4 Updated May 20, 2025

ustc-hyin / HiMAP

Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference

Python 14 1 Updated Jun 7, 2025

Theia-4869 / CDPruner

[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.

Python 104 5 Updated Sep 20, 2025

TIGER-AI-Lab / Mantis

Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024 Best Paper]

Python 240 23 Updated Jan 3, 2026

anvo25 / vlms-are-biased

Vision Language Models are Biased

Python 113 3 Updated Jan 26, 2026

Project-MONAI / VLM-Radiology-Agent-Framework

Jupyter Notebook 217 31 Updated Sep 22, 2025

NVlabs / RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,866 67 Updated May 29, 2026

snu-mllab / KVzip

[NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3)

Python 221 13 Updated Feb 11, 2026

OpenThinkIMG / OpenThinkIMG

OpenThinkIMG is an end-to-end open-source framework that empowers Large Vision-Language Models to think with images.

Jupyter Notebook 123 7 Updated Jul 11, 2025

CR400AF-A / SparseMM

[ICCV 2025] SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs

Python 86 4 Updated Jan 17, 2026

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,409 545 Updated Jun 16, 2026

TIGER-AI-Lab / Pixel-Reasoner

Pixel-Level Reasoning Model trained with RL [NeurIPS 2025]

Python 299 13 Updated Jun 4, 2026