zwcolin
🏋️
Benching, cooking and shipping

Highlights

  • Pro

Organizations

@ucsd-ets @princeton-nlp @dsc-courses


AutoGaze automatically removes redundant patches in a video, reducing #tokens in ViT/MLLM by 4x-100x.

Python 210 9 Updated Mar 19, 2026
Python 1 Updated Mar 23, 2026

OpenCUA: Open Foundations for Computer-Use Agents

Python 724 95 Updated Feb 4, 2026
Python 1 Updated Mar 27, 2026

Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Jupyter Notebook 104 7 Updated Mar 10, 2026

The implementation of ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Python 20 1 Updated Jan 30, 2026

A Large-scale Video Action Dataset

Python 442 12 Updated Jan 16, 2026

A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.

C++ 173 29 Updated Mar 26, 2026

Curate, Annotate, and Manage Your Data in LightlyStudio.

Python 688 17 Updated Mar 30, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,379 114 Updated Mar 28, 2026
HTML 1 Updated Feb 15, 2026

A customizable gym environment for maze/gridworld

Jupyter Notebook 7 2 Updated Apr 27, 2018

Lifelong Learning Note

Python 15 2 Updated Mar 4, 2026

Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAI o1, and DeepSeek-R1

62 4 Updated Mar 18, 2025

Our library for RL environments + evals

Python 3,952 522 Updated Mar 30, 2026

Witness the aha moment of VLM with less than $3.

Python 4,047 286 Updated May 19, 2025

Random maze environments with different size and complexity for reinforcement learning research.

Python 2 Updated Apr 30, 2024

A customizable framework to create maze and gridworld environments

Python 269 61 Updated Apr 5, 2019

A framework for few-shot evaluation of language models.

Python 11,931 3,133 Updated Mar 18, 2026

A fork to add multimodal model training to open-r1

Python 1,515 70 Updated Feb 8, 2025

[ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Python 160 12 Updated Aug 8, 2025
Python 83 6 Updated Nov 5, 2024

A collection of materials for CS application

3 Updated Dec 21, 2024
Python 1 Updated Dec 12, 2024
Python 118 13 Updated Jun 16, 2025
HTML 22 3 Updated Nov 26, 2024

qpdf: A content-preserving PDF document transformer

C++ 4,883 362 Updated Mar 28, 2026

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,443 84 Updated Feb 11, 2026
Python 17 1 Updated Dec 11, 2024