Skip to content
View zwcolin's full-sized avatar
🏋️
Benching, cooking and shipping
🏋️
Benching, cooking and shipping

Highlights

  • Pro

Organizations

@ucsd-ets @princeton-nlp @dsc-courses

Block or report zwcolin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AutoGaze automatically removes redundant patches in a video, reducing #tokens in ViT/MLLM by 4x-100x.

Python 252 14 Updated Mar 19, 2026
Python 1 Updated Mar 23, 2026

OpenCUA: Open Foundations for Computer-Use Agents

Python 738 97 Updated Feb 4, 2026
Python 1 Updated Apr 13, 2026

Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Jupyter Notebook 106 8 Updated Mar 10, 2026

The implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models

Python 54 7 Updated Apr 8, 2026

A Large-scale Video Action Dataset

Python 447 12 Updated Jan 16, 2026

A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.

C++ 187 31 Updated Apr 17, 2026

Curate, Annotate, and Manage Your Data in LightlyStudio.

Python 691 19 Updated Apr 17, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,398 117 Updated Apr 17, 2026
HTML 1 Updated Feb 15, 2026

A customizable gym environment for maze/gridworld

Jupyter Notebook 7 2 Updated Apr 27, 2018

Lifelong Learning Note

Python 15 2 Updated Mar 4, 2026

Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAl o1, and DeepSeek-R1

63 4 Updated Mar 18, 2025

Our library for RL environments + evals

Python 4,025 535 Updated Apr 18, 2026

Witness the aha moment of VLM with less than $3.

Python 4,050 285 Updated May 19, 2025

Random maze environments with different size and complexity for reinforcement learning research.

Python 2 Updated Apr 30, 2024

A customizable framework to create maze and gridworld environments

Python 269 60 Updated Apr 5, 2019

A framework for few-shot evaluation of language models.

Python 12,237 3,200 Updated Apr 8, 2026

A fork to add multimodal model training to open-r1

Python 1,528 72 Updated Feb 8, 2025

[ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Python 159 13 Updated Aug 8, 2025
Python 83 6 Updated Nov 5, 2024

A collection of materials for CS application

3 Updated Dec 21, 2024
Python 1 Updated Dec 12, 2024
Python 118 13 Updated Jun 16, 2025
HTML 22 3 Updated Nov 26, 2024

qpdf: A content-preserving PDF document transformer

C++ 4,939 368 Updated Apr 14, 2026

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,447 85 Updated Feb 11, 2026
Python 17 1 Updated Dec 11, 2024
Next