zwcolin
🏋️
Benching, cooking and shipping

Highlights

  • Pro

Organizations

@ucsd-ets @princeton-nlp @dsc-courses


AutoGaze automatically removes redundant patches in a video, reducing #tokens in ViT/MLLM by 4x-100x.

Python 234 9 Updated Mar 19, 2026
Python 1 Updated Mar 23, 2026

OpenCUA: Open Foundations for Computer-Use Agents

Python 727 97 Updated Feb 4, 2026
Python 1 Updated Apr 3, 2026

Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Jupyter Notebook 104 7 Updated Mar 10, 2026

The implementation for ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Python 22 2 Updated Jan 30, 2026

A Large-scale Video Action Dataset

Python 443 12 Updated Jan 16, 2026

A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.

C++ 177 30 Updated Apr 1, 2026

Curate, Annotate, and Manage Your Data in LightlyStudio.

Python 688 17 Updated Apr 3, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,386 115 Updated Mar 28, 2026
HTML 1 Updated Feb 15, 2026

A customizable gym environment for maze/gridworld

Jupyter Notebook 7 2 Updated Apr 27, 2018

Lifelong Learning Note

Python 15 2 Updated Mar 4, 2026

Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAI o1, and DeepSeek-R1

62 4 Updated Mar 18, 2025

Our library for RL environments + evals

Python 3,963 527 Updated Apr 3, 2026

Witness the aha moment of VLM with less than $3.

Python 4,046 286 Updated May 19, 2025

Random maze environments with different size and complexity for reinforcement learning research.

Python 2 Updated Apr 30, 2024

A customizable framework to create maze and gridworld environments

Python 269 61 Updated Apr 5, 2019

A framework for few-shot evaluation of language models.

Python 11,992 3,150 Updated Apr 1, 2026

A fork to add multimodal model training to open-r1

Python 1,520 71 Updated Feb 8, 2025

[ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Python 160 13 Updated Aug 8, 2025
Python 83 6 Updated Nov 5, 2024

A collection of materials for CS application

3 Updated Dec 21, 2024
Python 1 Updated Dec 12, 2024
Python 118 13 Updated Jun 16, 2025
HTML 22 3 Updated Nov 26, 2024

qpdf: A content-preserving PDF document transformer

C++ 4,900 363 Updated Apr 2, 2026

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,444 85 Updated Feb 11, 2026
Python 17 1 Updated Dec 11, 2024