Stars
[ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark performance. It also significantly improves the quality…
Example of using Realtime API as a meeting assistant to manage a Kanban Board
LLaDA2.0-Uni: Understanding and Generation the World.
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
Official repository for Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding
🦞 Just talk to your agent — it learns and EVOLVES 🧬.
Academic Research Skills for Claude Code: research → write → review → revise → finalize
AI agents running research on single-GPU nanochat training automatically
🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
Extract clean conversation logs from Claude Code's internal storage
This repository is build in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the informati…
No fortress, purely open ground. OpenManus is Coming.
PyTorch building blocks for the OLMo ecosystem
Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs
Block-Recurrent Dynamics in ViTs 🦖
Tools for merging pretrained large language models.
💫 Toolkit to help you get started with Spec-Driven Development
Information hub for our project training the largest possible historical LLMs.
Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.
[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation