Highlights
- Pro
Stars
Harness engineering beginner tutorial, from 0 to 1
🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.
The official repository of "Position: Agentic Evolution is the Path to Evolving LLMs".
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
implementing minimal versions of joint-embedding predictive architecture (JEPA)
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
Screen2Vec is a new self-supervised technique for generating more comprehensive semantic embeddings of GUI screens and components using their textual content, visual design and layout patterns, and…
ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K question-answer pairs collected by human annotators for ~35K…
Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations between selected general UI elements and their text labels. A…
Spatial Aptitude Training for Multimodal Langauge Models
[CVPR 2026] IOMM: Fast Pre-training of Unified Multimodal Models without Text-Image Pairs
This is a repository for awesome any2any work collection.
OpenClaw-RL: Train any agent simply by talking
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better
Various ML tidbits in Python/PyTorch and C++
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
Shaping capabilities with token-level pretraining data filtering
A collection of optimization problems in mathematics