Stars
A curated list of academic papers and resources on Physical AI, focusing on Vision-Language-Action (VLA) models, world models, embodied AI, and robotic foundation models.
Use Garry Tan's exact Claude Code setup: 15 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA
The pro_image_editor is a Flutter widget designed for image editing within your application. It provides a flexible and convenient way to integrate image editing capabilities into your Flutter proj…
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands
Human-taught Computer-use Agent Designed for Real Windows and macOS Desktops.
An open source library designed to provide community examples of Joint Embedding Predictive Architectures (JEPAs). It contains code and examples for learning representations from images, video, and…
Official implementation of WebVLN: Vision-and-Language Navigation on Websites
[ICLR 2026] RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation
A Visual Studio Code extension for Colab.
Algorithm powering the For You feed on X
Autonomous AI development loop for Claude Code with intelligent exit detection
"DeepTutor: AI-Powered Personalized Learning Assistant"
A large collection of system log datasets for AI-driven log analytics [ISSRE'23]
A curated list of Graph/Transformer-based fraud, anomaly, and outlier detection papers & resources
A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require e…
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
A local-first workflow engine built the way it should be: declarative, file-based, self-contained, air-gapped ready. One binary that scales from laptop to distributed cluster. Your Workflow Operato…
Optimized Whisper models for streaming and on-device use
The most powerful MCP Slack Server with no permission requirements, Apps support, GovSlack, DMs, Group DMs and smart history fetch logic.
A low-code machine learning personalized ranking service for articles, listings, search results, and recommendations that boosts user engagement. A friendly Learn-to-Rank engine.
Durable workflow automation in just a few lines of code
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM …
Training PyTorch models with differential privacy
PipelineDP is a Python framework for applying differentially private aggregations to large datasets using batch processing systems such as Apache Spark, Apache Beam, and more.
Google's differential privacy libraries.