Stars
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Google Suite CLI: Gmail, GCal, GDrive, GContacts.
Beautiful, visual Bases views for Obsidian
[CVPR'25 Oral] Official implementation for "DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models"
A free, open source, and extensible speech-to-text application that works completely offline.
A GitHub issue sync tool for working with issues locally
Native and Compact Structured Latents for 3D Generation
FluidVoice - Fastest macOS Offline Dictation app - Voice to Text fully Local. One ⭐ takes us a long way :))
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
LL3M writes Python code that generates 3D assets in Blender.
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Best Claude Code framework that actually saves time. Built by a dev tired of typing "please act like a senior engineer" in every conversation.
A friendly Rust interface to Apple's Core Audio API.
Peekaboo is a macOS CLI & optional MCP server that enables AI agents to capture screenshots of applications, or the entire system, with optional visual question answering through local or remote AI…
MAESTRO is an AI-powered research application designed to streamline complex research tasks.
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
[CVPR 2025] "Towards Universal Soccer Video Understanding".
UFPR-VCR: a dataset for vehicle color recognition that includes 10,039 images of vehicles in a wide range of real-world conditions, such as frontal and rear views, partial occlusions, diverse light…
Python tool for converting files and office documents to Markdown.
kloppy: standardizing soccer tracking and event data
Fast and accurate automatic speech recognition (ASR) for edge devices
A simple screen parsing tool towards a pure vision-based GUI agent
📔 The interactive scratchpad for hackers.