Stars
OCR model that handles complex tables, forms, handwriting with full layout.
Our method reconstructs 3D worlds from video diffusion models using non-rigid alignment to resolve inherent 3D inconsistencies in the generated sequences.
Ground Station is all-in-one satellite monitoring suite
attempting to detect smart glasses nearby and warn you
Real-time global intelligence dashboard. AI-powered news aggregation, geopolitical monitoring, and infrastructure tracking in a unified situational awareness interface
The open-source voice synthesis studio
Tooth arrangement, Medical orthodontics,Neural networks, Deep learning, Transformer, Pytorch, Python
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
The official TypeScript SDK for Model Context Protocol servers and clients
A free and open-source block-based email template builder.
🌟 Automatically render forms for your existing data schema
A curated list of awesome things related to shadcn/ui.
Implementation of "Disentangled Motion Modeling for Video Frame Interpolation", AAAI 2025
ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Plupload is JavaScript API for building file uploaders. It supports multiple file selection, file filtering, chunked upload, client side image downsizing and when necessary can fallback to alternat…
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
Low-code platform allows you to build business apps, enables you to quickly create internal tools such as dashboard, crud app, admin panel, crm, cms, etc. Supports PostgreSQL, MySQL, Supabase, Grap…
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Simple react component to generate cron expressions
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.