ai
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Multi-platform desktop app to download and run Large Language Models (LLMs) locally on your computer.
Simultaneous speech-to-text model
Instant voice cloning by MIT and MyShell. Audio foundation model.
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
A list of awesome beginner-friendly projects.
Neural Networks: Zero to Hero
A Model Context Protocol (MCP) Gateway & Registry. Serves as a central management point for tools, resources, and prompts that can be accessed by MCP-compatible LLM applications. Converts REST API …
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Transforms complex documents like PDFs into LLM-ready Markdown/JSON for your agentic workflows.
[3DV 2026] SpatialGen: Layout-guided 3D Indoor Scene Generation
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
A simple yet powerful agent framework that delivers with open-source models
Real-time face swap and one-click video deepfake with only a single image
A topic-centric list of HQ open datasets.