Stars
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A natural language interface for computers
We write your reusable computer vision tools. 💜
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
get things from one computer to another, safely
Automate browser based workflows with AI
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Create Reddit Videos with just✨ one command ✨
MCP server for Atlassian tools (Confluence, Jira)