Starred repositories
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
Badget aims to simplify financial management with a user-friendly interface and robust backend
自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili
Cappuccino is an GUI Agent based on desktop screen. It is a Manus-like AI Agent that can be deployed locally.
Fully open reproduction of DeepSeek-R1
This repository is maintained to release dataset and models for multimodal puzzle reasoning.
This repository automatically updates a list of the top 100 repositories related to ComfyUI based on the number of stars on GitHub.
[ICLR 2024] DiffTactile: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation
BizyAir: Comfy Nodes that can run in any environment.
The API to search, scrape, and interact with the web at scale. 🔥
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
tracking medical datasets, with a focus on medical imaging
YouTube video to chords, lyrics, beat and melody.
A command line interface to download PDF files from https://arxiv.org.
Train high-quality text-to-image diffusion models in a data & compute efficient manner
🍃 MINT-1T: A one trillion token multimodal interleaved dataset.
WildEval / ZeroEval
Forked from allenai/WildBenchA simple unified framework for evaluating LLMs