Starred repositories
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
AGENTS.md — a simple, open format for guiding coding agents
A Collection of Papers and Codes for ECCV2024/ECCV2020 Low Level Vision
A book for Learning the Foundations of LLMs
"Paper2Slides: From Paper to Presentation in One Click"
Transform any arXiv papers into slides using LLMs
[ICLR 2024] Code for FreeNoise based on VideoCrafter
StableLM: Stability AI Language Models
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…
Send a phone call from AI agent, in an API call. Or, directly call the bot from the configured phone number!
实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human, customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
[AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Learning materials for Stanford CS149 : Parallel Computing
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
深入探索精选的套壳站和必备API资源。本文为初学者和经验丰富的运营者提供一站式指南,涵盖常见问题解答和基础攻略,助您迈向套壳站副业成功之路。Dive into a curated selection of shell sites and essential APIs. This article offers a comprehensive guide for both beginners a…
Advanced-ChatFile. Simple implementation enterprise RAG system. Support Query Rewrite, Retrieval Reranking, Vector Search, Multi LLM Interface, etc...
Ikaros-521 / AI-Vtuber
Forked from sandboxdream/AI-VtuberAI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…
Scrape the webpage convert it into Markdown, and enhance AI search applications.
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…
🧑🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.