Skip to content
View bryanyzhu's full-sized avatar
🏹
Focusing
🏹
Focusing

Highlights

  • Pro

Block or report bryanyzhu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A pipeline parallel training script for diffusion models.

Python 1,684 226 Updated Oct 29, 2025

Community trainer for Lightricks' LTX Video model 🎬 ⚡️

Python 351 46 Updated Oct 26, 2025

[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Python 825 74 Updated Sep 11, 2025

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 490 20 Updated Oct 20, 2025

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python 2,638 448 Updated Sep 25, 2025

[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,578 124 Updated Aug 20, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,375 1,257 Updated Oct 12, 2025

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,579 561 Updated Oct 31, 2025

We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…

Python 1,113 93 Updated Oct 13, 2025

[ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

Python 546 96 Updated Jul 11, 2025

Foundation Models and Data for Human-Human and Human-AI interactions.

Python 304 18 Updated Aug 16, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 18,021 1,860 Updated Sep 8, 2025

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

TypeScript 18,319 2,547 Updated Nov 5, 2025

React UI + elegant infrastructure for AI Copilots, AI chatbots, and in-app AI agents. The Agentic last-mile 🪁

TypeScript 24,752 3,306 Updated Nov 5, 2025

Text-audio foundation model from Boson AI

Python 7,566 558 Updated Sep 15, 2025

本仓库包含对 Claude Code v1.0.33 进行逆向工程的完整研究和分析资料。包括对混淆源代码的深度技术分析、系统架构文档,以及重构 Claude Code agent 系统的实现蓝图。主要发现包括实时 Steering 机制、多 Agent 架构、智能上下文管理和工具执行管道。该项目为理解现代 AI agent 系统设计和实现提供技术参考。

JavaScript 11,117 2,924 Updated Jul 19, 2025

AI overlays on top of what you are doing

Swift 771 173 Updated Jul 28, 2025

Get your documents ready for gen AI

Python 43,011 3,079 Updated Nov 4, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

8,454 557 Updated Oct 31, 2025

A fast, local neural text to speech system

C++ 10,211 850 Updated Aug 26, 2025

Fast and local neural text-to-speech engine

C++ 1,479 161 Updated Nov 3, 2025

[Up-to-date] Awesome Agentic Deep Research Resources

526 46 Updated Aug 26, 2025

Anthropic's educational courses

Jupyter Notebook 17,497 1,634 Updated Oct 24, 2025

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 65,977 6,863 Updated Nov 1, 2025

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 23,348 2,510 Updated Sep 14, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,252 1,758 Updated Oct 13, 2025

Latest Advances on Long Chain-of-Thought Reasoning

542 25 Updated Jul 18, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,320 308 Updated Jun 21, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…

94,394 25,470 Updated Nov 1, 2025
Next