-
Beijing Normal University
- Beijing
-
15:29
(UTC +08:00) - https://blog.leafyee.xyz/about
- https://orcid.org/0009-0005-2620-5412
Highlights
- Pro
Lists (14)
Sort Name ascending (A-Z)
Starred repositories
📚 Freely available programming books
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Python tool for converting files and office documents to Markdown.
real time face swap and one-click video deepfake with only a single image
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.
No fortress, purely open ground. OpenManus is Coming.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT!🌏 Train a 64M-parameter GPT from scratch in just 2h!
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A generative speech model for daily dialogue.
Instant voice cloning by MIT and MyShell. Audio foundation model.
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Fully open reproduction of DeepSeek-R1
Maple Mono: Open source monospace font with round corner, ligatures and Nerd-Font icons for IDE and terminal, fine-grained customization options. 带连字和控制台图标的圆角等宽字体,中英文宽度完美2:1,细粒度的自定义选项
🚀 The fast, Pythonic way to build MCP servers and clients.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
Xiaomi Home Integration for Home Assistant
Automate browser based workflows with AI
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.