🔖
And someday, I hope that my sadness will be replaced by something beautiful.
  • Toshiba Corporation
  • Tokyo, Japan
  • 18:35 (UTC +08:00)

368 stars written in Python

🚀🚀 "Large Model": Train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 38,785 4,662 Updated Jan 30, 2026

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 27,907 3,992 Updated Jan 29, 2026

MiniCPM-o 4.5: A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 22,913 1,735 Updated Feb 5, 2026

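Llama Chinese community: a real-time roundup of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use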

Python 14,749 1,305 Updated Apr 6, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,399 1,330 Updated Oct 11, 2025

Hierarchical Reasoning Model Official Release

Python 12,300 1,789 Updated Sep 9, 2025

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,641 1,122 Updated Sep 14, 2024

An Autonomous LLM Agent for Complex Task Solving

Python 8,494 891 Updated Aug 12, 2024

UFO³: Weaving the Digital Agent Galaxy

Python 7,998 979 Updated Jan 6, 2026

Mobile-Agent: The Powerful GUI Agent Family

Python 7,133 740 Updated Dec 2, 2025

A lightweight LMM-based Document Parsing Model

Python 6,459 446 Updated Jan 30, 2026

Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale

Python 5,622 530 Updated Feb 5, 2026

ACI.dev is the open source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function calling or a unified MCP server. The birthplace of VibeOps.

Python 4,704 457 Updated Jan 8, 2026

Align Anything: Training All-modality Model with Feedback

Python 4,631 509 Updated Nov 27, 2025

HunyuanVideo-1.5: A leading lightweight video generation model

Python 4,261 177 Updated Jan 2, 2026

This inventory management system reflects the current Ford Asia Pacific after-sales logistics warehousing supply chain process. After I left Ford, I started this project. You can share your vacant wa…

Python 4,248 1,126 Updated Sep 26, 2025

[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python 4,245 483 Updated Jul 10, 2024

Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers powerful capabilities for agent running control, data processing …

Python 4,113 471 Updated Feb 5, 2026

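Recommendation Algorithm: a large-scale recommendation algorithm library covering classic and recent recommender-system algorithms: LR, Wide&Deep, DSSM, TDM, MIND, Word2Vec, Bert4Rec, DeepWalk, SSR, AITM, DSIN, SIGN, IPREC, GRU4Rec, Youtube_dnn, NCF, GNN, FM, FFM, DeepFM, DCN, DIN, DIEN, DLRM, MMOE, PLE, ESM…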

Python 4,085 656 Updated Apr 2, 2025

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o…

Python 4,002 764 Updated Oct 28, 2025

The easiest and laziest way to build multi-agent LLM applications.

Python 3,712 362 Updated Feb 5, 2026

[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,674 530 Updated Feb 27, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,622 511 Updated Feb 4, 2026

An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,520 307 Updated Nov 5, 2024

Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,514 286 Updated Feb 2, 2026

A Doctor for your data

Python 3,490 256 Updated Jan 14, 2025

The next-generation deep reinforcement learning toolkit

Python 3,459 596 Updated Jun 16, 2023

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,291 209 Updated Mar 5, 2024

PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.

Python 3,234 278 Updated Jan 26, 2026

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.

Python 3,150 278 Updated Dec 15, 2025