Skip to content
View MingLunHan's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report MingLunHan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

https://avocado-captioner.github.io/

Python 27 Updated Oct 16, 2025
Python 70 7 Updated Nov 12, 2025

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

Python 406 28 Updated Nov 27, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,655 1,355 Updated Dec 17, 2025
Python 4,460 434 Updated Sep 14, 2025

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,552 2,283 Updated Oct 17, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,134 191 Updated Oct 9, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,233 438 Updated Dec 18, 2025
Python 11 Updated Aug 7, 2025

MindVL: Towards Efficient and Effective Training of Multimodal Large Language Models on Ascend NPUs

2 Updated Sep 29, 2025

A simple, elegant, and fast workflow to write resumes and CVs in Markdown.

HTML 92 47 Updated Jan 24, 2025

A Fully Self-Hosted Solution for Full-Duplex Voice Interaction

Python 452 34 Updated Sep 28, 2025

Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.

Python 242 17 Updated Sep 22, 2025

[ICCV 2025] Explore the Limits of Omni-modal Pretraining at Scale

Python 121 6 Updated Sep 2, 2024
Python 844 45 Updated Sep 15, 2025

🤗 R1-AQA Model: mispeech/r1-aqa

Python 309 27 Updated Mar 28, 2025

ICML 2025 Papers: Dive into cutting-edge research from the premier machine learning conference. Stay current with breakthroughs in deep learning, generative AI, optimization, reinforcement learning…

22 1 Updated Oct 24, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,371 1,348 Updated Jul 9, 2025

🛠Awesome Tools,程序员常用高效实用工具、软件资源精选,办公效率提升利器(A Curated Collection of High-Efficiency and Practical Tools and Software Resources for Programmers to Boost Office Productivity)。

868 112 Updated Dec 9, 2025
Python 704 14 Updated Nov 20, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 11,391 1,152 Updated Apr 30, 2025

Latest Advances on System-2 Reasoning

Python 1,296 73 Updated Jun 8, 2025

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Python 211 9 Updated Sep 26, 2025

The first Large Audio Language Model that enables native in-depth thinking, which is trained on large-scale audio Chain-of-Thought data.

Python 273 24 Updated May 15, 2025

Visual R1: Trasfer Reasoning Ability from R1 to Visual R1

3 Updated Feb 15, 2025
Python 37 2 Updated Aug 26, 2025

Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓

35 Updated Apr 3, 2025

R1-Vision: Let's first take a look at the image

Python 48 1 Updated Feb 16, 2025
Next