
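An AI-native PPT generator based on nano banana pro🍌, a step toward true "Vibe PPT". Upload any template image; upload any source material for intelligent parsing; generate a PPT automatically from a single sentence, an outline, or page descriptions; revise specified regions with spoken instructions and export in one click.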

Python 6,173 693 Updated Dec 24, 2025

An open-source database of professors in Hong Kong, Macau, Singapore, and other regions, for Chinese students (especially those affected by the 10043 policy).

36 Updated Nov 26, 2025

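🎯 Say goodbye to information overload: AI helps you make sense of trending news, with simple public-opinion monitoring and analysis. Multi-platform hot-topic aggregation plus an MCP-based AI analysis tool. Monitors 35 platforms (Douyin, Zhihu, Bilibili, Wallstreetcn, Cailian Press, etc.) with smart filtering, automatic push, and conversational AI analysis (explore news in depth using natural language: 13 tools including trend tracking, sentiment analysis, and similarity search). Supports push via WeCom/personal WeChat/Feishu/DingTalk/Telegram/email/ntfy/bark/slack, phone notifications within 1 minute, no need…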

Python 40,306 20,860 Updated Dec 23, 2025

An efficient local intrinsic dimension estimator with diffusion models.

Jupyter Notebook 5 1 Updated Jun 11, 2025

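The original text was written by the well-known hacker Eric S. Raymond. It teaches you how to ask technical questions properly and get answers that satisfy you.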

JavaScript 34,255 5,763 Updated Jan 1, 2025

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 681 27 Updated Dec 23, 2025

[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs

Python 145 3 Updated Dec 14, 2025

A visual attention visualization tool for Vision-Language models based on SmoothGrad and Grad-CAM, specifically optimized for Qwen2.5-VL models.

Python 8 1 Updated Sep 22, 2025
Jupyter Notebook 2 1 Updated Apr 21, 2025

Visualizing the attention of vision-language models

Jupyter Notebook 268 22 Updated Feb 28, 2025
Python 6,807 1,152 Updated Dec 21, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,649 55 Updated Nov 15, 2025

The best ChatGPT that $100 can buy.

Python 39,199 4,964 Updated Dec 23, 2025

Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.

JavaScript 2,211 735 Updated Dec 24, 2025

SSH Proxy Command

C 112 26 Updated Oct 30, 2021

[EMNLP 2025 Demo] Extracting internal representations from vision-language models. Beta version.

Python 76 3 Updated Nov 13, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 4,348 614 Updated Dec 24, 2025
Jupyter Notebook 21 3 Updated Sep 16, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,722 1,359 Updated Dec 24, 2025
Python 705 14 Updated Nov 20, 2025

MiMo-VL

611 29 Updated Aug 21, 2025

Finetuning CLIP on a small image/text dataset using huggingface libs

Python 52 2 Updated Jan 6, 2023

Fully Open Framework for Democratized Multimodal Training

Python 663 53 Updated Dec 15, 2025

🐍 The official Python client library for Google's discovery based APIs.

Python 8,635 2,524 Updated Dec 5, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,764 2,888 Updated Dec 24, 2025

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 338 11 Updated Dec 22, 2025

Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types

Python 32 Updated Jul 16, 2025

A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.

368 26 Updated Dec 15, 2025

🔥 【Meta Awesome List】: AI/ML Research Hub - Solving the "Chasing Hot Topics" Problem for AI Researchers. 🤖 Agent-driven intelligence automatically discovers trending topics, curates hottest GitHub …

58 Updated Sep 1, 2025