Skip to content
View iworldtong's full-sized avatar

Block or report iworldtong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GELab: GUI Exploration Lab. One of the best GUI agent solutions in the galaxy, built by the StepFun-GELab team and powered by Step’s research capabilities.

Python 1,684 139 Updated Dec 19, 2025

Convert any video into a tiny size.

TypeScript 1,450 87 Updated Dec 11, 2025

Send files and folders anywhere in the world without storing in cloud - any size, any format, no accounts, no restrictions.

TypeScript 4,490 239 Updated Dec 8, 2025
12 Updated Nov 25, 2025

One-command vLLM installation for NVIDIA DGX Spark with Blackwell GB10 GPUs (sm_121 architecture)

Shell 28 6 Updated Oct 28, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,357 742 Updated Dec 21, 2025

有趣的80后程序员的工作流分享

1,439 347 Updated Dec 23, 2025

带有 WebUI 的 NovelAI 批量生成工具, 支持批量文生图, 图生图, 局部重绘, 导演工具, 角色分区, 角色参考, 支持 wildcards, 支持超分降噪, 支持元数据解析及抹除, 支持反推 tag, 支持图片筛选, 插件加载!

Python 28 2 Updated Dec 18, 2025

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

Python 790 52 Updated Dec 22, 2025

Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍

Shell 23,931 3,528 Updated Nov 13, 2025

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 607 51 Updated Oct 29, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,490 420 Updated Jun 27, 2025

A terminal-based dashboard for managing cron jobs locally and on servers.

Python 899 32 Updated Nov 11, 2025

🎯 告别信息过载,AI 助你看懂新闻资讯热点,简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台(抖音、知乎、B站、华尔街见闻、财联社等),智能筛选+自动推送+AI对话分析(用自然语言深度挖掘新闻:趋势追踪、情感分析、相似检索等13种工具)。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送,1分钟手机通知,无需…

Python 40,082 20,815 Updated Dec 22, 2025

Official implementation of WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance

111 2 Updated Sep 20, 2025

我的 nano-banana 创意玩法大合集! 持续更新中!

3,341 327 Updated Sep 18, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,041 1,271 Updated Oct 11, 2025

[arXiv 2025] ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models

32 1 Updated Aug 26, 2025

The open-source CapCut alternative

TypeScript 44,558 4,373 Updated Dec 7, 2025

带有 WebUI 的 NovelAI 量产工具, 实现了批量文生图; 批量图生图; 视频转绘; 分块重绘; 批量 Vibe; 批量局部重绘; 批量超分降噪; 批量自动打码; 批量添加水印; 批量上传 Pixiv; 图片筛选; 批量抹除, 还原或导出生成信息; 法术解析; 多模型反推提示词; ChatGPT; 动态加载插件; 自动 roll 画风串; 批量 Enhance; tag选择器; 涂…

Python 371 38 Updated Oct 19, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 26,820 2,683 Updated Dec 20, 2025

📚 从零开始的大语言模型原理与实践教程

Jupyter Notebook 23,084 2,094 Updated Dec 4, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,285 2,053 Updated Oct 21, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 16,949 2,040 Updated Dec 2, 2025

An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance

TypeScript 4,002 256 Updated Dec 15, 2025

:electron: An unofficial https://bgm.tv ui first app client for Android and iOS, built with React Native. 一个无广告、以爱好为驱动、不以盈利为目的、专门做 ACG 的类似豆瓣的追番记录,bgm.tv 第三方客户端。为移动端重新设计,内置大量加强的网页端难以实现的功能,且提供了相当的自定义选项。 目前已适配…

TypeScript 5,059 157 Updated Dec 21, 2025

基于自定义规则的番剧采集APP,支持流媒体在线观看,支持弹幕,支持实时超分辨率。

Dart 18,690 532 Updated Dec 20, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 18,200 2,029 Updated Dec 22, 2025
Python 6,052 466 Updated Aug 29, 2025
Next