Skip to content
View Paper99's full-sized avatar
🐢
Focusing
🐢
Focusing

Organizations

@Tongyi-MAI

Block or report Paper99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HuggingFace conversion and training library for Megatron-based models

Python 305 109 Updated Dec 23, 2025

An AI image gen prompt manager !

JavaScript 173 25 Updated Dec 14, 2025

AI-powered Resume Expert based on Conversation

JavaScript 10 2 Updated Dec 4, 2025
Python 7,747 458 Updated Dec 14, 2025

Official inference repo for FLUX.2 models

Python 1,260 64 Updated Dec 1, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,409 745 Updated Dec 21, 2025

Native Multimodal Models are World Learners

Python 1,372 52 Updated Nov 28, 2025

Identifying and removing near-duplicate images using perceptual hashing.

Python 385 25 Updated Apr 25, 2025

Contexts Optical Compression

Python 21,556 1,928 Updated Oct 25, 2025

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.

Python 306 55 Updated Dec 23, 2025

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,595 2,286 Updated Oct 17, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,996 220 Updated Sep 12, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 914 57 Updated Dec 23, 2025

cuGraph - RAPIDS Graph Analytics Library

Cuda 2,095 342 Updated Dec 23, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 41,879 3,734 Updated Dec 24, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,449 122 Updated Dec 23, 2025
Python 54 Updated Sep 21, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,464 1,998 Updated Nov 1, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,474 366 Updated Dec 23, 2025

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

Rust 8,285 675 Updated Dec 23, 2025

NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,784 76 Updated Nov 27, 2025

Official Release of ICCV 2025 paper -- DiscretizedSDF

Python 98 11 Updated Aug 25, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,754 707 Updated Nov 7, 2025

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 677 25 Updated Sep 24, 2025

Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Python 876 47 Updated Jul 1, 2025

Official code for the paper: Depth Anything At Any Condition

Python 312 19 Updated Aug 21, 2025

Easy to use Python module to extract Exif metadata from digital image files.

Python 917 207 Updated Dec 2, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,049 1,273 Updated Oct 11, 2025

Open-source unified multimodal model

Python 5,500 481 Updated Oct 27, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,773 105 Updated Nov 4, 2025
Next