Skip to content
View myownskyW7's full-sized avatar

Highlights

  • Pro

Block or report myownskyW7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,174 714 Updated Dec 11, 2025

Fine-Grained GRPO for Precise Preference Alignment in Flow Models

Python 39 Updated Nov 25, 2025

[NeurIPS'25] Official repository of Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Python 449 20 Updated Nov 29, 2025

An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 153 7 Updated Nov 5, 2025

Official implementation of BLIP3o-Series

Python 1,610 72 Updated Nov 29, 2025

An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"

Python 112 6 Updated Sep 28, 2025

Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning

Python 123 2 Updated Dec 12, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,436 121 Updated Dec 19, 2025

Collection of scripts and notebooks for OpenAI's latest GPT OSS models

Jupyter Notebook 484 53 Updated Aug 25, 2025

Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"

Python 215 19 Updated Aug 7, 2025

Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"

Python 155 6 Updated Sep 12, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,035 2,060 Updated Sep 12, 2025

Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme

Python 12,929 1,419 Updated Dec 15, 2025

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Jupyter Notebook 255 17 Updated Dec 16, 2025

🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )

Python 2,021 215 Updated Dec 16, 2025

Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 131 7 Updated Jun 30, 2025

Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’

Python 59 2 Updated Jun 25, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,237 7,787 Updated Dec 19, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 590 32 Updated Dec 19, 2025

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 600 25 Updated Dec 11, 2024

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

TypeScript 80,342 4,922 Updated Dec 19, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,605 227 Updated Jun 17, 2025

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 7,797 1,705 Updated May 26, 2025

[NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance

Python 84 1 Updated Sep 18, 2025

GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities

Python 305 8 Updated May 3, 2025

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Python 1,077 53 Updated Nov 3, 2025

Kyutai with an "eye"

Python 230 29 Updated Mar 26, 2025

[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

Python 3,079 310 Updated Sep 4, 2025
Python 8,605 606 Updated Nov 12, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,629 2,853 Updated Dec 19, 2025
Next