Stars
Elevate your AI research writing, no more tedious polishing ✨
Fine-tune Vision Transformers on Apple Silicon with MLX. Unsloth-like API for ViT-B/L/H with LoRA, SwiGLU, and register tokens.
One dashboard. An entire research team.
JamesQFreeman / PathClaw
Forked from nanocoai/nanoclaw
Pathology assistant for whole-slide image viewing and agent-assisted WSI analysis.
FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity co…
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Reference PyTorch implementation and models for DINOv3
State-of-the-art 2D and 3D Face Analysis Project
Hierarchical Reasoning Model Official Release
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)
Awesome Unified Multimodal Models
[ICLR 2025] An Intelligent Agentic System for Complex Image Restoration Problems
RLogist = RL (reinforcement learning) + Pathologist
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"
[MICCAI 2024] Region Attention Transformer for Medical Image Restoration.