Stars
Convert PDF to markdown + JSON quickly with high accuracy
A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Open-source, community-driven agent harness
The official implementation of Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models.
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
The state-of-the-art image restoration model without nonlinear activation functions.
Quality-Aware Image-Text Alignment for Opinion-Unaware Image Quality Assessment
[CVPRW oral 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Machine learning metrics for distributed, scalable PyTorch applications.
A Python port of the MATLAB reference implementation
[WACV 2024 Oral] - ARNIQA: Learning Distortion Manifold for Image Quality Assessment
Official implementation for "Image Quality Assessment using Contrastive Learning"
A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for p…
The agent that grows with you
An agentic skills framework & software development methodology that works.
Wan: Open and Advanced Large-Scale Video Generative Models
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Line Segment Detector for computer vision applications.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
AI Chinese Prompt Handbook: ChatGPT Chinese Prompt Handbook (Prompt Bible), compiled by K-Render
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Deep Learning model to detect and correct image orientation (0°, 90°, 180°, 270°) using a fine-tuned EfficientNetV2