Skip to content
View m-chadda's full-sized avatar

Block or report m-chadda

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reas…

Python 744 20 Updated Sep 10, 2025

Vision infrastructure to turn complex documents into RAG/LLM-ready data

Rust 2,922 188 Updated Sep 24, 2025

Document Artifical Intelligence

194 8 Updated Sep 28, 2025

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

222 11 Updated Sep 9, 2024
TypeScript 2 1 Updated Jan 16, 2025

➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are completely removed. Crawl and convert any website into LLM-ready …

TypeScript 625 53 Updated May 23, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 66,603 9,529 Updated Dec 16, 2025
Rust 4 1 Updated Nov 2, 2024
Python 65 8 Updated Aug 14, 2024

A neural network library for Swift

Swift 126 7 Updated Oct 9, 2025

All-in-one platform for search, recommendations, RAG, and analytics offered via API

Rust 2,578 230 Updated Oct 10, 2025