Skip to content
View SarthakMishra's full-sized avatar

Block or report SarthakMishra

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
14 stars written in Python
Clear filter

Get your documents ready for gen AI

Python 43,134 3,088 Updated Nov 6, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 18,838 1,286 Updated Oct 21, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 15,819 1,198 Updated Nov 4, 2025

Fast State-of-the-Art Static Embeddings

Python 1,897 107 Updated Oct 11, 2025

MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model

Python 1,558 165 Updated Aug 6, 2025

asyncio bridge to the standard sqlite3 module

Python 1,470 106 Updated Nov 1, 2025

PgQueuer is a Python library leveraging PostgreSQL for efficient job queuing.

Python 1,387 27 Updated Nov 6, 2025

UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection

Python 1,059 109 Updated Nov 6, 2025

A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!

Python 860 65 Updated Nov 5, 2025

Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it…

Python 720 76 Updated Mar 4, 2025

🛡️A resilient, high-performance asynchronous connection pool layer for SQLite, designed for efficient and scalable database operations.

Python 404 7 Updated Jul 21, 2025

Sentiment Analysis of news on stock prices

Python 129 41 Updated May 22, 2023
Python 50 12 Updated Jun 10, 2025

CodeMap is a CLI tool that generates optimized markdown docs and streamline Git workflows.

Python 5 1 Updated May 26, 2025