09 Jan 26

A lot of really useful looking tools here, mostly stuff for converting from one format to another.

by cobbland 1 month ago

24 Oct 25

Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing. The pdf-to-markdown GitHub repository hosts a tool designed to convert PDF files into Markdown format for easier text extraction and reformatting, with the process running locally on the user’s machine.

by tmfnk 3 months ago

31 Jan 25

MarkItDown is a utility for converting various files to Markdown (e.g., for indexing, text analysis, etc). It supports:

by pyrho 1 year ago saved 2 times