Skip to content
View pudo's full-sized avatar

Sponsors

@medecau
@mysk
@adamdecaf
Private Sponsor

Organizations

@bundestag @pdfminer @opensanctions

Block or report pudo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

28 stars written in HTML
Clear filter

Convert PDF to HTML without losing text or format.

HTML 10,588 1,849 Updated Jun 2, 2023

extract text from any document. no muss. no fuss.

HTML 4,540 671 Updated Apr 28, 2026

Marble - the real time decision engine for fraud and AML

HTML 513 79 Updated Apr 15, 2026

Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py

HTML 394 111 Updated May 22, 2023

🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based

HTML 332 38 Updated Oct 13, 2023

A lightweight, backend-free open data portal, powered by Jekyll

HTML 227 314 Updated Apr 2, 2026

The project aims to build and use open source tools and datasets to gather and analyse the financial transactions of governments around the world.

HTML 171 56 Updated Jul 30, 2020

The old website for Code for Germany, 2013 edition. Includes the blog, projects list and basic info about the group.

HTML 153 139 Updated Apr 20, 2021

A repository of journalist's lookup tables.

HTML 107 15 Updated Apr 26, 2017

International legislative data specifications

HTML 106 19 Updated Feb 13, 2023

Source for the Where Does My Money Go? app.

HTML 67 44 Updated Sep 19, 2018

Repository für den bequemen Zugang zu einigen Open-Data-Angeboten aus Köln

HTML 42 18 Updated Nov 22, 2017

Map of power relations in Spain. TheyRule meets Pinterest.

HTML 40 13 Updated Oct 4, 2022

Specifications of the reconciliation API

HTML 39 12 Updated Nov 10, 2025

Exploring power and influence in the European Union by combining information from a variety of official EU data sources related to lobbying, expert groups, expenditure and procurement.

HTML 37 4 Updated Feb 23, 2016

A modernized, improved version of the OFAC SDN website.

HTML 24 15 Updated Dec 7, 2022

OpenSpending Community Site

HTML 16 14 Updated Apr 14, 2023
HTML 14 6 Updated Feb 26, 2022

Website for the public consultation on the review of the EU copyright rules

HTML 9 21 Updated May 12, 2020

Collects newspaper articles for analysis

HTML 8 1 Updated Oct 23, 2017

Know-your-business datasets (corporate registries converted to FollowTheMoney data format)

HTML 7 2 Updated Oct 1, 2023

Website for "Cameroon Budget Inquirer"

HTML 7 9 Updated May 25, 2018

JSON Schema for OCCRP data

HTML 5 Updated Dec 16, 2015
HTML 4 Updated Jun 26, 2025

Nkonson Konson ("chain link") is an Adinkra Symbols; a symbol of unity and human relations meaning that we are all linked. This website has been designed to provide users with a tool to investigate…

HTML 2 1 Updated Feb 27, 2018

Ein neues blahg, ein besseres blahg, oh Freunde will ich generieren!

HTML 2 4 Updated Aug 1, 2024