Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.
-
Updated
Nov 10, 2025 - HTML
Python is a dynamically-typed garbage-collected programming language developed by Guido van Rossum in the late 80s to replace ABC. Much like the programming language Ruby, Python was designed to be easily read by programmers. Because of its large following and many libraries, Python can be implemented and used to do anything from webpages to scientific research.
Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.
Flask app that compares an ID photo and a selfie using face_recognition, logs results in SQLite, and displays insights on a clean dashboard.
Automate job application tracking with Gmail API, OpenAI, and GitHub Actions. Generates visualizations and updates hourly.
Portfolio repo demonstrating hands on experience in ML, NLP, Deep Learning & GenAI building clean, modular projects with real-world problems solutions: text classification, RAG/Agentic systems, and PEFT. Developed impactful AI tools powered by AWS, Streamlit, Slack, & vector DBs.
Materials for the Deploy and Monitor ML Pipelines with Python, Docker and GitHub Actions workshop at the PyData NYC 2024 conference
Get daily and high-speed MTProto proxy for Telegram.
Sandbox of automated workflows using GitHub Actions with Python and R code
Let's try to build a simple .NET news site aggregator using Python, Hugo and Github Actions. The result can be visit on www.dotnetramblings.com
Python and shell scripts for automatically counting the total word count of a Hexo blog.
Build a web scraper for the daily Philadelphia Prison Census and make that data beautiful and useful for citizens and academic research.
My folder for practice in different languages
My Personal Website
Large scale simulations made simple.
Open-source developer platform to power your entire infra and turn scripts into webhooks, workflows and UIs. Fastest workflow engine (13x vs Airflow). Open-source alternative to Retool and Temporal.
MSDEVBUILD is a blog dedicated to sharing knowledge on Microsoft technologies, software development, cloud computing, mobile applications, AI, and more.The blog features in-depth technical articles, tutorials, and best practices for developers.
Created by Guido van Rossum
Released February 20, 1991