107 stars written in Python

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,985 31,470 Updated Dec 17, 2025

The Python programming language

Python 70,355 33,696 Updated Dec 17, 2025

The uncompromising Python code formatter

Python 41,226 2,690 Updated Dec 12, 2025

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 32,030 2,232 Updated Dec 15, 2025

🔒 Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories.

Python 29,469 2,372 Updated Dec 16, 2025

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…

Python 28,761 8,046 Updated Mar 20, 2024

Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.13. You feed it your Python app, it does a lot of clever things, and spits out an executable or exte…

Python 14,238 748 Updated Dec 17, 2025

Pythonic HTML Parsing for Humans™

Python 13,870 997 Updated Apr 16, 2024

Python job scheduling for humans.

Python 12,198 983 Updated May 25, 2024

Statsmodels: statistical modeling and econometrics in Python

Python 11,149 3,306 Updated Dec 17, 2025

An open access book on scientific visualization using python and matplotlib

Python 11,131 1,016 Updated Jan 22, 2024

An open source multi-tool for exploring and publishing data

Python 10,599 797 Updated Dec 17, 2025

Practical Python Programming (course by @dabeaz)

Python 10,503 7,067 Updated Aug 10, 2024

Deep learning library featuring a higher-level API for TensorFlow.

Python 9,616 2,391 Updated May 6, 2024

Python Data. Leaflet.js Maps.

Python 7,291 2,256 Updated Dec 17, 2025

SQL for Humans™

Python 7,223 572 Updated Jul 9, 2024

Community maintained fork of pdfminer - we fathom PDF

Python 6,830 1,014 Updated Dec 14, 2025

Git for Humans, Inspired by GitHub for Mac™.

Python 5,699 215 Updated Oct 9, 2023

A Unified Toolkit for Deep Learning Based Document Image Analysis

Python 5,618 516 Updated Aug 15, 2024

Python PDF Parser (Not actively maintained). Check out pdfminer.six.

Python 5,301 1,123 Updated Dec 7, 2022

💡 Full-featured code intelligence and smart autocomplete for Sublime Text

Python 5,056 525 Updated Aug 28, 2023

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 5,049 334 Updated Sep 12, 2025

CommonMark spec, with reference implementations in C and JavaScript

Python 5,035 343 Updated Sep 16, 2025

Good Curio!

Python 4,128 248 Updated Oct 4, 2024

Snips Python library to extract meaning from text

Python 3,952 512 Updated May 22, 2023

Camelot: PDF Table Extraction for Humans

Python 3,713 361 Updated Jan 5, 2023

Best practice and tips & tricks to write scientific papers in LaTeX, with figures generated in Python or Matlab.

Python 3,703 259 Updated May 17, 2023

(OLD REPO) Line-by-line profiling for Python - Current repo ->

Python 3,609 257 Updated Oct 26, 2021

A Python library to extract tabular data from PDFs

Python 3,548 523 Updated Nov 12, 2025

Was an interactive continuous Python profiler.

Python 2,947 112 Updated Aug 24, 2020