Stars
Automatic extraction of relevant features from time series
Python package that implements temporal disaggregation models to convert low-frequency to high-frequency time series (pip install timedisagg).
🌐 The easiest way to parse and modify URLs in Python.
A toolbelt of useful classes and functions to be used with python-requests
Python package for collecting ACS and geospatial data from the Census API
A machine-readable mapping of the American Housing Survey to the American Community Survey
🔷 Get Census Data from the API for arbitrary areas
Materials for a NICAR 2020 workshop on advanced Census data with Python
List of Python API Wrappers and Libraries
Style guides for Google-originated open-source projects
Passport Index 2025: visa requirements for 199 countries, in .csv
A Python library to detect and extract listing data from an HTML page.
A simple Python library that allows easy access to the SEC website so that someone can parse filings, collect data, and query documents.
Extract text, metadata and references (pdf, url, doi, arxiv) from PDF. Optionally download all referenced PDFs.
Simple wrapper of tabula-java: extract tables from a PDF into a pandas DataFrame
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
A Markdown version of the emoji cheat sheet
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
A collection of Python packages for geospatial analysis with binder-ready notebook examples
Python interface to the Stanford Named Entity Recognizer
Record linking package that fuzzy-matches two Python pandas DataFrames using sqlite3 FTS4