-
Investigative Economics
- Washington, DC
- https://www.investigativeeconomics.org
Stars
Extract text and tables from scanned IRS form 990s
R data analysis of Government of Canada proactive disclosure of contracts data
Feature engineering tool to efficiently create effective, arbitrarily complex arithmetic combinations of numeric features
US H-1B Visa Lottery and Petition Data FY 2021 - FY 2024
Pandas-like data tool for analyzing data iteratively to avoid memory issues
Spreadsheets, obtained via FOIA, quantifying thefts/losses of controlled substances (and regulated chemicals) reported to the DEA.
Probabilistic Hierarchical forecasting 👑 with statistical and econometric methods.
Data from decades of PHMSA's "5800.1" hazardous material transportation incident reports
A nicer way to view SEC 13F filings data
Source codes and experimental results of our scientific integrity verification system.
Recod.ai Scientific Image Integrity Library
Python script to retrieve and monitor the United States Department of Labor TopHat Plan Search API for new filings.
Modin: Scale your Pandas workflows by changing a single line of code
Django app for building dashboards using raw SQL queries
IRSx: Turn the IRS' versioned XML 990 nonprofit annual tax returns into standardized python objects, json, or human readable text with original line number and description.
Data and analysis of intermediate care facilities, supporting a BuzzFeed News investigation.
Bayesian Additive Regression Trees For Python
Visual analysis and diagnostic tools to facilitate machine learning model selection.
Simplifies use of the Dedupe library via Pandas
Examples of programs that interact with the OpenFIGI services via their APIs.
Extended Isolation Forest for Anomaly Detection
A simple library for querying U.S. zipcodes.
Grid studio is a web-based application for data science with full integration of open source data science frameworks and languages.
Label line using matplotlib.
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
A quickly-hacked-together Python script to turn mysqldump files to CSV files. Optimized for Wikipedia database dumps.