-
-
sfguide-getting-started-with-snowflake-intelligence Public
Forked from Snowflake-Labs/sfguide-getting-started-with-snowflake-intelligencePLpgSQL Apache License 2.0 UpdatedSep 30, 2025 -
getting-started-with-dbt-on-snowflake Public
Forked from Snowflake-Labs/getting-started-with-dbt-on-snowflakeProvides a getting started dbt project for dbt on Snowflake
Python Apache License 2.0 UpdatedAug 29, 2025 -
MSU-CSAM-Accelerator-Talk Public
Slides for the talk I gave on August 11, 2025 at Montclair State University's College of Science and Mathematics Summer Accelerator Program.
UpdatedAug 11, 2025 -
advanced-data-engineering-snowflake Public
Forked from Snowflake-Labs/advanced-data-engineering-snowflakeCompanion repository that goes along with Snowflake's "Advanced Data Engineering with Snowflake" course
PLpgSQL UpdatedJun 14, 2025 -
-
foiarchive-search Public
Forked from history-lab/foiarchive-searchStreamlit for FOIArchive search GUI
-
-
GitHubActionsTutorial-USRSE24 Public
Forked from uwescience/GitHubActionsTutorial-USRSE24Content for US-RSE'24 Tutorial "GitHub Actions for Scientific Data Workflows"
Jupyter Notebook UpdatedOct 7, 2024 -
subore Public
Subject and body search via regular expression across FOIArchive corpora.
Python UpdatedSep 4, 2024 -
-
covid19-gui Public
GUI for History Lab's COVID-19 collection
Python MIT License UpdatedOct 10, 2023 -
eabcc-presentation Public
Slides and materials for the talk "Creating Email Archives from PDFs – The COVID-19 Corpus" delivered at the EABCC Email Archiving Symposium in June '23
Creative Commons Zero v1.0 Universal UpdatedJun 14, 2023 -
muckrock-client Public
Simple Python client for MuckRock API
Python MIT License UpdatedMay 30, 2023 -
-
-
test-eval Public
Schema and code for the History Lab test evaluation framework
MIT License UpdatedMar 16, 2023 -
-
pdb-gui Public
A prototype query interface to the FOIArchive's PDB corpus.
Python MIT License UpdatedJan 26, 2023 -
piir-poc-dp Public archive
PII proof of concept with DataProfiler
-
ddmd Public
Generates a Markdown table description based on SQL data dictionary information
MIT License UpdatedSep 30, 2022 -
-
optimal-data-loads Public
Materials for PGCONF NYC 2022 presentation: Tips and techniques to optimize the 'L' component of your PostgreSQL ETL and ELT processes and make them easy to maintain. Includes real-world examples w…
Python UpdatedSep 21, 2022 -
tqs Public
Forked from martinamaximovich/improvingOCRCode for generating a text quality score for a text file, intended to measure OCRed text quality. It's a fork of martinamaximovich/improvingOCR.
Python GNU General Public License v3.0 UpdatedSep 19, 2022 -
csv2pg Public
Utility for loading a CSV file into PostgreSQL.
Python MIT License UpdatedJul 13, 2022 -
-
pg-pandas-profiling Public
A Python package that executes pandas profiling on the results of a SQL query run against a PostgreSQL database.
Python MIT License UpdatedJun 2, 2022 -
foiarchive-search-prototype Public
Ideas for a new FOIArchive search interface
Python MIT License UpdatedMay 16, 2022 -
article-hugo-website Public
Forked from second-state/hugo-websiteA no-code, no-software and no-cost solution to publishing sophisticated web sites managed by non-technical people.
HTML UpdatedApr 29, 2022 -