Starred repositories
This is a repo with links to everything you'd ever want to learn about data engineering
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
Python-based tools for document analysis and OCR
This is a workshop designed for Amazon Bedrock, a foundational model service.
Library extending Jupyter notebooks to integrate with Apache TinkerPop, openCypher, and RDF SPARQL.
LLM-based ontological extraction tools, including SPIRES
Graph Data Science: an abstraction layer in Python for building knowledge graphs, integrated with popular graph libraries – atop Pandas, NetworkX, RAPIDS, RDFlib, pySHACL, PyVis, morph-kgc, pslpyth…
Heritage Connector: Transforming text into data to extract meaning and make connections
A comprehensive guide to maplib.
Pass data between a Google Sheet and a Jupyter notebook
Jupyter Notebooks Relating to Open Context (https://opencontext.org)
Jupyter notebook projects using BL's Collections data and Sources
This repository contains scripts for accessing, extracting and transforming epigraphic datasets from the Epigraphic Database Heidelberg (https://edh.ub.uni-heidelberg.de/) in a reproducible manner.
This course provides an introduction to working with geospatial data in Python.
Jupyter Notebooks Relating to working with APIs
Visualising the poetic cultures of 18th-century periodicals