Lists (17)
Sort Name ascending (A-Z)
Stars
Text-to-image samples collected for the evaluation of DALL-E 3 in the whitepaper.
PyTorch package for the discrete VAE used for DALL·E.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
A computer vision system for automated analysis of index cards from a collection of coin forgeries using Qwen2.5-VL vision-language model. Developed for the imagines nummorum project.
Introductory workshop on Python and NLP for humanities
Handbuch zur Erstellung diskriminierungsfreier Metadaten für historische Quellen und Forschungsdaten: Erfahrungen aus dem historischen Forschungsprojekt Stadt.Geschichte.Basel.
A Python package to inspect formal quality problems in research data.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Ein QA-basiertes RAG-Tool zur Erkundung von Archivalien
Begleitmaterial für den Kurs Machine Learning mit Python
For collaborative work on extracting information from BZK index cards
flatten your xml for OpenRefine
Using machine learning approaches for automatic topic detection in a multilingual environment
Tesseract Open Source OCR Engine (main repository)
A tool to convert EAC-CPF and EAD 2002 XML files to RDF datasets conforming to Records in Contexts Ontology (RiC-O)
Repository for the paper "Literary Metaphor Detection with LLM Fine-Tuning and Few-Shot Learning".
knaw-huc / loghi
Forked from rvankoert/loghiLoghi is a comprehensive toolkit designed for Handwritten Text Recognition (HTR) and Optical Character Recognition (OCR), offering an accessible approach to transcribing historical documents and tr…
A deep learning toolkit specialized for handwritten document analysis
Python tool for converting files and office documents to Markdown.
Utilities intended for use with Llama models.