Skip to content
View dhofu's full-sized avatar

Block or report dhofu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
39 stars written in Python
Clear filter

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 100,814 28,028 Updated Jun 16, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,050 18,121 Updated Jun 16, 2026

Convert PDF to markdown + JSON quickly with high accuracy

Python 36,138 2,493 Updated Jun 6, 2026

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 20,829 1,489 Updated Jun 13, 2026

Toolkit for linearizing PDFs for LLM datasets/training

Python 17,396 1,399 Updated Mar 25, 2026

Topic Modelling for Humans

Python 16,441 4,407 Updated Nov 1, 2025

Image annotation with Python. Supports polygon, rectangle, circle, line, point, and AI-assisted annotation.

Python 15,966 3,676 Updated Jun 16, 2026

Jupyter metapackage for installation and documentation

Python 15,321 4,514 Updated Jun 7, 2026

NLTK Source

Python 14,649 3,010 Updated Jun 11, 2026

Ollama Python library

Python 10,173 1,076 Updated Apr 30, 2026

Open-source, low-code AutoML platform for Python. PyCaret 4.0: sklearn-native engine + React control plane.

Python 9,811 1,857 Updated Jun 16, 2026

The SQL IDE for Your Terminal.

Python 6,163 157 Updated Jun 16, 2026

All-in-One Development Tool based on PaddlePaddle

Python 6,162 1,199 Updated Jun 12, 2026

A Unified Toolkit for Deep Learning Based Document Image Analysis

Python 5,739 535 Updated Aug 15, 2024

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Python 904 107 Updated Apr 16, 2026

🐦 Quickly annotate data from the comfort of your Jupyter notebook

Python 786 128 Updated Apr 4, 2024

Code for the Molmo2 Vision-Language Model

Python 652 41 Updated Mar 18, 2026

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw

Python 614 102 Updated Dec 6, 2024

Convert between CBOR, JSON, MessagePack, TOML, and YAML 1.1 & 1.2

Python 549 35 Updated Jun 15, 2026

Linked Open Data Modeling Language

Python 538 175 Updated Jun 15, 2026

PPOCRLabelv3 is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-recognize data.

Python 425 105 Updated Apr 24, 2026

Python library for computer vision labeling tasks. The core functionality is to translate bounding box annotations between different formats-for example, from coco to yolo.

Python 341 63 Updated Aug 14, 2024

🐦 Quickly annotate data from the comfort of your Jupyter notebook

Python 281 43 Updated Jun 9, 2023

Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.

Python 264 46 Updated Jun 16, 2026

DaNLP is a repository for Natural Language Processing resources for the Danish Language.

Python 209 34 Updated Feb 12, 2025

Optical table recognition - recognize tables in scan images using OpenCV

Python 112 40 Updated Jul 26, 2019

DaCy: The State of the Art Danish NLP pipeline using SpaCy

Python 103 22 Updated Jun 11, 2026

Python RIS files parser, provides RIS files as dictionary via generator.

Python 82 20 Updated May 23, 2025

Attention-based sequence-to-sequence model for handwritten word recognition

Python 65 13 Updated Sep 22, 2024

A python-built web crawler to automate file downloads off of https://www.moodle.tum.de/

Python 33 12 Updated Nov 11, 2023
Next