Skip to content
View arne-cl's full-sized avatar
  • Potsdam

Organizations

@pelias @trost @discourse-lab @NLPbox

Block or report arne-cl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
36 stars written in HTML
Clear filter

Create graphics with a hand-drawn, sketchy, appearance

HTML 20,914 645 Updated Jul 28, 2024

Convert PDF to HTML without losing text or format.

HTML 10,593 1,853 Updated Jun 2, 2023

A vintage 1980s DOS inspired Twitter Bootstrap theme

HTML 6,850 315 Updated Nov 12, 2025

Compilation of public failure/horror stories related to Kubernetes

HTML 6,212 309 Updated Aug 23, 2020

A rendition of everyone's favorite 1995 Microsoft operating system for Linux.

HTML 5,697 187 Updated May 27, 2025

extract text from any document. no muss. no fuss.

HTML 4,495 667 Updated Apr 3, 2026

The official online compendium for Mining the Social Web, 2nd Edition (O'Reilly, 2013)

HTML 2,890 1,459 Updated Jul 11, 2022

A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.

HTML 2,803 751 Updated Jul 3, 2021

Statistical Data Analysis in Python

HTML 1,721 964 Updated Sep 2, 2015

Convert LaTeX documents into beautiful responsive web pages using LaTeXML.

HTML 1,102 90 Updated Jan 3, 2024

Create, edit and display a journal article, entirely in GitHub

HTML 620 82 Updated Dec 9, 2022

Given a domain, will tell you the decisions that the domain owner has made.

HTML 539 75 Updated Sep 12, 2018

Universal Dependencies online documentation

HTML 291 271 Updated Apr 3, 2026

Genealogy of Elizas

HTML 290 39 Updated Mar 5, 2026

Examine two sentences and determine whether they have the same meaning.

HTML 223 82 Updated Feb 5, 2019

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

HTML 149 48 Updated Feb 16, 2020

An ongoing fun challenge where I'll try to post one Python benchmark per day.

HTML 134 29 Updated Mar 23, 2015

CollateX – Software for Collating Textual Sources

HTML 98 43 Updated Jan 22, 2026

Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.

HTML 85 16 Updated Mar 1, 2016

Temporal Expression Recognition and Normalisation in Python

HTML 77 17 Updated Jan 27, 2016

Python package for stylometry

HTML 64 13 Updated Mar 30, 2021

Back to the Future Java (b2fJ) aims at bringing the power of Java to 8-bit home computers of the '80s. This project provides a toolchain to cross-compile Java programs under Windows.

HTML 51 2 Updated Jun 20, 2021

Amsterdam Content Analysis Toolkit

HTML 46 15 Updated Jul 6, 2022

Python static blog generator

HTML 42 5 Updated Nov 1, 2016

Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.

HTML 35 9 Updated May 25, 2023
HTML 33 7 Updated Nov 12, 2015

Parsing Time: Learning to Interpret Time Expressions

HTML 31 3 Updated Apr 14, 2023

port of nevan scott's "mockingbird" to pelican

HTML 28 14 Updated Apr 9, 2020
HTML 24 2 Updated Jul 6, 2015
Next