Skip to content
View OrianeN's full-sized avatar

Block or report OrianeN

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A Simple Python Module for German Grapheme To Phoneme Conversion

Python 8 Updated Apr 26, 2024

[EMNLP 2025 Findings] TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation

Jupyter Notebook 6 Updated Oct 9, 2025

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

Python 808 159 Updated Mar 25, 2026

Library for fast text representation and classification. Fix compatibility with numpy 2

HTML 14 2 Updated Nov 21, 2024

Python module (C extension and plain python) implementing Aho-Corasick algorithm

C 1,097 142 Updated Dec 17, 2025

Hands-on exercises for the "NLP for Dialects" MSc seminar at LMU Munich

Jupyter Notebook 3 Updated Dec 4, 2025

Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13

Python 210 27 Updated Mar 12, 2026

Scripts and metadata for the paper "Corpus-based dialectometry with topic models"

Python 3 Updated Sep 27, 2024

Flexible, extensible and scalable web-based speech annotation tool

Python 14 3 Updated Apr 4, 2025

Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern string search

Python 35 6 Updated Jul 7, 2022

Journey towards Fine-Tuning a Breton speaking Chat Model

Python 4 Updated May 9, 2025

Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German

518 66 Updated Oct 30, 2024

[LT4HALA 2020] Phonetic lexicon generator and sound change applier

Python 4 2 Updated Mar 31, 2025

😈Awful AI is a curated list to track current scary usages of AI - hoping to raise awareness

1 Updated Jul 29, 2020
Python 1 Updated May 2, 2025
Python 5 2 Updated Nov 17, 2023

A collection of useful .gitignore templates

173,448 82,626 Updated Apr 17, 2026

A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.

Python 24 10 Updated Oct 27, 2023

Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https://arxiv.org/abs/2309.08351)

Python 29 6 Updated Apr 17, 2024

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts)

Python 3,077 265 Updated Apr 18, 2026

Material for a course on Advanced NLP

HTML 16 3 Updated Jul 22, 2025
Jupyter Notebook 31 2 Updated Sep 23, 2024

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,975 3,610 Updated Jul 28, 2024

Extension for pie to include taggers with their models and pre/postprocessors

Python 11 3 Updated May 30, 2024

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Python 3,809 213 Updated Jul 9, 2024

A Neural Framework for MT Evaluation

Python 739 108 Updated Mar 27, 2026

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 6,795 223 Updated Apr 1, 2026

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.

Python 320 54 Updated Apr 8, 2026
Python 1 2 Updated Jun 12, 2025

Interactive Widgets for the Jupyter Notebook

TypeScript 3,308 965 Updated Nov 7, 2025
Next