Skip to content
View pdufter's full-sized avatar

Highlights

  • Pro

Organizations

@cisnlp

Block or report pdufter

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Robust recipes to align language models with human and AI preferences

Python 5,544 478 Updated Sep 8, 2025

An Extensible Deep Learning Library

Python 2,336 402 Updated Feb 18, 2026

A Python + iCloud wrapper to access iPhone and Calendar data.

Python 66 23 Updated Jan 28, 2023
Python 13 Updated Apr 16, 2021

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting has…

Python 675 95 Updated Jun 2, 2025

Jina examples and demos to help you get started

Python 463 140 Updated Nov 1, 2021

☁️ Build multimodal AI applications with cloud-native stack

Python 21,855 2,238 Updated Mar 24, 2025

An open-registry for hosting Jina executors via container images

Python 108 47 Updated Aug 31, 2021

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Python 1,305 221 Updated Mar 23, 2022

This repo supports various cross-lingual transfer learning & multilingual NLP models.

Python 93 6 Updated Sep 13, 2023

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)

Python 1,178 131 Updated Aug 28, 2024

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 3,001 166 Updated Jul 9, 2025
Python 25 6 Updated Jan 22, 2024

A word2vec negative sampling implementation with correct CBOW update.

C++ 261 18 Updated Nov 8, 2021

Acceptance rates for the major AI conferences

Jupyter Notebook 4,740 315 Updated Sep 23, 2025

LibKGE - A knowledge graph embedding library for reproducible research

Python 828 132 Updated Apr 8, 2024

Getting interpretable dimensions in word embedding spaces.

Python 15 2 Updated Jul 6, 2023

Analyzing mBERT's multilinguality in a small laboratory setting

Python 13 2 Updated Jun 12, 2023

A list of selected resources, methods, and tools dedicated to Legal Text Analytics.

709 139 Updated Nov 5, 2024

Helper to create posts for Bayern Ticket Mitfahrer groups in Facebook.

HTML 2 Updated Apr 11, 2019

DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models

159 12 Updated Dec 6, 2022

Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper

Python 412 44 Updated Jun 23, 2024

Unsupervised text tokenizer focused on computational efficiency

C++ 977 109 Updated Mar 29, 2024

BERT-related papers

2,039 279 Updated Aug 12, 2023

Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)

Python 393 52 Updated Nov 7, 2023

Papers & presentation materials from Hugging Face's internal science day

2,055 118 Updated Oct 31, 2020

Python framework for creating, editing, and running Noisy Intermediate-Scale Quantum (NISQ) circuits.

Python 4,904 1,202 Updated Mar 27, 2026

Language-Agnostic SEntence Representations

Jupyter Notebook 3,662 461 Updated May 2, 2024

A framework to learn cross-lingual word embedding mappings

Python 654 136 Updated Apr 22, 2023

👓 A web interface of gpustat: monitor GPU clusters at a look

Python 361 43 Updated Feb 17, 2026
Next