Organizations

@BIU-NLP

Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".

Python 82 11 Updated Dec 16, 2025

Multilingual extension of T0

Python 1 3 Updated Apr 20, 2022
Python 30 9 Updated Sep 5, 2021
Python 2,921 333 Updated Dec 10, 2025
Python 102 34 Updated Mar 4, 2024

Open-Domain Question Answering Goes Conversational via Question Rewriting

Python 164 19 Updated May 23, 2022

Google Research

Jupyter Notebook 36,933 8,273 Updated Dec 19, 2025

Minimalist NMT for educational purposes

Python 708 223 Updated Jan 29, 2024

Data and code for Knowledge-Based Machine Translation Evaluation (KoBE)

6 Updated Oct 14, 2020

Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)

Python 385 50 Updated Nov 7, 2023

Yet Another (natural language) Parser

Go 87 26 Updated Nov 8, 2022

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,335 2,128 Updated Oct 27, 2025

XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.

Python 649 110 Updated Jan 4, 2023

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,329 1,009 Updated Dec 16, 2025

NMT domain adaptation papers (updating...)

17 1 Updated Jun 1, 2019

State-of-the-Art Text Embeddings

Python 18,024 2,720 Updated Dec 18, 2025

A tool for holistic analysis of language generation systems

Python 471 58 Updated Sep 22, 2025

Domain Adaptation of Neural Machine Translation by Lexicon Induction

Shell 20 5 Updated Jan 3, 2020

A Python package for sampling from determinantal point processes

Jupyter Notebook 36 8 Updated Oct 28, 2018

XLNet for generating language.

Python 166 20 Updated Jan 30, 2021
Python 34 12 Updated Nov 24, 2020

Machine Learning applied to Natural Language Processing Toolkit used in the Lisbon Machine Learning Summer School

Jupyter Notebook 226 224 Updated Jul 25, 2025

KenLM: Faster and Smaller Language Model Queries

C++ 2,706 531 Updated Mar 30, 2025

Scripts to compute Moore-Lewis scores using KenLM

Shell 7 1 Updated Jul 10, 2018

TensorFlow code and pre-trained models for BERT

Python 39,756 9,708 Updated Jul 23, 2024

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,795 2,079 Updated Jan 23, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 38,488 4,158 Updated Dec 19, 2025

Approximate Nearest Neighbor Search for Sparse Data in Python!

Python 920 145 Updated Oct 2, 2020

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,122 31,503 Updated Dec 22, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,522 1,316 Updated Dec 18, 2025