Skip to content
View amittai's full-sized avatar

Block or report amittai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Cynical data selection

Perl 20 7 Updated Jan 16, 2021

These are lists for a variety of languages containing words that are distinctive to each language.

42 5 Updated Apr 5, 2022

is scot injured?

HTML 2 Updated Aug 15, 2023

End-to-End Speech Processing Toolkit

Python 9,819 2,399 Updated Apr 27, 2026

NAACL website

HTML 4 8 Updated Apr 22, 2026

Reference implementations of MLPerf® training benchmarks

Python 1,755 587 Updated Apr 16, 2026

Reference implementations of MLPerf® inference benchmarks

Python 1,559 621 Updated Apr 27, 2026

Quick & dirty hack to read AMD Ryzen rapl counters

C 74 12 Updated Sep 4, 2018

Democratizing NLP!

Jupyter Notebook 106 29 Updated Dec 6, 2023

Python port of Moses tokenizer, truecaser and normalizer

Python 495 59 Updated Feb 6, 2026

[Discontinued] Auryo - Unofficial Soundcloud Desktop App

TypeScript 625 44 Updated Dec 10, 2022

Fast Neural Machine Translation in C++

C++ 1,443 247 Updated Aug 25, 2023
Python 121 40 Updated Mar 15, 2017

C++/CUDA toolkit for training sequence and sequence-to-sequence models across multiple GPUs

C++ 185 64 Updated May 15, 2017

Examples and scripts using Blocks

Python 147 91 Updated Aug 22, 2016

A Multilingual and Multilevel Representation Learning Toolkit for NLP

C++ 117 31 Updated Feb 14, 2018

SALM: Suffix Array and its Applications in Empirical Language Processing by Joy

C++ 11 5 Updated Dec 22, 2017

Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better Hypothesis Testing for Statistical Machine Translation: Cont…

Groff 205 39 Updated Feb 25, 2023

GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package also contains the source for the mkcls tool which generates th…

C++ 273 82 Updated Nov 18, 2025

A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.

C++ 166 61 Updated May 12, 2021

Simple, fast unsupervised word aligner

C++ 770 163 Updated Jul 19, 2022

Moses, the machine translation system

Roff 1,625 775 Updated Mar 28, 2025

A workflow management system for researchers who heart Unix.

Scala 128 14 Updated Sep 23, 2015
C++ 1 Updated Nov 3, 2014

Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms

C++ 185 77 Updated May 26, 2020