Stars
Contains phonetically misspelled versions of tasks from the SuperGLUE dataset. The English misspellings reflect pronunciation patterns of native speakers of Hindi, Bengali, and Tamil.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
Unofficial PyTorch implementation of the paper "Dynamic Meta-Embeddings for Improved Sentence Representations" (EMNLP 2018)
An attempt to make Google BERT closer to production before Hugging Face Transformers etc.
Open source annotation tool for machine learning practitioners.
A Survey and Experiments on Annotated Corpora for Emotion Classification in Text
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW