Stars
TensorFlow code and pre-trained models for BERT
A community-maintained Python framework for creating mathematical animations.
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
An MNIST-like fashion product database.
Source code for Twitter's Recommendation Algorithm
Natural Language Processing Best Practices & Examples
Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.
Extremely simple and fast word2vec implementation with Negative Sampling + Sub-sampling
A list of ~100,000 German nouns and their grammatical properties, compiled from WiktionaryDE as a CSV file, plus a module to look up the data and parse compound words.
BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s layers? A closer look at the NLP pipeline in monolingual and…
Source code for the paper "Morphological Inflection Generation with Hard Monotonic Attention"
Classification of German nouns by gender using different RNN, CNN, and MLP architectures.
Morphological Inflection for Low-Resource Languages using cross-lingual transfer
A repository for the 2022 Inflection Shared Task
Implements a modified version of the Pegasos (Primal Estimated Sub-Gradient Solver for SVM) algorithm, as well as Perceptron and Averaged Perceptron, for comparison
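
For readers unfamiliar with the Pegasos entry above, the following is a minimal sketch of the standard Pegasos update for a linear SVM without a bias term. It is not the modified version that repository implements, and the names `pegasos_train` and `predict`, the default values for `lam` and `n_steps`, and the toy data are all assumptions made for illustration.

```python
import random


def pegasos_train(xs, ys, lam=0.01, n_steps=2000, seed=0):
    """Standard Pegasos (stochastic sub-gradient descent on the hinge loss).

    xs: list of feature tuples, ys: list of labels in {-1, +1}.
    lam, n_steps and seed are illustrative defaults, not values from any repo.
    """
    rng = random.Random(seed)
    w = [0.0] * len(xs[0])
    for t in range(1, n_steps + 1):
        i = rng.randrange(len(xs))          # pick one training example at random
        eta = 1.0 / (lam * t)               # step size decays as 1 / (lambda * t)
        margin = ys[i] * sum(wj * xj for wj, xj in zip(w, xs[i]))
        shrink = 1.0 - eta * lam            # regularization shrinks w every step
        w = [shrink * wj for wj in w]
        if margin < 1.0:                    # hinge loss is active: move toward y * x
            w = [wj + eta * ys[i] * xj for wj, xj in zip(w, xs[i])]
    return w


def predict(w, x):
    """Return +1 or -1 according to the sign of the linear score."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) >= 0 else -1


# Toy, linearly separable data (hypothetical).
xs = [(2.0, 1.0), (1.0, 2.0), (3.0, 3.0), (-2.0, -1.0), (-1.0, -2.0), (-3.0, -3.0)]
ys = [1, 1, 1, -1, -1, -1]
w = pegasos_train(xs, ys)
print([predict(w, x) for x in xs])
```

A perceptron differs mainly in using a fixed step size and no shrinkage, which is why the two are often compared on the same data.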