Stars
A community-maintained Python framework for creating mathematical animations.
Unsupervised text tokenizer for Neural Network-based text generation.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
🔡 List of Tools, Libraries, Models, Datasets and other resources for Turkish Natural Language Processing.
A collection of scientific methods, processes, algorithms, and systems to build stories & models.
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as a CSV file. Plus a module to look up the data and parse compound words.
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
A beginner's guide to learning, implementing and using PDDL.
TensorFlow code and pre-trained models for BERT
Source code for Twitter's Recommendation Algorithm
Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL…
Use AI to translate code from one language to another.
Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.
BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s layers? A closer look at the NLP pipeline in monolingual and…
An IPython Notebook tutorial on deep learning for natural language processing, including structure prediction.
Natural Language Processing Best Practices & Examples
A repository for the 2022 Inflection Shared Task
An MNIST-like fashion product database. Benchmark 👇
Examples and libraries for "Natural Language Processing in Action" book
Extremely simple and fast word2vec implementation with Negative Sampling + Sub-sampling
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, …
State-of-the-art language models and classifier for Kannada, which is spoken predominantly by Kannada people in India, mainly in the state of Karnataka
VIP cheatsheets for Stanford's CS 230 Deep Learning