Stars
A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Faster Whisper transcription with CTranslate2
The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations".
Scalable training for dense retrieval models.
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
📋 A list of open LLMs available for commercial use.
NAACL 2021: Are NLP Models really able to Solve Simple Math Word Problems?
This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.
Shared repository for open-sourced projects from the Google AI Language team.
Examples and guides for using the OpenAI API
UnifiedQA: Crossing Format Boundaries With a Single QA System
Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering
Extrapolating knowledge graphs from unstructured text using GPT-3 🕵️‍♂️
Official Implementation of "Detecting Euphemisms with Literal Descriptions and Visual Imagery"
Enterprise Scale NLP with Hugging Face & SageMaker Workshop series
This is a ZSH plugin that enables you to use OpenAI's Codex AI in the command line.
Jupyter notebooks for the Natural Language Processing with Transformers book
Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations
Huggingface Transformers + Adapters = ❤️
PyTorch implementation of the paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)