Stars
spaCy module for linking text to Wikidata items
🪐 End-to-end NLP workflows from prototype to production
blubrom / MLCA
Forked from wissemriahi/boaviztapia. Tool to perform simplified LCA on ML processes based on BOAVIZTAPI.
Track and predict the energy consumption and carbon footprint of training deep learning models.
A framework for few-shot evaluation of language models.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Backend resources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents' questions.
🐢 Open-Source Evaluation & Testing library for LLM Agents
Public repository for the EMNLP 2023 Findings paper: "Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters
We analyze and mitigate gender bias in MT tokenizers.
Doctor Dignity is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
Meditron is a suite of open-source medical Large Language Models (LLMs).
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Official style files for papers submitted to venues of the Association for Computational Linguistics
A high-throughput and memory-efficient inference and serving engine for LLMs
Repository for the *SEM 2023 paper „Mann“ is to “Donna” as「国王」is to « Reine »: Adapting the Analogy Task for Multilingual and Contextual Embeddings
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
[COLING22] An End-to-End Library for Evaluating Natural Language Generation
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation