-
retrieval-scaling Public
Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
-
-
massive-serve Public
Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.
-
-
RAG-evaluation-harnesses Public
An evaluation suite for Retrieval-Augmented Generation (RAG).
-
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryUnified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python Apache License 2.0 UpdatedApr 10, 2025 -
Search-R1 Public
Forked from PeterGriffinJin/Search-R1Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Python Apache License 2.0 UpdatedMar 14, 2025 -
-
-
LightSeq Public
Official repository for DistFlashAttn: Distributed Memory-efficient Attention for Long-context LLMs Training
-
bff Public
Forked from allenai/bffLarge-scale deduplication & decontamination using bloom filters
Rust Apache License 2.0 UpdatedMay 26, 2024 -
instruct-eval Public
Forked from declare-lab/instruct-evalThis repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Python Apache License 2.0 UpdatedFeb 28, 2024 -
open_lm Public
Forked from mlfoundations/open_lmA repository for research on medium sized language models.
Python MIT License UpdatedDec 20, 2023 -
Llava-doremi Public
Automatic task-balancing for vision-language instruction tuning using group distributionally robust optimization (Group DRO, the technique used in Doremi)
-
VL-Instruct Public
Codes for vision-language instruction tuning. Currently support BLIP2-t5 and BLIP2-vicuna.
-
FastCkpt Public
Python package for rematerialization-aware gradient checkpointing
-
UWrc.github.io Public
Forked from UWrc/UWrc.github.ioThe best research computing website in all the land.
JavaScript UpdatedOct 27, 2023 -
NeMo Public
Forked from barry-jin/NeMoNeMo: a toolkit for conversational AI
Python Apache License 2.0 UpdatedApr 18, 2023 -
denoised-smoothing Public
Forked from microsoft/denoised-smoothingProvably defending pretrained classifiers including the Azure, Google, AWS, and Clarifai APIs
Jupyter Notebook MIT License UpdatedNov 7, 2022 -
-
ALBEF Public
Forked from salesforce/ALBEFCode for ALBEF: a new vision-language pre-training method
Python BSD 3-Clause "New" or "Revised" License UpdatedJun 29, 2022 -
AdversariallyRobustDistillation Public
Forked from goldblum/AdversariallyRobustDistillationPytorch implementation of Adversarially Robust Distillation (ARD)
-
RepGAN Public
Forked from NVlabs/stylegan2-ada-pytorchCode base from StyleGAN2-ADA
Python Other UpdatedMay 3, 2022 -
Code for the paper "On the Adversarial Robustness of Visual Transformers"
-