-
FlexOlmo Public
Forked from allenai/FlexOlmo
Code and training scripts for FlexOlmo
Python Apache License 2.0 Updated Sep 23, 2025 -
transformers Public
Forked from 2015aroras/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
OLMo-core Public
Forked from allenai/OLMo-core
PyTorch building blocks for the OLMo ecosystem
Python Apache License 2.0 Updated Jun 1, 2025 -
-
mteb Public
Forked from embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Python Apache License 2.0 Updated Jul 15, 2024 -
-
detect-pretrain-code Public
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu, Terra Blevins…
-
silo-lm Public
Forked from kernelmachine/silo-lm
SILO Language Models code repository
-
GenRead Public
Forked from wyu97/GenRead
Code and Checkpoints for "Generate rather than Retrieve: Large Language Models are Strong Context Generators" in ICLR 2023.
Python Updated Jan 29, 2023 -
-
DESCGEN Public
This repo contains data and code for the paper: DESCGEN: A Distantly Supervised Dataset for Generating Abstractive Entity Descriptions
7 Updated Sep 16, 2021 -
surface-form-competition Public
Forked from peterwestuw/surface-form-competition
Python Updated Sep 8, 2021 -
-
bilm-tf Public
Forked from allenai/bilm-tf
Tensorflow implementation of contextualized word representations from bi-directional language models
Python Apache License 2.0 Updated Mar 21, 2019 -
paraphrase_identification Public
Forked from wasiahmad/paraphrase_identification
Examine two sentences and determine whether they have the same meaning.
Rich Text Format MIT License Updated Feb 5, 2019 -
-
complex Public
Forked from ttrouill/complex
Source code for experiments in the papers "Complex Embeddings for Simple Link Prediction" (ICML 2016) and "Knowledge Graph Completion via Complex Tensor Factorization" (JMLR 2017).
Python Other Updated Aug 16, 2018