Training Large Language Model to Reason in a Continuous Latent Space

Python 1,412 154 Updated Aug 12, 2025
Python 12 Updated Jul 30, 2025
Python 1,669 101 Updated Sep 30, 2025

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,265 699 Updated Nov 19, 2025

Generative Representational Instruction Tuning

Jupyter Notebook 680 50 Updated Jun 25, 2025

SGPT: GPT Sentence Embeddings for Semantic Search

Jupyter Notebook 872 51 Updated Feb 17, 2024
Python 361 36 Updated Aug 7, 2025

AllenAI's post-training codebase

Python 3,473 478 Updated Dec 25, 2025

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,434 83 Updated Dec 22, 2025
Python 6 2 Updated Jul 24, 2025

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 220 17 Updated Jun 13, 2025

[Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers

Python 136 12 Updated Mar 14, 2024

OpenResearcher, an advanced Scientific Research Assistant

HTML 474 38 Updated Oct 10, 2024

One-stop shop for running and fine-tuning transformer-based language models for retrieval

Python 62 17 Updated Dec 20, 2025

State-of-the-Art Text Embeddings

Python 18,041 2,720 Updated Dec 22, 2025

Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.

Jupyter Notebook 22 2 Updated Nov 28, 2024

TrustRAG: The RAG Framework within Reliable input, Trusted output

Python 1,206 124 Updated Dec 12, 2025

Unified Learned Sparse Retrieval Framework

Python 68 7 Updated May 13, 2024

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python 561 81 Updated Dec 16, 2025

[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search

Python 102 8 Updated Dec 2, 2024

A Workbench for Autograding Retrieve/Generate Systems

Python 15 3 Updated Jun 30, 2025

Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)

Python 404 36 Updated Apr 3, 2025

final code for fultr

Python 10 3 Updated May 7, 2021

A modular RL library to fine-tune language models to human preferences

Python 2,376 203 Updated Mar 1, 2024

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Python 167 10 Updated Jun 13, 2024

Ollama Python library

Python 9,058 878 Updated Dec 11, 2025

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,547 130 Updated Feb 6, 2025

Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Findings of ACL 2024]

Python 69 3 Updated May 28, 2024

RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.

Python 143 8 Updated May 13, 2025

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 2,278 216 Updated May 25, 2024