PosoSAgapo

Highlights

  • Pro

Organizations

@mynlp @llm-jp


Code to verify citations in a bibtex file

Python 13 1 Updated Mar 14, 2026

Repository for the LLM-jp membership inference attack.

Python 5 2 Updated Jan 18, 2025

A library for mechanistic interpretability of GPT-style language models

Python 3,306 551 Updated Apr 13, 2026

Interpretability for sequence generation models 🐛 🔍

Python 463 39 Updated Mar 6, 2026

Materials for EACL2024 tutorial: Transformer-specific Interpretability

Jupyter Notebook 64 3 Updated Mar 26, 2024

[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs

Python 55 10 Updated May 26, 2025

Official Repository for Dataset Inference for LLMs

Jupyter Notebook 41 7 Updated Jul 25, 2024

Python implementation of an extension of the Kolmogorov-Smirnov test for multivariate samples

Python 13 Updated Aug 6, 2023

Python package for measuring memorization in LLMs.

Jupyter Notebook 187 31 Updated Jul 16, 2025

An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).

Python 61 7 Updated Aug 13, 2024
Python 62 16 Updated Jun 13, 2024

The official repository for the paper entitled "Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models."

Python 7 2 Updated Aug 10, 2025
Python 2 Updated Oct 25, 2024

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,769 210 Updated Nov 15, 2025

JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation

Macaulay2 115 24 Updated Jun 11, 2023

Fan-made Mac version of the virtual desktop pet simulator (features incomplete!)

Swift 72 8 Updated Jun 9, 2025

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 990 95 Updated May 3, 2024

Noise-robust de-duplication at scale

Python 19 2 Updated Apr 9, 2023

All-in-one text de-duplication

Python 750 76 Updated Mar 9, 2026

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Python 12,279 805 Updated Apr 13, 2026

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,591 202 Updated May 7, 2025

A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html

HTML 905 109 Updated Apr 1, 2026
Python 8 Updated Dec 27, 2022

This repository contains the files for Syntactic and Semantic Uniformity for Semantic Parsing and Task-Oriented Dialogue Systems

Python 2 Updated May 10, 2023

Codebase for SIGIR 2022 paper: Coarse-to-Fine Sparse Sequential Recommendation

Python 23 3 Updated Nov 21, 2022
Python 10 5 Updated May 24, 2022

Crawl BookCorpus

Python 854 112 Updated Jul 14, 2023
Python 1,642 149 Updated Apr 27, 2023

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Python 5,847 465 Updated Mar 26, 2026