
Organizations

@mynlp @llm-jp


This is the repository for the LLM-jp membership inference attack.

Python 4 2 Updated Jan 18, 2025

A library for mechanistic interpretability of GPT-style language models

Python 2,895 481 Updated Dec 7, 2025

Interpretability for sequence generation models 🐛 🔍

Python 450 38 Updated Dec 3, 2025

Materials for EACL2024 tutorial: Transformer-specific Interpretability

Jupyter Notebook 61 3 Updated Mar 26, 2024

[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs

Python 51 9 Updated May 26, 2025

Official Repository for Dataset Inference for LLMs

Jupyter Notebook 43 7 Updated Jul 25, 2024

Python implementation of an extension of the Kolmogorov-Smirnov test for multivariate samples

Python 13 Updated Aug 6, 2023

Python package for measuring memorization in LLMs.

Jupyter Notebook 175 30 Updated Jul 16, 2025

An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).

Python 58 7 Updated Aug 13, 2024
Python 62 16 Updated Jun 13, 2024

The official repository for the paper entitled "Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models."

Python 5 Updated Aug 10, 2025
Python 2 Updated Oct 25, 2024

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,691 200 Updated Nov 15, 2025

JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation

Macaulay2 113 25 Updated Jun 11, 2023

Fan-made Mac version of the virtual desktop pet simulator (features incomplete!)

Swift 69 8 Updated Jun 9, 2025

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 958 96 Updated May 3, 2024

Noise-robust de-duplication at scale

Python 19 2 Updated Apr 9, 2023

All-in-one text de-duplication

Python 736 74 Updated Aug 31, 2025

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Python 12,007 797 Updated Dec 15, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,481 202 Updated May 7, 2025

A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html

HTML 896 108 Updated Dec 8, 2025
Python 8 Updated Dec 27, 2022

This repository contains the files for Syntactic and Semantic Uniformity for Semantic Parsing and Task-Oriented Dialogue Systems

Python 2 Updated May 10, 2023

Codebase for SIGIR 2022 paper: Coarse-to-Fine Sparse Sequential Recommendation

Python 23 3 Updated Nov 21, 2022
Python 10 5 Updated May 24, 2022

Crawl BookCorpus

Python 848 109 Updated Jul 14, 2023
Python 1,622 145 Updated Apr 27, 2023

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Python 5,681 453 Updated Oct 31, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,033 6,637 Updated Sep 30, 2025