Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 845 94 Updated Oct 13, 2025

💪 Models' quality and performance metrics (R2, ICC, LOO, AIC, BF, ...)

R 1,119 104 Updated Dec 5, 2025

Materials for my Mixed Model Workshop

HTML 69 12 Updated Jul 26, 2019

OLMost every training recipe you need to perform data interventions with the OLMo family of models.

Python 62 11 Updated Dec 25, 2025

Attribute (or cite) statements generated by LLMs back to in-context information.

Jupyter Notebook 312 25 Updated Oct 8, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,794 2,897 Updated Dec 25, 2025

Fit GLMM with brms

HTML 3 1 Updated Apr 18, 2022

ICLR 2025 Workshop & CHI 2025 SIG: "Bidirectional Human-AI Alignment"

47 1 Updated Aug 6, 2024

EMNLP 2025 Two Papers - Value-Action Gap in LLMs (Main Track); ValueCompass (WiNLP Workshop)

Jupyter Notebook 3 1 Updated Nov 5, 2025

(EMNLP 2025) Should I Share this Translation? Evaluating Quality Feedback for User Reliance on Machine Translation

HTML 2 Updated Aug 28, 2025

GraphicBench: A Planning Benchmark for Graphic Design Generation with Language Agents

JavaScript 4 Updated Apr 17, 2025

What would you do with 1000 H100s...

Jupyter Notebook 1,134 69 Updated Jan 10, 2024

A platform for building reliable AI agents

Python 88 5 Updated Dec 23, 2025

⚔️ Official code of "Search Arena: Analyzing Search-Augmented LLMs".

Jupyter Notebook 46 7 Updated Jun 6, 2025

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,459 142 Updated Mar 7, 2025

The best ChatGPT that $100 can buy.

Python 39,246 4,974 Updated Dec 23, 2025

Replication data and code for the paper: When LLMs are Reliable for Judging Empathic Communication

Jupyter Notebook 3 1 Updated Sep 30, 2025

Micromodels -- A framework for accurate, explainable, data efficient, and reusable NLP models.

Python 14 3 Updated Feb 7, 2023

[ICML 2025] HypotheSAEs: Hypothesizing interpretable relationships in text datasets using sparse autoencoders. https://arxiv.org/abs/2502.04382

Jupyter Notebook 67 21 Updated Oct 29, 2025

Code, data, and models for the ICWSM 2023 paper "Bridging Nations: Quantifying the Role of Multilinguals in Communication on Social Media"

Jupyter Notebook 1 Updated Apr 10, 2023

Topic modeling on 100,000 r/AmItheAsshole threads.

Python 7 1 Updated Feb 13, 2023

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,666 12,216 Updated Dec 21, 2025

This repository contains published parts test cross-lingual calls, collected and annotated as part of the InCroMin project (an FSTP under the EU project UTTER).

1 Updated Jul 8, 2025

Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"

Python 15 2 Updated Aug 2, 2025

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Python 515 32 Updated Dec 31, 2024

Code for the ICML 2025 paper "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"

Python 20 1 Updated Dec 16, 2025