Skip to content
View phu-pmh's full-sized avatar
  • New York University
  • New York, NY

Organizations

@nyu-mll

Block or report phu-pmh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 7,038 543 Updated Apr 13, 2026

This repository contains cutting-edge open-source security tools (OST) for a red teamer and threat hunter.

10,246 2,344 Updated Sep 29, 2025

Making open safety AI models accessible and beneficial to the safety community

Jupyter Notebook 108 13 Updated Apr 17, 2026

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 60,418 6,177 Updated Apr 18, 2026

DSPy: The framework for programming—not prompting—language models

Python 33,785 2,803 Updated Apr 17, 2026

Probabilistic LLM evaluations. [CogSci2023; ACL2023]

Python 73 4 Updated Jul 27, 2024

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 73,464 7,923 Updated Mar 11, 2026

Democratizing NLP!

Jupyter Notebook 106 29 Updated Dec 6, 2023

Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch

Python 1,216 320 Updated Oct 24, 2024

Improved Sentence Alignment in Linear Time and Space

Python 196 35 Updated Mar 6, 2023
Python 34 3 Updated Nov 22, 2021

Machine-Translation-based sentence alignment tool for parallel text

Python 315 80 Updated Mar 18, 2021

An original implementation of ACL 2019, "Multi-hop Reading Comprehension through Question Decomposition and Rescoring"

Python 138 35 Updated Apr 23, 2022

Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"

Python 211 20 Updated Aug 31, 2021
Python 9 4 Updated Jan 18, 2023

Multilingual Compositional Wikidata Questions (MCWQ)

Python 20 Updated Jun 12, 2023

Repository for the Bias Benchmark for QA dataset.

Python 141 31 Updated Jan 8, 2024

Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simplicity can mask nuances in evaluation items (examples) and s…

Python 18 6 Updated Mar 30, 2022

Bayesian IRT models in Python

Python 161 53 Updated Mar 30, 2026

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,229 617 Updated Jul 19, 2024

Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.

Python 579 51 Updated Mar 11, 2026

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

Python 944 207 Updated Apr 18, 2026

Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data

Python 57 11 Updated Aug 5, 2021

Materials from Stan conferences

HTML 256 83 Updated Oct 22, 2020

😈Awful AI is a curated list to track current scary usages of AI - hoping to raise awareness

7,424 254 Updated Feb 20, 2025

Replication code for "With Little Power Comes Great Responsibility"

Jupyter Notebook 39 1 Updated Oct 15, 2020
Python 1,652 316 Updated Jul 20, 2023

Analysis of NLU test sets with IRT

Jupyter Notebook 12 8 Updated Jul 23, 2021

Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"

200 13 Updated Dec 2, 2020
Next