Skip to content
View harsh19's full-sized avatar

Organizations

@sdspag @icebergnlp

Block or report harsh19

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting

Python 13 4 Updated Jan 2, 2026

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 615 59 Updated Apr 6, 2026

Code and data for LM Agents for Coordinating Multi-User Information Gathering

Python 8 4 Updated Feb 13, 2026

Kolmogorov Arnold Networks

Jupyter Notebook 16,231 1,552 Updated Jan 19, 2025
Python 1 Updated Mar 18, 2024

AI PDF chatbot agent built with LangChain & LangGraph

TypeScript 16,445 3,227 Updated Mar 27, 2026

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Python 1,064 77 Updated Mar 7, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,205 2,925 Updated Apr 14, 2026

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,260 4,001 Updated Jul 17, 2024

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,692 4,946 Updated Aug 1, 2024

The agent engineering platform

Python 133,630 22,079 Updated Apr 15, 2026

Examples and guides for using the OpenAI API

Jupyter Notebook 72,745 12,269 Updated Apr 15, 2026

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,336 6,928 Updated Apr 15, 2026

A latent text-to-image diffusion model

Jupyter Notebook 72,885 10,617 Updated Jun 18, 2024

[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"

Python 324 39 Updated Aug 25, 2023
Python 16 3 Updated Apr 9, 2021

This is the starting kit for task-1 of IGLU 2021.

Python 4 1 Updated Jun 1, 2022

The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data"

Python 14 Updated Apr 21, 2023

Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023

Python 251 20 Updated Dec 15, 2023

ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

C 508 103 Updated Feb 5, 2026

OpenAI Codex demo using Minecraft GameTest API

TypeScript 134 22 Updated Sep 25, 2023

Toolkit for creating, sharing and using natural language prompts.

Python 3,007 379 Updated Oct 23, 2023

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Jupyter Notebook 1,399 194 Updated Jan 30, 2026

Knowledge-Aware RL agents with Commonsense Reasoning

Inform 7 79 23 Updated Mar 4, 2022

This plugin displays your tex source in a textarea so plugins like grammarly can check it.

TypeScript 519 30 Updated May 18, 2023

Truth-Conditional Captions for Time Series Data. EMNLP 2021. Harsh Jhamtani, Taylor Berg-Kirkpatrick

Python 13 1 Updated Feb 9, 2022

Automatic metrics for GEM tasks

Python 68 20 Updated Oct 25, 2022
Next