-
Princeton University
Stars
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
A massively parallel, high-level programming language
Code for the paper "LASER: LLM Agent with State-Space Exploration for Web Navigation"
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Can Language Models Solve Olympiad Programming?
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
A large collection of example and demo Soar agents for a variety of domains and problems.
High accuracy RAG for answering questions from scientific documents with citations
A high-throughput and memory-efficient inference and serving engine for LLMs
The official Python library for the OpenAI API
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
SWE-bench: Can Language Models Resolve Real-world Github Issues?
A simple way to manage and store the data related to all your research papers!
A guidance language for controlling large language models.
The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
[ACL 2024 Findings] Referral Augmentation for Zero-Shot Information Retrieval https://arxiv.org/abs/2305.15098
List of language agents based on paper "Cognitive Architectures for Language Agents"
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models