Skip to content
@GAIR-NLP

SII - Generative Artificial Intelligence Research Lab (GAIR)

GAIR is part of SII, focusing on Generative Artificial Intelligence Research, with joint effort from SJTU.

Pinned Loading

  1. factool factool Public

    FacTool: Factuality Detection in Generative AI

    Python 890 69

Repositories

Showing 10 of 52 repositories
  • ReasonEval Public

    [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy

    GAIR-NLP/ReasonEval’s past year of commit activity
    Python 69 4 1 0 Updated Oct 9, 2025
  • LIMI Public

    LIMI: Less is More for Agency

    GAIR-NLP/LIMI’s past year of commit activity
    Python 137 6 4 0 Updated Oct 8, 2025
  • InnovatorBench Public

    A benchmark for LLMs on complicated long-horizon tasks

    GAIR-NLP/InnovatorBench’s past year of commit activity
    Jupyter Notebook 3 Apache-2.0 0 0 0 Updated Oct 7, 2025
  • DatasetResearch Public

    DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery

    GAIR-NLP/DatasetResearch’s past year of commit activity
    Python 18 Apache-2.0 0 0 0 Updated Sep 24, 2025
  • SII-CLI Public
    GAIR-NLP/SII-CLI’s past year of commit activity
    4 0 0 0 Updated Sep 22, 2025
  • AgencyBench Public
    GAIR-NLP/AgencyBench’s past year of commit activity
    Python 6 0 0 0 Updated Sep 22, 2025
  • ResearcherBench Public

    ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry

    GAIR-NLP/ResearcherBench’s past year of commit activity
    Python 33 3 2 1 Updated Sep 22, 2025
  • GAIR-NLP/WindowsAgentArena-V2’s past year of commit activity
    Python 1 MIT 0 0 0 Updated Sep 9, 2025
  • PC-Agent-E Public

    Efficient Agent Training for Computer Use

    GAIR-NLP/PC-Agent-E’s past year of commit activity
    Python 131 MIT 5 0 0 Updated Sep 5, 2025
  • ASI-Arch Public

    AlphaGo Moment for Model Architecture Discovery.

    GAIR-NLP/ASI-Arch’s past year of commit activity
    Python 1,093 Apache-2.0 214 9 0 Updated Aug 4, 2025