Botao Yu btyu

PhD student at @OSU-NLP-Group.

43 followers · 33 following

The Ohio State University

Stars

ultraworkers / claw-code

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust · 193,846 stars · 109,957 forks · Updated Jun 8, 2026

hkust-nlp / Toolathlon

[ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python · 393 stars · 43 forks · Updated Jun 12, 2026

sparfenyuk / mcp-proxy

A bridge between Streamable HTTP and stdio MCP transports

Python · 2,594 stars · 241 forks · Updated Jun 8, 2026

princeton-pli / hal-harness

Python · 302 stars · 56 forks · Updated Jun 15, 2026

HowieHwong / sde-harness

SDE-Harness (Scientific Discovery Evaluation Framework)

Python · 59 stars · 4 forks · Updated Mar 27, 2026

mahmoudrabie / agentic-ai

Agentic AI research papers, benchmarks, frameworks, and tools curated across 24 domains.

150 stars · 4 forks · Updated Jun 15, 2026

OSU-NLP-Group / Mind2Web-2

[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge

Python · 111 stars · 7 forks · Updated May 17, 2026

mims-harvard / TDC

Therapeutics Commons (TDC): Multimodal Foundation for Therapeutic Science

Jupyter Notebook · 1,255 stars · 216 forks · Updated Jul 13, 2025

yuzhimanhua / Awesome-Scientific-Language-Models

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery (EMNLP'24)

659 stars · 39 forks · Updated Jun 21, 2025

xuhuihuang / ts-dar

TS-DAR identifies transition states of protein conformational changes from MD simulations using hyperspherical embeddings in the latent space.

Jupyter Notebook · 21 stars · 6 forks · Updated May 15, 2026

whitead / synspace

Synthesis generative model

Python · 47 stars · 4 forks · Updated Apr 24, 2025

OSU-NLP-Group / ChemMCP

A Chemistry Toolkit that turns your AI assistant into a Chemistry coscientist.

Python · 65 stars · 12 forks · Updated Jun 9, 2025

ninglab / RLSynC

Python · 9 stars · 1 fork · Updated Feb 27, 2025

OSU-NLP-Group / GUI-Agents-Paper-List

Awesome GUI Agent Paper List

TypeScript · 819 stars · 41 forks · Updated Jun 5, 2026

OSU-NLP-Group / AgentSafety

189 stars · 7 forks · Updated Oct 31, 2025

Future-House / paper-qa

High accuracy RAG for answering questions from scientific documents with citations

Python · 8,708 stars · 882 forks · Updated Jun 11, 2026

grobidOrg / grobid

A machine learning software for extracting information from scholarly documents

Java · 4,939 stars · 557 forks · Updated Jun 14, 2026

OSU-NLP-Group / ScienceAgentBench

[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Python · 140 stars · 20 forks · Updated Apr 29, 2026

idavidrein / gpqa

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Jupyter Notebook · 510 stars · 57 forks · Updated Sep 30, 2024

OpenHands / OpenHands

🙌 OpenHands: AI-Driven Development

Python · 77,202 stars · 9,809 forks · Updated Jun 15, 2026

lmmentel / awesome-python-chemistry

A curated list of Python packages related to chemistry

1,404 stars · 230 forks · Updated Sep 21, 2025

ur-whitelab / chemcrow-public

Chemcrow

Python · 925 stars · 144 forks · Updated Dec 19, 2024

QizhiPei / Awesome-Biomolecule-Language-Cross-Modeling

Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"

258 stars · 18 forks · Updated Mar 5, 2026

OpenBioML / chemnlp

ChemNLP project

Python · 176 stars · 46 forks · Updated Jun 15, 2026

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python · 39,471 stars · 4,789 forks · Updated May 1, 2026

OSU-NLP-Group / SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python · 848 stars · 109 forks · Updated Feb 3, 2025

HICAI-ZJU / Scientific-LLM-Survey

Scientific Large Language Models: A Survey on Biological & Chemical Domains

356 stars · 34 forks · Updated Sep 7, 2025

OSU-NLP-Group / LLM4Chem

Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset"

Python · 113 stars · 19 forks · Updated Jun 9, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python · 16,708 stars · 4,080 forks · Updated Jun 15, 2026

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python · 21,289 stars · 1,835 forks · Updated Mar 5, 2026