Dominic Sun (DominicSlw)

CS@CMU 25

5 followers · 21 following

CMU
dominicslw.github.io

Starred repositories

allenai / OLMo-core

PyTorch building blocks for the OLMo ecosystem

Python 1,131 224 Updated Apr 9, 2026

zzachw / llemr

NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records

Jupyter Notebook 59 5 Updated Sep 10, 2025

awslabs / graphrag-toolkit

Python toolkit for building graph-enhanced GenAI applications

Python 381 83 Updated Apr 9, 2026

FreedomIntelligence / Chain-of-Diagnosis

An interpretable large language model (LLM) for medical diagnosis.

Python 161 7 Updated Sep 12, 2024

JarvisUSTC / DoctorAgent-RL

DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue

Python 77 7 Updated Jan 23, 2026

openai / simple-evals

Python 4,435 480 Updated Jul 31, 2025

yale-nlp / MCTS-RAG

Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"

Python 114 18 Updated Nov 4, 2025

UCSC-VLAA / MedReason

MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs

Python 263 23 Updated Jun 19, 2025

octotools / octotools

OctoTools: An agentic framework with extensible tools for complex reasoning

Python 1,437 186 Updated Apr 6, 2026

kicarussays / MedRep

Python 8 Updated Aug 26, 2025

som-shahlab / hf_ehr

Training HuggingFace models on EHR data

Jupyter Notebook 44 11 Updated Nov 2, 2025

stair-lab / kg-gen

[NeurIPS '25] Knowledge Graph Generation from Any Text

Python 1,094 161 Updated Mar 24, 2026

DDVD233 / CLIMB

Python 72 10 Updated Jul 30, 2025

UCSC-VLAA / m1

[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models

Jupyter Notebook 48 4 Updated Dec 21, 2025

GAIR-NLP / DeepResearcher

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 725 49 Updated Oct 15, 2025

zilliztech / deep-searcher

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,749 750 Updated Nov 19, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,831 367 Updated Apr 6, 2026

lqtrung1998 / mwp_ReFT

Python 553 65 Updated Jan 2, 2025

lapisrocks / LanguageAgentTreeSearch

[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Python 824 87 Updated Jul 30, 2024

ncbi-nlp / MedCalc-Bench

[NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations

Python 88 18 Updated Dec 18, 2025

mims-harvard / TxAgent

TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools

Python 613 95 Updated Jul 30, 2025

gersteinlab / MedAgentsBench

MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning

Jupyter Notebook 79 9 Updated Mar 10, 2026

yiqingxyq / DocLens

Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)

Python 22 3 Updated May 18, 2024

RUCAIBox / R1-Searcher

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 707 46 Updated Aug 5, 2025

FreedomIntelligence / HuatuoGPT-o1

Medical o1, Towards medical complex reasoning with LLMs

Python 1,299 130 Updated Jan 20, 2025

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,740 370 Updated Apr 9, 2026

pat-jj / DeepRetrieval

[COLM’25] DeepRetrieval — 🔥 Training Search Agent by RLVR with Retrieval Outcome

Python 705 85 Updated Oct 12, 2025

bowang-lab / MedRAX

MedRAX: Medical Reasoning Agent for Chest X-ray - ICML 2025

Python 1,123 198 Updated Oct 31, 2025

matthewchung74 / qwen_2_5_3B_GRPO_medical_thinking

Jupyter Notebook 49 8 Updated Apr 21, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,038 1,582 Updated Feb 27, 2026
