Transforms complex documents like PDFs into LLM-ready Markdown/JSON for your agentic workflows.
LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows
Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, …
Design engineering for Claude Code. Craft, memory, and enforcement for consistent UI.
Differentiable, Hardware Accelerated, Molecular Dynamics
Stanford NLP Python library for Representation Finetuning (ReFT)
Main repository for the Modular Autonomous Discovery for Science (MADSci) Framework
CausalPFN: Amortized Causal Effect Estimation via In-Context Learning
[NeurIPS 2025 Spotlight 🔥] Official implementation of "UniSite: The First Cross-Structure Dataset and Learning Framework for End-to-End Ligand Binding Site Detection"
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
💫 Toolkit to help you get started with Spec-Driven Development
Low-latency AI engine for mobile devices & wearables
Training Sparse Autoencoders on Language Models
A generalist algorithm for cellular segmentation with human-in-the-loop capabilities
Segment Anything for Microscopy
Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
Hypernetworks that adapt LLMs to specific benchmark tasks using only a textual task description as input
Inference algorithms for models based on Luce's choice axiom
PaCMAP: Large-scale Dimension Reduction Technique Preserving Both Global and Local Structure
AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.
verl-agent is an extension of veRL for training LLM/VLM agents via RL; it is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"
verl: Volcano Engine Reinforcement Learning for LLMs
[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)
Best practices & guides on how to write distributed PyTorch training code
Making large AI models cheaper, faster and more accessible
SGLang is a high-performance serving framework for large language models and multimodal models.
Proteina is a large-scale flow-based protein backbone generator that uses hierarchical fold class labels for conditioning and relies on a tailored, scalable transformer architecture.
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).