Skip to content
#

trades

Here are 28 public repositories matching this topic...

Multimodal evaluation benchmark for AI agents in real-world field operations across 16 trades (HVAC, electrical, plumbing, roofing, solar, mining, oil & gas, marine, telecom, automotive, construction, and more). 194 cases; scores retrieval, code citation, jurisdiction, safety, trajectory, multi-turn, speed; 5-layer contamination defense.

  • Updated Apr 19, 2026
  • Python

Improve this page

Add a description, image, and links to the trades topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the trades topic, visit your repo's landing page and select "manage topics."

Learn more