Popular repositories Loading
-
gorilla
gorilla PublicForked from ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Python
-
agentdojo
agentdojo PublicForked from ethz-spylab/agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
Python
-
-
tau2-bench
tau2-bench PublicForked from sierra-research/tau2-bench
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
Python
-
openai-cua-sample-app
openai-cua-sample-app PublicForked from openai/openai-cua-sample-app
Learn how to use CUA (our Computer Using Agent) via the API on multiple computer environments.
Python
-
webvoyager
webvoyager PublicForked from MinorJerry/WebVoyager
Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"
Python
Repositories
- agentdojo Public Forked from ethz-spylab/agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
sequrity-ai/agentdojo’s past year of commit activity - OSWorld Public Forked from xlang-ai/OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
sequrity-ai/OSWorld’s past year of commit activity - webvoyager Public Forked from MinorJerry/WebVoyager
Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"
sequrity-ai/webvoyager’s past year of commit activity - openai-cua-sample-app Public Forked from openai/openai-cua-sample-app
Learn how to use CUA (our Computer Using Agent) via the API on multiple computer environments.
sequrity-ai/openai-cua-sample-app’s past year of commit activity - tau2-bench Public Forked from sierra-research/tau2-bench
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
sequrity-ai/tau2-bench’s past year of commit activity - gorilla Public Forked from ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
sequrity-ai/gorilla’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…