Popular repositories
- EmergentTTS-Eval-public Public
[NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.
- RPBench-Auto Public
An automated pipeline for evaluating LLMs for role-playing.
Repositories
- sgl-omni-readme Public Forked from sgl-project/sglang-omni
SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
- Gym-boson Public Forked from NVIDIA-NeMo/Gym
Evaluate and improve models and agents using environments
- ProactBench Public
Code for ProactBench: Beyond What The User Asked For - measuring conversational proactivity in multi-turn LLM dialogues.
- tau2-bench Public Forked from sierra-research/tau2-bench
τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
- hackathon-m3-api-card-public Public
- EmergentTTS-Eval-public Public
[NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.
- hackathon-msac-public Public
People
This organization has no public members. You must be a member to see who’s a part of this organization.