Amazon Nova Sonic Call Center Agent w/ Tools

By Reilly Manton (rcmanton@amazon.com); Shuto Araki (shuaraki@amazon.com); Andrew Young (ajuny@amazon.com)

This template provides an AWS cloud-based solution for deploying applications that interact with the Amazon Nova Sonic Model. It serves as a foundation for developing future speech-to-speech tooling use cases. Unlike previous implementations that required locally hosted backend and frontend, this cloud architecture leverages:

Frontend: Hosted on Amazon CloudFront and S3
Backend: Deployed on Amazon ECS
Connection: Websocket communication through Network Load Balancer (NLB)
Authentication: Integrated Amazon Cognito authentication

The sample application demonstrates Amazon Nova Sonic model interactions in a customer support context. The model acts as AnyTelco's call center agent Telly and responds to the user in real time. It has two tools at its disposal to augment its knowledge with data:

Customer information lookup via phone number
Knowledge base search for AnyTelco company information such as plan features and pricing

Note: This is a sample application. For production, please modify the application to align with your security standards.

Architecture

Speech-to-Speech Conversation Flow

The user signs onto the frontend hosted on Amazon Cloudfront with a static S3 web page. If the user is unauthenticated, they are re-directed to the Amazon Cognito sign on page where they can sign on with their credentials.
The user clicks start session to open the websocket connection to the python backend. The connect payload contains the JWT which is validated against cognito by the python backend before connection is established.
Speech data is transmitted bidirectionally through this connection for real-time conversation. The user speaks and audio from the user is sent to the Nova Sonic model through the python backend.
Nova Sonic processes the audio. It first outputs a transcription of the user audion. It then does one of two things:
1. Outputs a response which is streamed back to the user. This response includes the assistant response audio and assistant response text.
2. Outputs a tool use request which is picked up and implemented by the Python backend. The backend returns the tool result to Nova Sonic which generates a final response which is streamed back to the user. This response includes the assistant response audio and assistant response text.

Getting started

Prerequisites

The versions below are tested and validated. Minor version differences would likely be acceptable.

Python 3.12
Node.js v20
Node Package Manager (npm) v10.8
Docker v27.4 (if your build environment is x86, make sure your docker environment is configured to be able to build Arm64 container images)
AWS Account (make sure your account is bootstrapped for CDK)
Amazon Nova Sonic is enabled via the Bedrock model access console
Chrome, Safari, or Edge browser environment (Firefox is currently not supported)
Working microphone and speakers

Deployment

[Optional] Update the environment variables to point to your Amazon Dynamodb table and your Bedrock Knowledge Base

You can skip this section if you don't intend to use these tools or build your own with a simple interaction with Amazon Nova Sonic.

Copy template.env to a new file .env and update the KNOWLEDGE_BASE_ID and DYNAMODB_TABLE_NAME to your knowledge base ID and your table name. For table structure, the tool expects phone_number (S) as the primary key (assuming telecom call center use case) and you can add any other keys you want. (e.g., "plan", "current_bill", etc.) Ask about those attributes in the chat to confirm the profile search tool is working. Knowledge base can be loaded with your own contact center guideline texts as needed.

We may add a separate construct to create those resources through CDK if there is enough demand. Let us know by raising an issue.

If you want to bring your own VPC rather than the solution deploying a new VPC for you, specify your VPC ID in VPC_ID.

Ensure you are deploying to aws region us-east-1 since this is the only region that currently supports Amazon Nova Sonic model in Amazon Bedrock.
Run the deployment script to deploy two stacks: Network and S2S. Make sure both stacks get deployed.

./deploy.sh

You should see an output like this:

Outputs:
S2SStack-dev.BackendUrl = https://{cloudfront_distribution}/api
S2SStack-dev.CognitoAppClientId = random_string
S2SStack-dev.CognitoDomain = domain_with_prefix_you_specified_in_cdk_json
S2SStack-dev.CognitoUserPoolId = random_string
S2SStack-dev.FrontendUrl = https://{cloudfront_distribution}
S2SStack-dev.NlbUrl = nlb_endpoint_url
Stack ARN:
arn:aws:cloudformation:us-east-1:123456789101:stack/S2SStack-dev/{stack_id}

✨  Total time: 307.96s

Create a Cognito user in your console and access the frontend URL in your browser to get started.

To create a Cognito user in CLI, use these commands:

Create a User Use the admin-create-user command to create a user in the Cognito User Pool. Replace placeholders with your actual values.

aws cognito-idp admin-create-user \
  --user-pool-id YOUR_USER_POOL_ID \
  --username USERNAME \
  --user-attributes Name=email,Value=USER_EMAIL \
  --temporary-password TEMPORARY_PASSWORD \
  --region YOUR_AWS_REGION

YOUR_USER_POOL_ID: The ID of your Cognito User Pool.
USERNAME: The desired username for the user.
USER_EMAIL: The email address of the user.
TEMPORARY_PASSWORD: A temporary password for the user.
YOUR_AWS_REGION: Your AWS region (e.g., us-east-1).

Log in and change password Click on the frontend URL with the username and temporary password you just created. You will be asked to change the password when you first log in.

Usage

Click "Start Session" to begin
Speak into your microphone to interact with the application. You are acting as the customer and the solution acts as the call center agent.
The chat history will automatically update with the discussion transcript and the assistant audio will play through your speakers.

Customization:

Prompting

You can change the system prompt from the UI.

Tooling

Tooling for Amazon Nova Sonic is implemented using the Model Context Protocol (MCP) in the backend Python application. Amazon Nova Sonic outputs text indicating it wants to use a tool, the MCP server processes the tool call, and the response is returned back to the model for use in generation.

To add a new tool:

Define the tool using MCP decorators in backend/tools.py.

Tools are defined using the @mcp_server.tool() decorator with explicit type annotations. The MCP server automatically generates the tool specifications that get sent to Amazon Nova Sonic. Give your tool a name, description, and specify the input parameters with types. The model uses the description to know when to use the tool and the parameter descriptions to understand the input format.

In our example there are two tools, knowledge base lookup and user profile search. To add your own tool, follow this pattern:

@mcp_server.tool(
    name="lookup",
    description="Runs query against a knowledge base to retrieve information."
)
async def lookup_tool(
    query: Annotated[str, Field(description="the query to search")]
) -> dict:
    """Look up information in the knowledge base"""
    try:
        results = knowledge_base_lookup.main(query)
        return results
    except Exception as e:
        return {"status": "error", "error": str(e)}

Implement the tool logic in a separate Python file. You can create a new Python file to implement the tool functionality, like how backend/retrieve_user_profile.py implements the user lookup tool and backend/knowledge_base_lookup.py implements the knowledge base search tool. These examples show how you can interact with AWS resources via these tools to retrieve real information for the model.
Import your tool implementation in backend/tools.py. The tool function should import and call your implementation module:

import knowledge_base_lookup

# Then call it in your tool function:
results = knowledge_base_lookup.main(query)

The MCP server handles converting your tool definitions into the proper format for Amazon Nova Sonic and automatically processes tool calls during conversations.

Local development

Assume credentials for an AWS account with Amazon Nova Sonic enabled in Amazon Bedrock and export: AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY and AWS_SESSION_TOKEN.

Make sure your .env file is in the backend folder.

Run npm run dev in the same shell session as above to start frontend and backend containers. Both use a file watching mechanism to be notified of local code changes and reload automatically. Re-run the npm command only if changes are made to the Dockerfile, Python libraries or NPM dependencies that require installation, as these are not picked up by the file watcher.

The frontend is accessible at http://localhost:5173/ and the backend at http://localhost:8080/, with authentication disabled for both.

FAQ/trouble shooting

I get ERROR: process "/bin/sh -c chmod +x entrypoint.sh" did not complete successfully: exit code: 255 during build time.

Your docker environment in x86 may not be configured properly. You may need to change the FROM statement in the backend Dockerfile.

ARG TARGETARCH=arm64
FROM --platform=$TARGETARCH python:3.12

npm run dev hangs and the backend container does not exit. I get the error docker: Error response from daemon: driver failed programming external connectivity on endpoint s2s-backend-dev when I try to run the command again.

Run docker rm -f s2s-backend-dev to remove the running container image and run npm run dev again.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
backend		backend
cdk		cdk
config		config
diagrams		diagrams
frontend		frontend
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
bancolombia_prompt.txt		bancolombia_prompt.txt
deploy.sh		deploy.sh
package-lock.json		package-lock.json
package.json		package.json
temp.santi_notes.sh		temp.santi_notes.sh
template.env		template.env

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Amazon Nova Sonic Call Center Agent w/ Tools

Table of Contents

Architecture

Speech-to-Speech Conversation Flow

Getting started

Prerequisites

Deployment

Usage

Customization:

Prompting

Tooling

Local development

FAQ/trouble shooting

About

Uh oh!

Releases

Packages

Languages

License

san99tiago/sample-sonic-cdk-agent

Folders and files

Latest commit

History

Repository files navigation

Amazon Nova Sonic Call Center Agent w/ Tools

Table of Contents

Architecture

Speech-to-Speech Conversation Flow

Getting started

Prerequisites

Deployment

Usage

Customization:

Prompting

Tooling

Local development

FAQ/trouble shooting

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages