Creating and Configuring Agents¶
An agent is the core building block of SpeakNode. It defines the personality, behavior, voice, and capabilities of your voice AI assistant.
Creating a New Agent¶
- Navigate to Agents in the sidebar.
- Click + Create Agent.
- Enter a name for your agent (e.g., "Customer Support", "Appointment Scheduler").
- Click Create.
You will be taken to the agent configuration page.
Configuring the System Prompt¶
The system prompt defines your agent's personality, goals, and behavior rules. This is the most important configuration for your agent.
- In the System Prompt field, write instructions for your agent.
- Use markdown headings to structure your prompt (e.g.,
# Personality,# Goal,# Guardrails).
Keep it concise
Voice agents work best with focused prompts under 2000 tokens. See the Prompting Guide for detailed advice.
Dynamic variables
You can use {{ variable_name }} syntax in your system prompt. These variables are resolved at runtime. See Dynamic Variables for details.
Selecting the LLM Model¶
The LLM model powers the reasoning and response generation of your agent.
- Open the LLM section.
- Select a model from the dropdown (e.g., GPT-4o, GPT-4o Mini, Claude Sonnet).
- Adjust the Temperature slider:
- Lower values (0.0 - 0.3): More deterministic, consistent responses. Good for data collection and structured tasks.
- Medium values (0.3 - 0.7): Balanced creativity and consistency. Good for customer support.
- Higher values (0.7 - 1.0): More creative, varied responses. Good for casual conversation.
Model selection
Refer to Best Practices for a comparison table of models and their recommended use cases.
Adding Languages and First Messages¶
Each agent can support multiple languages. For each language, you configure a greeting message and a TTS voice.
- Open the Languages section.
- Click + Add Language.
- Select a language from the dropdown (e.g., English, Spanish, German).
- Enter the First Message -- this is what the agent says when a conversation starts.
- Repeat for each language you want to support.
Note
The first language in the list is used as the default. You can reorder languages by dragging them.
Configuring TTS Voice per Language¶
For each language, you select a text-to-speech voice and configure its settings.
- Click on a language entry to expand its settings.
- Select a Voice from the available voices for that language.
- Adjust Speed (0.5x - 2.0x). A speed of 0.9x - 1.1x is recommended for natural conversation.
- Optionally select a Background Sound (e.g., office ambience, call center) to make the conversation feel more natural.
Tip
Preview voices before selecting them. Different voices suit different agent personalities -- a warm voice for customer support, a professional voice for scheduling.
Configuring the STT Model¶
The Speech-to-Text (STT) model determines how your agent transcribes the caller's speech.
- Open the STT section.
- Select an STT model from the dropdown.
- Configure language detection settings if your agent supports multiple languages.
Configuring Voice Activity Detection (VAD)¶
VAD controls how the agent detects when the caller starts and stops speaking.
- Open the VAD section.
- Adjust the settings:
- Turn Timeout: How long to wait after the caller stops speaking before the agent responds. Use 5-10 seconds for support, 10-15 seconds for data collection.
- Silence Threshold: The silence duration that signals the end of a turn.
Warning
Setting the turn timeout too low can cause the agent to interrupt the caller. Setting it too high can make the agent feel unresponsive.
Adding Tools¶
Tools let your agent perform actions during a conversation, such as looking up data or booking appointments.
- Open the Tools section.
- Click + Add Tool to attach an existing tool, or create a new one.
- Toggle System Tools on or off as needed (e.g., end call, transfer).
- Configure Transfer Rules if you want the agent to transfer calls to another agent under certain conditions.
See Tools & Webhook Tools for a full guide on creating and configuring tools.
Publishing the Agent¶
Once your agent is fully configured, you need to publish it to make it available for phone calls, the web widget, and the API.
- Review all your configuration sections.
- Click the Publish button in the top-right corner.
- The agent is now live and can receive calls or be embedded on your website.
Warning
Unpublished changes are saved as a draft. Callers and widget users will continue to interact with the last published version until you publish again.
Note
After publishing, you can access the agent's test page, embed code, and public test URL. See Testing Your Agent and Embedding the Widget.
Summary¶
| Step | Section | Description |
|---|---|---|
| 1 | System Prompt | Define personality, goals, and rules |
| 2 | LLM | Select model and temperature |
| 3 | Languages | Add languages with first messages |
| 4 | TTS | Configure voice, speed, and background sound per language |
| 5 | STT | Select speech-to-text model |
| 6 | VAD | Configure turn detection settings |
| 7 | Tools | Attach tools and transfer rules |
| 8 | Publish | Make the agent live |