Monitor Agent System

Monitor Agents are external voice AI agents from platforms like VAPI or ElevenLabs that are integrated into Evalion purely for observability purposes. Unlike testing agents, monitor agents cannot initiate conversations but allow you to track and evaluate real conversations from external platforms.

Core Workflow

1. Agent Creation

When creating a monitor agent, Evalion performs the following steps:

Validates Credentials: Verifies the provided API key with the external platform
Fetches Agent Details: Retrieves agent information (name) from the platform API
Stores Configuration: Saves the external agent ID and API credentials securely
Links to Project: Associates the monitor agent with your Evalion project for organization

2. Historical Data Import

Monitor agents support importing past conversations from the external platform:

Preview Mode

Fetches calls from the external platform without importing
Allows you to review conversation metadata before importing
Useful for understanding data volume and quality

Import Mode

Downloads historical calls from the external platform
Includes duplicate detection (idempotency based on external call ID)
Supports date range filtering for targeted imports

Key Features:

Automatic deduplication prevents importing the same conversation twice
Batch processing for efficient large-scale imports
Preserves original timestamps and metadata from the external platform

3. Real-time Monitoring

Once configured with webhooks, monitor agents automatically process new conversations:

Webhook Reception: External platform sends event notifications for new calls
Data Processing: Evalion receives and validates the conversation data
Automatic Evaluation: Applies your configured metrics to evaluate the conversation

Supported Integration Platforms

Evalion supports multiple integration types:

1. VAPI

Platform: Voice AI platform focused on telephony and voice interactions

Capabilities:

Full transcript access with speaker labels
Detailed cost breakdown per call
Recording URLs for audio playback
Message-level analysis and timestamps
Call summary and analysis data

Data Includes:

Transcript with timing information
Cost metrics (per-minute, total)
Message array with role and content
Recording URLs
Call metadata (duration, status, outcome)

2. ElevenLabs

Platform: Conversational AI platform with advanced voice synthesis

Capabilities:

Conversation summaries and metadata
Agent configuration details
Call ratings and quality metrics
Conversation direction (inbound/outbound)

Data Includes:

Call duration and timing
Agent details and configuration
Rating information
Conversation summary
Direction (inbound/outbound)

Note: ElevenLabs uses a conversation-based structure rather than VAPI's call-based structure.

3. Custom

Platform: Custom data import and bulk audio upload

Purpose: Import custom conversation data or bulk upload audio recordings for evaluation

Capabilities:

Import custom conversation data from any source
Bulk upload audio recordings for analysis
Automatically creates or associates with an agent behind the scenes
Flexible data structure to accommodate various conversation formats

Use Cases:

Migrating historical conversation data from legacy systems
Importing recordings from call centers or custom telephony solutions
Batch processing of audio files for evaluation
Integrating with platforms not natively supported

Data Requirements:

Audio files (for bulk upload scenarios)
Conversation metadata (timestamps, participants, etc.)
Optional transcript data if available

4. Evalion

Platform: Internal Evalion agents

Purpose: Agents created natively within Evalion for testing purposes

Characteristics:

Not monitor-only (used for active testing)
Don't require external API integration
Skip import processing (data already exists internally)
Full control over agent behavior and configuration

Monitor Agents vs Testing Agents

Aspect	Monitor Agents	Testing Agents
Purpose	Observe external conversations	Execute test scenarios
Creation	Linked from external platforms	Created in Evalion
Conversation Initiation	Cannot initiate	Can initiate via test suites
Data Source	Real user interactions	Simulated test cases
Evaluation	Real-time and historical	On-demand testing
Use Case	Production monitoring	Pre-deployment validation

Best Practices

Security

Store API keys securely and rotate them regularly
Use read-only API keys when possible
Restrict access to monitor agents based on team roles

Data Management

Import historical data in batches to avoid rate limits
Use date range filters to import relevant time periods
Monitor storage usage for high-volume integrations

Evaluation

Apply consistent metrics across testing and monitoring agents
Set up alerts for critical metric thresholds
Review conversation samples regularly to validate metric accuracy

Limitations

External monitor agents cannot be used in test suites for active testing
They are read-only and cannot modify agent behavior on external platforms
Historical import is limited by the external platform's API rate limits and data retention policies

Monitor Agent System

Core Workflow

1. Agent Creation

2. Historical Data Import

Preview Mode

Import Mode

3. Real-time Monitoring

Supported Integration Platforms

1. VAPI

2. ElevenLabs

3. Custom

4. Evalion

Monitor Agents vs Testing Agents

Best Practices

Security

Data Management

Evaluation

Limitations

Next Steps