Overview
This section provides network performance data for Aviatrix cloud networking infrastructure: throughput characteristics, latency measurements, and connection scaling limits across deployment scenarios and cloud platforms.
Throughput Performance
Inter-Region Performance
AWS Regions
| Source Region | Destination Region | Distance | Typical Latency | Max Throughput | Optimal Instance |
|---|---|---|---|---|---|
| us-east-1 | us-west-2 | ~3,000 km | 65-75 ms | 10 Gbps | c5n.4xlarge |
| us-east-1 | eu-west-1 | ~5,500 km | 80-90 ms | 8 Gbps | c5n.4xlarge |
| us-east-1 | ap-southeast-1 | ~17,000 km | 180-200 ms | 5 Gbps | c5n.2xlarge |
| eu-west-1 | ap-southeast-1 | ~11,000 km | 160-180 ms | 6 Gbps | c5n.2xlarge |
| us-west-2 | ap-northeast-1 | ~8,000 km | 120-140 ms | 7 Gbps | c5n.4xlarge |
| eu-central-1 | us-east-1 | ~6,000 km | 90-100 ms | 8 Gbps | c5n.4xlarge |
Same-Region Performance
Availability Zone Performance
| Cloud Provider | Same-AZ Latency | Cross-AZ Latency | Cross-Region Latency | Backbone Bandwidth |
|---|---|---|---|---|
| AWS | < 1 ms | 1-2 ms | Variable | Up to 100 Gbps |
| Azure | < 1 ms | 1-3 ms | Variable | Up to 100 Gbps |
| GCP | < 1 ms | 1-2 ms | Variable | Up to 100 Gbps |
- Same Availability Zone: Sub-millisecond latency, full instance bandwidth
- Different Availability Zones: Low single-digit millisecond latency
- Cross-Region: Varies based on geographic distance and network path
Instance-to-Instance Performance
| Instance Class | Network Performance | Baseline Bandwidth | Burst Bandwidth | Duration |
|---|---|---|---|---|
| Burstable | Up to 5 Gbps | 200 Mbps | 5 Gbps | 30 min |
| General Purpose | Up to 25 Gbps | 1 Gbps | 10 Gbps | Sustained |
| Compute Optimized | Up to 50 Gbps | 10 Gbps | 25 Gbps | Sustained |
| Network Optimized | Up to 100 Gbps | 25 Gbps | 100 Gbps | Sustained |
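
The baseline and burst columns above imply a two-phase transfer for burstable instances: data moves at the burst rate until the burst window ends, then at the baseline rate. The following is a minimal sketch of that arithmetic, assuming the burst window behaves as a simple timer (providers may meter bursts differently, for example with credit-based schemes) and that "Sustained" means the burst rate has no time limit. The function name `transfer_seconds` is hypothetical.

```python
from typing import Optional


def transfer_seconds(data_gbit: float,
                     baseline_gbps: float,
                     burst_gbps: float,
                     burst_seconds: Optional[float] = None) -> float:
    """Estimate the time needed to move data_gbit gigabits.

    burst_seconds=None models a "Sustained" row: the burst rate never expires.
    Assumption: the burst window is a plain timer, not a credit balance.
    """
    if burst_seconds is None:
        return data_gbit / burst_gbps
    burst_capacity_gbit = burst_gbps * burst_seconds
    if data_gbit <= burst_capacity_gbit:
        # The whole transfer fits inside the burst window.
        return data_gbit / burst_gbps
    # Remainder is sent at the baseline rate after the burst window closes.
    return burst_seconds + (data_gbit - burst_capacity_gbit) / baseline_gbps


# Burstable row from the table: 200 Mbps baseline, 5 Gbps burst for 30 minutes.
seconds = transfer_seconds(10_000, baseline_gbps=0.2, burst_gbps=5, burst_seconds=30 * 60)
print(f"10,000 Gbit on a Burstable instance: about {seconds / 60:.0f} minutes")
```

Under these assumptions, a transfer that outlasts the burst window spends most of its time at the baseline rate, which is why burstable instances suit short, intermittent transfers.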
Latency Characteristics
Network Latency Factors
Geographic Latency
Distance-Based Latency (One-Way)
| Distance Range | Typical Latency | Speed of Light Limit | Real-World Factor |
|---|---|---|---|
| 0-500 km | 2-8 ms | 1.7 ms | 1.2-4.7x |
| 500-1,500 km | 8-20 ms | 5-10 ms | 1.6-2.0x |
| 1,500-5,000 km | 20-60 ms | 10-33 ms | 1.8-2.0x |
| 5,000-10,000 km | 60-120 ms | 33-67 ms | 1.8x |
| 10,000+ km | 120-250 ms | 67+ ms | 1.8-3.7x |

Factors Affecting Real-World Latency:
- Fiber optic cable routing (not straight-line distance)
- Number of network hops and routing equipment
- Network congestion and quality of service
- Terrestrial vs. submarine cable characteristics
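
The Real-World Factor column in the table above appears to be the ratio of typical latency to the speed-of-light limit for the same row. The sketch below reproduces that ratio from the table's own numbers; the function name `real_world_factor` is hypothetical, and pairing the low and high ends of each range is an assumption about how the column was derived.

```python
from typing import Tuple


def real_world_factor(typical_ms: Tuple[float, float],
                      limit_ms: Tuple[float, float]) -> Tuple[float, float]:
    """Return (min, max) ratio of typical latency to the speed-of-light limit.

    Each argument is a (low, high) range in milliseconds. The low ends and the
    high ends are paired with each other (an assumption, see lead-in).
    """
    low_typical, high_typical = typical_ms
    low_limit, high_limit = limit_ms
    ratios = (low_typical / low_limit, high_typical / high_limit)
    return min(ratios), max(ratios)


# 0-500 km row: typical 2-8 ms against a 1.7 ms limit.
low, high = real_world_factor((2, 8), (1.7, 1.7))
print(f"{low:.1f}x-{high:.1f}x")  # prints "1.2x-4.7x"
```

A factor well above 2x on a short path usually points at hop count or congestion rather than distance, since the propagation component is small.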
Connection Scaling
Connection Limits by Gateway Type
Transit Gateway Connections
| Instance Size | Concurrent Connections | New Connections/sec | BGP Sessions | Route Capacity |
|---|---|---|---|---|
| Small | 50,000 | 2,500 | 100 | 10,000 |
| Medium | 125,000 | 6,250 | 250 | 25,000 |
| Large | 250,000 | 12,500 | 500 | 50,000 |
| XLarge | 500,000 | 25,000 | 1,000 | 100,000 |
| XXLarge | 1,000,000 | 50,000 | 2,000 | 200,000 |
- East-West Traffic: 60-70% of total connections
- North-South Traffic: 20-30% of total connections
- Management Traffic: 5-10% of total connections
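
The transit gateway limits above can be turned into a simple sizing check: pick the smallest instance size whose limits cover the expected load. This is a sketch only, not an Aviatrix tool. The names `TransitSize`, `TRANSIT_SIZES`, and `smallest_fit` are hypothetical, and the default `headroom` of 0.8 is an assumption borrowed from the 80% utilization threshold used elsewhere in this section.

```python
from typing import NamedTuple, Optional


class TransitSize(NamedTuple):
    name: str
    concurrent: int      # concurrent connections
    new_per_sec: int     # new connections per second
    bgp_sessions: int
    routes: int


# Values copied from the Transit Gateway Connections table.
TRANSIT_SIZES = [
    TransitSize("Small", 50_000, 2_500, 100, 10_000),
    TransitSize("Medium", 125_000, 6_250, 250, 25_000),
    TransitSize("Large", 250_000, 12_500, 500, 50_000),
    TransitSize("XLarge", 500_000, 25_000, 1_000, 100_000),
    TransitSize("XXLarge", 1_000_000, 50_000, 2_000, 200_000),
]


def smallest_fit(concurrent: int, new_per_sec: int, bgp_sessions: int = 0,
                 routes: int = 0, headroom: float = 0.8) -> Optional[str]:
    """Return the smallest size that keeps every metric within headroom.

    headroom=0.8 means the load may use at most 80% of each limit (assumption).
    Returns None when no listed size is large enough.
    """
    for size in TRANSIT_SIZES:  # ordered from smallest to largest
        if (concurrent <= size.concurrent * headroom
                and new_per_sec <= size.new_per_sec * headroom
                and bgp_sessions <= size.bgp_sessions * headroom
                and routes <= size.routes * headroom):
            return size.name
    return None


print(smallest_fit(concurrent=90_000, new_per_sec=4_000))  # prints "Medium"
```

Checking all four limits together matters because a deployment with few connections but many BGP sessions can still require a larger size.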
Spoke Gateway Connections
| Instance Size | Concurrent Connections | New Connections/sec | Application Flows | Session Capacity |
|---|---|---|---|---|
| Small | 10,000 | 500 | 1,000 | 5,000 |
| Medium | 25,000 | 1,250 | 2,500 | 12,500 |
| Large | 50,000 | 2,500 | 5,000 | 25,000 |
| XLarge | 100,000 | 5,000 | 10,000 | 50,000 |
| XXLarge | 200,000 | 10,000 | 20,000 | 100,000 |
- Database Connections: Long-lived, low bandwidth
- Web Service Connections: Short-lived, medium bandwidth
- File Transfer Connections: Medium-lived, high bandwidth
- API Connections: Very short-lived, low bandwidth
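
Connection lifetime determines which spoke gateway limit is reached first. By Little's law, steady-state concurrent connections equal the arrival rate multiplied by the average connection lifetime. In the table above, concurrent connections divided by new connections per second is 20 for every size, so workloads whose connections live longer than about 20 seconds on average would hit the concurrent limit before the rate limit. The sketch below illustrates this; the function names and the example lifetimes are hypothetical.

```python
def concurrent_at_steady_state(new_per_sec: float, avg_lifetime_sec: float) -> float:
    """Little's law: connections in flight = arrival rate x average lifetime."""
    return new_per_sec * avg_lifetime_sec


def max_sustainable_rate(concurrent_limit: float, rate_limit: float,
                         avg_lifetime_sec: float) -> float:
    """Highest new-connection rate that stays within both gateway limits."""
    return min(rate_limit, concurrent_limit / avg_lifetime_sec)


# Small spoke gateway from the table: 10,000 concurrent, 500 new connections/sec.
# Lifetimes below are illustrative assumptions, not measured values.
for label, lifetime in [("API (very short-lived)", 2), ("Database (long-lived)", 600)]:
    rate = max_sustainable_rate(10_000, 500, lifetime)
    print(f"{label}: up to {rate:.1f} new connections/sec")
```

Long-lived database connections are therefore bounded by the connection table, while short-lived API connections are bounded by the establishment rate.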
Edge Gateway Connections
| Gateway Type | User Connections | Site Connections | VPN Tunnels | Firewall Sessions |
|---|---|---|---|---|
| UserVPN Small | 100 | N/A | 100 | 10,000 |
| UserVPN Large | 2,000 | N/A | 2,000 | 200,000 |
| Site2Site Small | N/A | 25 | 25 | 25,000 |
| Site2Site Large | N/A | 200 | 200 | 200,000 |
| Hybrid | 1,000 | 100 | 100 | 100,000 |
- UserVPN: High session count, moderate bandwidth per user
- Site-to-Site: Low session count, high bandwidth per tunnel
- Hybrid Deployments: Balanced approach for mixed requirements
Performance Monitoring
Key Performance Indicators
Throughput Metrics
Primary Metrics
- Bits per second (bps): Raw data transfer rate
- Packets per second (pps): Packet processing rate
- Connections per second (cps): Connection establishment rate
- Utilization percentage: Resource usage levels
Utilization Zones
- Green Zone: < 60% of maximum capacity
- Yellow Zone: 60-80% of maximum capacity
- Red Zone: 80-95% of maximum capacity
- Critical Zone: > 95% of maximum capacity
Alert Thresholds
- Warning alerts at 70% utilization
- Critical alerts at 85% utilization
- Automated scaling triggers at 80% utilization
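
The zone and alert thresholds above can be expressed as a small classifier. This is a sketch under stated assumptions: the function name `classify` is hypothetical, "at N%" is read as "greater than or equal to N%", boundary values of exactly 60% and 80% fall in the Yellow Zone, and the Critical Zone takes precedence over the Red Zone above 95%.

```python
from typing import Optional, Tuple


def classify(utilization_pct: float) -> Tuple[str, Optional[str], bool]:
    """Map a utilization percentage to (zone, alert level, scale-out flag)."""
    if not 0 <= utilization_pct <= 100:
        raise ValueError("utilization must be between 0 and 100")

    # Zones: check the highest band first so Critical wins over Red.
    if utilization_pct > 95:
        zone = "critical"
    elif utilization_pct > 80:
        zone = "red"
    elif utilization_pct >= 60:
        zone = "yellow"
    else:
        zone = "green"

    # Alerts: warning at 70%, critical at 85%.
    if utilization_pct >= 85:
        alert = "critical"
    elif utilization_pct >= 70:
        alert = "warning"
    else:
        alert = None

    # Automated scaling triggers at 80%.
    scale_out = utilization_pct >= 80
    return zone, alert, scale_out


print(classify(82))  # prints ('red', 'warning', True)
```

Note that the scaling trigger (80%) fires before the critical alert (85%), so under these thresholds a critical alert indicates that scaling has not kept up with load.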
Performance Testing Methodology
Test Scenarios
Baseline Performance Tests
Single Stream Tests
- Purpose: Measure maximum single-connection performance
- Tools: iperf3, netperf, custom applications
- Duration: 60-300 seconds for statistical significance
- Metrics: Throughput, latency, CPU utilization
Multi-Stream Tests
- Purpose: Measure aggregate performance across multiple connections
- Configuration: 1, 2, 4, 8, 16, 32 concurrent streams
- Analysis: Scaling efficiency and resource utilization
- Optimization: Connection balancing and resource allocation
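
Scaling efficiency for the stream counts listed above can be computed by comparing the aggregate throughput at N streams with N times the single-stream result. The sketch below uses that common definition, which is an assumption since the source does not define the term; the function name `scaling_efficiency` and the sample measurements are hypothetical. With iperf3, the stream count is set with the `-P` (parallel) option.

```python
from typing import Dict


def scaling_efficiency(aggregate_gbps: Dict[int, float]) -> Dict[int, float]:
    """Return efficiency per stream count relative to ideal linear scaling.

    aggregate_gbps maps stream count -> measured aggregate throughput and
    must include an entry for 1 stream, which serves as the reference.
    """
    if 1 not in aggregate_gbps:
        raise ValueError("a single-stream measurement is required")
    single = aggregate_gbps[1]
    return {n: total / (n * single) for n, total in sorted(aggregate_gbps.items())}


# Hypothetical measurements in Gbps, for illustration only.
measured = {1: 2.0, 2: 4.0, 4: 6.0}
for streams, efficiency in scaling_efficiency(measured).items():
    print(f"{streams} streams: {efficiency:.0%} of linear scaling")
```

An efficiency that drops sharply at a particular stream count suggests a shared bottleneck such as CPU or the network interface, rather than a per-connection limit.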
Real-World Simulation
Application Mix Testing
- Web Traffic: HTTP/HTTPS with varying object sizes
- Database Traffic: SQL queries and transaction patterns
- File Transfer: Large file uploads and downloads
- Real-time Applications: VoIP and video streaming
Load Pattern Testing
- Peak Hour Traffic: Maximum expected load
- Normal Operating Conditions: Typical daily patterns
- Burst Traffic: Sudden traffic spikes
- Failure Scenarios: Failover and recovery testing
Stress Testing
Connection Stress Tests
- Maximum connection establishment rate
- Connection table exhaustion scenarios
- Rapid connection cycling patterns
- Resource cleanup and garbage collection
Throughput Stress Tests
- Maximum sustained throughput
- Network interface saturation
- Buffer overflow conditions
- Quality of service under load
Network performance data is collected under controlled laboratory conditions
and validated through real-world deployments. Actual performance may vary
based on network topology, traffic patterns, and environmental factors.