Back

Engine Performance Monitoring

Engine Performance Monitoring

Monitoring your engine performance ensures your AI workflows, scheduled tasks, and automated processes run efficiently. GoSiteMe provides a comprehensive monitoring dashboard with real-time metrics, historical trends, and alerting capabilities.

Accessing the Monitoring Dashboard

  1. Go to Dashboard → Engines → Monitoring.
  2. The overview shows all active engines with status indicators (green/yellow/red).
  3. Click any engine to drill into its specific metrics.

Key Metrics

MetricDescriptionHealthy Range
Response TimeAverage time to process a request< 500ms for RAG, < 2s for PLAYBOOK
ThroughputRequests processed per minuteVaries by plan
Error RatePercentage of failed requests< 1%
Queue DepthNumber of pending requests< 50 (sustained)
Token UsageAI tokens consumed per engineWithin plan allocation
Memory UsageRAM consumed by the engine process< 80% of allocation

Real-Time Graphs

The monitoring dashboard displays live graphs for each metric with adjustable time ranges (last hour, 24 hours, 7 days, 30 days). Hover over any data point to see the exact value and timestamp. Use the comparison feature to overlay metrics from different engines or time periods.

Setting Up Alerts

  1. Navigate to Engines → Monitoring → Alerts.
  2. Click Create Alert.
  3. Select the engine, metric, and threshold (e.g., "Error Rate > 5% for 5 minutes").
  4. Choose notification channels: email, Slack, Discord, webhook, or SMS.
  5. Set the severity level: Info, Warning, or Critical.
  6. Click Save.

Log Analysis

Engine logs provide detailed request-level data:

  • Filter logs by engine, time range, status code, or keyword.
  • View individual request traces showing each processing step and its duration.
  • Export logs as CSV or JSON for external analysis tools.
  • Use the search bar to find specific error messages or request IDs.

Optimization Recommendations

The monitoring dashboard includes an Optimize tab that analyzes your engine usage patterns and suggests improvements:

  • Identify underutilized engines that can be downscaled.
  • Detect frequently repeated identical requests that benefit from caching.
  • Highlight slow workflow steps that could be parallelized.
  • Recommend plan upgrades when you are consistently hitting resource limits.

Using Alfred for Monitoring

Ask Alfred for quick insights without opening the dashboard:

  • "Alfred, what is the current error rate for the RAG engine?"
  • "Alfred, show me engine performance for the last 24 hours."
  • "Alfred, why did the PLAYBOOK engine slow down yesterday?"

Was this answer helpful?