src.core.error_monitoring module#
Marcus Error Monitoring and Correlation System.
Provides comprehensive error tracking, pattern analysis, and correlation capabilities for autonomous agent environments.
- class src.core.error_monitoring.AlertSeverity[source]#
Bases:
EnumAlert severity levels.
- INFO = 'info'#
- WARNING = 'warning'#
- ERROR = 'error'#
- CRITICAL = 'critical'#
- class src.core.error_monitoring.ErrorMetrics[source]#
Bases:
objectError metrics for monitoring.
- __init__(total_errors=0, errors_by_type=<factory>, errors_by_severity=<factory>, errors_by_category=<factory>, errors_by_agent=<factory>, errors_by_operation=<factory>, retryable_errors=0, critical_errors=0, error_rate_per_minute=0.0, last_updated=<factory>)#
- Parameters:
- Return type:
None
- class src.core.error_monitoring.ErrorPattern[source]#
Bases:
objectDetected error pattern.
- severity: ErrorSeverity = 'medium'#
- __init__(pattern_id, pattern_type, description, frequency, first_seen, last_seen, affected_agents=<factory>, affected_operations=<factory>, severity=ErrorSeverity.MEDIUM, sample_errors=<factory>)#
- class src.core.error_monitoring.CorrelationGroup[source]#
Bases:
objectGroup of correlated errors.
- __init__(group_id, correlation_key, errors=<factory>, start_time=<factory>, end_time=None, pattern=None, root_cause=None)#
- class src.core.error_monitoring.ErrorMonitor[source]#
Bases:
objectComprehensive error monitoring system.
Tracks error patterns, provides real-time metrics, and enables proactive issue detection for autonomous agents.
- __init__(storage_path='logs/error_monitoring.json', metrics_window_minutes=60, pattern_detection_enabled=True, correlation_timeout_minutes=30)[source]#
- metrics_history: List[ErrorMetrics]#
- detected_patterns: Dict[str, ErrorPattern]#
- correlation_groups: Dict[str, CorrelationGroup]#
- alert_callbacks: List[Callable[[ErrorPattern], None]]#
- record_error(error)[source]#
Record an error for monitoring and analysis.
- Return type:
- Parameters:
error (MarcusBaseError)
- add_alert_callback(callback)[source]#
Add callback for pattern alerts.
- Return type:
- Parameters:
callback (Callable[[ErrorPattern], None])
- get_metrics_history(hours=24)[source]#
Get metrics history for specified hours.
- Return type:
- Parameters:
hours (int)
- get_detected_patterns(active_only=True)[source]#
Get detected error patterns.
- Return type:
- Parameters:
active_only (bool)
- get_correlation_groups(active_only=True)[source]#
Get error correlation groups.
- Return type:
- Parameters:
active_only (bool)
- src.core.error_monitoring.setup_error_monitoring(storage_path='logs/error_monitoring.json', enable_patterns=True, alert_callback=None)[source]#
Set up global error monitoring.
- Return type:
- Parameters:
storage_path (str)
enable_patterns (bool)
alert_callback (Callable[[ErrorPattern], None] | None)
- src.core.error_monitoring.record_error_for_monitoring(error)[source]#
Record error in global monitor.
- Return type:
- Parameters:
error (MarcusBaseError)