1. Assessment

  • Review provided failed job inputs
  • Retrieve detailed AWS Backup job metadata
  • Inspect related backup resource context
  • Analyze failure messages and service dependencies
  • Determine root cause per failed job

2. Summary

  • Summarize root causes and fix options
1 Credits

Investigate Failed AWS Backup Jobs and Summarize Root Causes

Overview

Investigate failed AWS Backup backup, copy, and restore jobs using the failed job details supplied by a user or an upstream workflow step. This plan resolves partial job inputs where possible, retrieves detailed AWS Backup metadata, inspects related resources and vault context, classifies likely failure causes, and produces advisory remediation options without making configuration changes.

Execution Details

1. Assessment

  • Review provided failed job inputs
    Parses complete or partial failed-job records into a normalized investigation inventory, preserving lookup clues such as account, Region, job type, job ID, resource ARN, vault names, timestamps, and failure text.

  • Retrieve detailed AWS Backup job metadata
    Uses the normalized inventory to resolve exact backup, copy, or restore job records and collect status, timestamps, state messages, related identifiers, and unresolved or ambiguous matches.

  • Inspect related backup resource context
    Reviews protected resources, backup plans, vaults, recovery points, destination details, restore targets, and source or destination context tied to each failed job.

  • Analyze failure messages and service dependencies
    Classifies probable failure categories such as permissions, resource state, encryption key access, lifecycle conflicts, unsupported configuration, quota limits, destination availability, or dependency service issues.

  • Determine root cause per failed job
    Correlates job metadata, resource context, and dependency analysis into evidence-based root-cause findings with confidence and uncertainty notes.

2. Summary

  • Summarize root causes and fix options
    Produces a clear per-job or per-pattern summary that lists affected jobs, likely root causes, supporting evidence, confidence level, and recommended remediation options.