Some jobs are SLA-bound and must complete within the SLA. If the SLA is missed, what action should be taken?
💡 Model Answer
When a job is bound by an SLA, the primary goal is to ensure it completes within the agreed time. First, implement monitoring that tracks each job's start, progress, and completion timestamps against the SLA threshold. If a job is approaching its SLA deadline, trigger an escalation: alert the operations team, log the event, and optionally pause or throttle lower-priority jobs to free resources. If the SLA is missed, the response depends on the business impact:
1) Notify stakeholders via automated communication (email, Slack).
2) Initiate a rollback or compensation process if the job produced critical data that may now be partial or stale.
3) Analyze the root cause: resource contention, code inefficiencies, or external dependencies.
4) Update the job's scheduling parameters or add resilience (e.g., retry logic, parallel execution).
5) Document the incident and refine SLAs or capacity planning.
The key is a proactive monitoring pipeline, clear escalation paths, and post-mortem analysis to prevent recurrence.
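To make the monitoring and escalation idea concrete, here is a minimal Python sketch of classifying a job run against its SLA. Everything in it is an assumption for illustration: the `JobRun` fields, the `SlaStatus` names, the 80% warning threshold (`warn_fraction`), and the action labels returned by `actions_for` are hypothetical and not tied to any particular scheduler or alerting tool.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from enum import Enum
from typing import List, Optional


class SlaStatus(Enum):
    ON_TRACK = "on_track"
    AT_RISK = "at_risk"   # still running, but close to the deadline
    MISSED = "missed"     # deadline passed, or finished late
    MET = "met"           # finished on or before the deadline


@dataclass
class JobRun:
    # Hypothetical record of one job execution.
    name: str
    started_at: datetime
    sla: timedelta
    finished_at: Optional[datetime] = None


def sla_status(job: JobRun, now: datetime, warn_fraction: float = 0.8) -> SlaStatus:
    """Compare a job's timestamps against its SLA deadline."""
    deadline = job.started_at + job.sla
    if job.finished_at is not None:
        return SlaStatus.MET if job.finished_at <= deadline else SlaStatus.MISSED
    if now > deadline:
        return SlaStatus.MISSED
    # Escalate early once an assumed fraction of the SLA window has elapsed.
    if now - job.started_at >= job.sla * warn_fraction:
        return SlaStatus.AT_RISK
    return SlaStatus.ON_TRACK


def actions_for(status: SlaStatus) -> List[str]:
    """Map a status to illustrative action labels (placeholders, not real APIs)."""
    if status is SlaStatus.AT_RISK:
        return ["alert_ops", "log_event", "throttle_low_priority_jobs"]
    if status is SlaStatus.MISSED:
        return ["notify_stakeholders", "start_compensation", "open_postmortem"]
    return []


if __name__ == "__main__":
    start = datetime(2024, 1, 1, 0, 0)
    job = JobRun("nightly_export", start, sla=timedelta(hours=2))
    status = sla_status(job, now=start + timedelta(minutes=100))
    print(status.value, actions_for(status))
```

In practice a scheduler or a periodic check would call something like `sla_status` for every running job and hand the resulting actions to whatever alerting and orchestration tools the team already uses. The early-warning fraction is a tuning choice: a lower value gives more time to react but produces more alerts.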
This answer was generated by AI for study purposes. Use it as a starting point — personalize it with your own experience.