Week 1
Foundations — SaaS, Products, Jira, Monitoring
Day 1
SaaS vs On-Premises
Infrastructure models, Linux foundation, boot process, partition vs file storage
Outcome: Understand infrastructure types
Day 2
Product Portfolio
Client-product mapping, modules, MPG, OpenConnect, service overview
Outcome: Know the product stack
Day 3
Jira & SLA
Ticket lifecycle, P1–P4 severity, SLA times, escalation path L1→L2→L3
Outcome: Work tickets correctly
Day 4
Monitoring Basics
Infra vs app monitoring, alert types, OK/Warning/Critical classification
Outcome: Understand alerting
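Sample exercise: the OK/Warning/Critical classification can be sketched as a small bash function. This is a minimal illustration, not the team's actual alert rule; the function name classify_metric and the default thresholds (80 and 90) are assumptions chosen for the example.

```shell
#!/usr/bin/env bash
# Classify a percentage metric as OK, Warning, or Critical.
# The default thresholds (80 and 90) are illustrative assumptions.
classify_metric() {
    local value=$1 warn=${2:-80} crit=${3:-90}
    if [ "$value" -ge "$crit" ]; then
        echo "Critical"
    elif [ "$value" -ge "$warn" ]; then
        echo "Warning"
    else
        echo "OK"
    fi
}

classify_metric 72   # prints OK
classify_metric 85   # prints Warning
classify_metric 95   # prints Critical
```

Passing the thresholds as optional arguments keeps the same function usable for metrics that need different limits.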
Day 5
Transaction Flow
API→MQ→DB lifecycle, tracing a transaction, failure investigation points
Outcome: Trace a payment end to end
Reference
SaaS Client & Service Matrix
Multinet and Jazz clients mapped to all active services — RAAST, CMS, IBFT, ACS
Reference: Client service mapping
Week 2
Linux, SQL, DB Tables, S1 Handling
Day 1
Linux Basics
grep, tail -f, vi, top, df -h, free -h — core L2 commands with lab
Outcome: Navigate a Linux server
Day 2
Log Investigation
Log anatomy, 4 error patterns, hands-on investigation with grep on Kali Linux
Outcome: Read and investigate logs
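Sample exercise: a grep-based investigation might look like the sketch below. The log format (date, time, level, message, txn=ID) and the helper names count_errors and trace_txn are hypothetical; real application logs will differ.

```shell
#!/usr/bin/env bash
# Build a small sample log (hypothetical format: date time LEVEL message).
sample_log=$(mktemp)
cat > "$sample_log" <<'EOF'
2024-01-01 10:00:01 INFO  Payment request received txn=1001
2024-01-01 10:00:02 ERROR Connection timeout to database txn=1001
2024-01-01 10:00:05 INFO  Payment request received txn=1002
2024-01-01 10:00:06 ERROR Connection timeout to database txn=1002
2024-01-01 10:00:09 WARN  Retry scheduled txn=1002
EOF

# Count lines containing ERROR.
count_errors() {
    grep -c 'ERROR' "$1"
}

# Show every line for one transaction id, with line numbers.
# -w avoids matching a longer id that merely starts with the same digits.
trace_txn() {
    grep -nw "txn=$2" "$1"
}

count_errors "$sample_log"      # prints 2
trace_txn "$sample_log" 1002    # prints the three lines for txn=1002
```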
Day 3
SQL Basics
SELECT, JOIN, COUNT, GROUP BY — building queries on the fintech DB
Outcome: Query the database
Day 4
Database Tables
MPG and OpenConnect schemas — 9 tables, their purpose, and how to query them
Outcome: Know the DB schema
Day 5
Severity 1 Handling
S1 protocol, bridge rules, outage timeline, communication templates, drill
Outcome: Handle an S1 (P1) incident
Week 3
Bash Scripting & Automation
Day 1 & 2
Bash Scripting Basics
Variables, loops, if/else, echo, read — first scripts with full lab
Outcome: Write bash scripts
Day 3 & 4
Log Parsing & Cron
grep flags, awk, pipes, crontab — automated log monitoring with scheduling
Outcome: Automate log checks
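Sample exercise: awk, pipes, and cron can be combined roughly as follows. The sketch assumes the log level is the third whitespace-separated field; the function names, the default threshold of 5, and the paths in the crontab comment are all hypothetical.

```shell
#!/usr/bin/env bash
# Count log lines per level, assuming the level is the third field.
count_by_level() {
    awk '{ counts[$3]++ } END { for (level in counts) print level, counts[level] }' "$1" | sort
}

# Return non-zero when the ERROR count reaches a threshold (default 5, an assumption).
check_errors() {
    local file=$1 limit=${2:-5} errors
    errors=$(grep -c 'ERROR' "$file")
    if [ "$errors" -ge "$limit" ]; then
        echo "ALERT: $errors errors in $file"
        return 1
    fi
    echo "OK: $errors errors in $file"
}

# Demo on a throwaway log file.
demo_log=$(mktemp)
printf '%s\n' \
    '2024-01-01 10:00:01 INFO started' \
    '2024-01-01 10:00:02 ERROR db timeout' \
    '2024-01-01 10:00:03 ERROR db timeout' > "$demo_log"
count_by_level "$demo_log"
check_errors "$demo_log" 5

# Example crontab entry (hypothetical paths) to run a check script every 5 minutes:
# */5 * * * * /opt/scripts/check_errors.sh >> /var/log/check_errors.out 2>&1
```

Returning a non-zero status on alert lets a wrapper script or scheduler react to the result without parsing the output text.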
Day 5
Health Check Script
6 scripts: disk, memory, CPU, and log error checks, combined in one runbook
Outcome: Monitoring automation
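Sample exercise: one of the health checks, the disk check, could be sketched like this. It is not one of the course's six scripts; the function names and the default 80% threshold are assumptions. It relies on df -P, the POSIX output format, so that the usage percentage is always the fifth column.

```shell
#!/usr/bin/env bash
# Print the used-space percentage (digits only) for a mount point.
disk_usage_pct() {
    df -P "$1" | awk 'NR == 2 { gsub("%", "", $5); print $5 }'
}

# Report OK or ALERT for one mount point against a threshold
# (default 80, an illustrative assumption).
check_disk() {
    local mount=$1 limit=${2:-80} used
    used=$(disk_usage_pct "$mount")
    if [ "$used" -ge "$limit" ]; then
        echo "ALERT disk $mount ${used}%"
    else
        echo "OK disk $mount ${used}%"
    fi
}

check_disk /
```

Memory, CPU, and log checks can follow the same shape (measure, compare, print one status line), which makes the combined runbook output easy to scan.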
Day 7 & 8
Backup Script & Mini Project
tar, gzip, restore, service monitor, L2 daily runbook — full automation
Outcome: Backup handling & L2 readiness
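Sample exercise: a backup and restore round trip with tar and gzip might look like the sketch below. The function names, directory names, and timestamped archive naming are assumptions for illustration, and the demo uses throwaway directories only.

```shell
#!/usr/bin/env bash
# Create a compressed archive of a directory and print the archive path.
backup_dir() {
    local src=$1 dest=$2 archive
    archive="$dest/$(basename "$src")_$(date +%Y%m%d_%H%M%S).tar.gz"
    tar -czf "$archive" -C "$(dirname "$src")" "$(basename "$src")" && echo "$archive"
}

# Restore an archive into a target directory (created if missing).
restore_archive() {
    mkdir -p "$2" && tar -xzf "$1" -C "$2"
}

# Demo with throwaway directories.
work=$(mktemp -d)
mkdir -p "$work/app_logs" "$work/backups"
echo "sample entry" > "$work/app_logs/app.log"
archive=$(backup_dir "$work/app_logs" "$work/backups")
restore_archive "$archive" "$work/restore"
cat "$work/restore/app_logs/app.log"   # prints: sample entry
```

Using -C with the parent directory stores relative paths in the archive, so a restore lands under the chosen target rather than at the original absolute path.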
Week 4
Payment Products — RAAST, CMS, ACS, IBFT, POS
Day 1 & 2
OpenConnect RAAST & CMS
PACS messages, callback handling, card authorization lifecycle, decline codes
Outcome: Understand RAAST & card lifecycle
Day 3 & 4
Open ACS & IBFT/BillPay
3D Secure flow, OTP, IBFT lifecycle, 1LINK routing, UBPS bill payments
Outcome: Identify ACS & transfer issues
Day 5
Acquiring & POS
POS to switch flow, 4 players, authorization vs settlement, decline codes
Outcome: Explain acquiring flow
Week 5
Automation, Alerts & AI
Day 1 & 2
Bash Basics & Log Automation
Variables, loops, cron, grep monitoring — 7 scripts for automated log checks
Outcome: Automate log detection
Day 3 & 4
Slack & WhatsApp Alerts
Webhook integration, Twilio API, full setup guide — test alerts from terminal
Outcome: Configure real-time alerts
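Sample exercise: a terminal test alert to a Slack incoming webhook can be sketched as below. Slack incoming webhooks accept a JSON body with a "text" field; everything else here is an assumption for illustration, including the function names, the SLACK_WEBHOOK_URL variable, and the host name app01. The send function is defined but not called, since it needs a real webhook URL.

```shell
#!/usr/bin/env bash
# Build a minimal Slack incoming-webhook payload: {"text":"..."}.
# Escapes backslashes and double quotes; does not handle newlines.
build_payload() {
    local text
    text=$(printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')
    printf '{"text":"%s"}' "$text"
}

# POST the payload to the webhook URL held in SLACK_WEBHOOK_URL
# (hypothetical variable name; set it to your own webhook first).
send_slack_alert() {
    curl -sS -X POST -H 'Content-Type: application/json' \
        --data "$(build_payload "$1")" "$SLACK_WEBHOOK_URL"
}

build_payload 'Critical: disk at 95% on app01'
```

Keeping payload construction separate from the curl call makes the message format checkable offline before any alert is actually sent.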
Day 5
AI Log Analysis
Log summarization, pattern detection, 3 ready prompts, RCA generation
Outcome: Generate quick RCA with AI
Week 6
Incident Drills & RCA
Day 1 & 2
DB Failure & MQ Stuck
Database outage protocol, queue monitoring, safe restart order — full lab
Outcome: Handle DB outage & queue issues
Day 3 & 4
High CPU & Timeout RCA
Process analysis with top, latency chain, DB query analysis, timeout types
Outcome: Identify heavy process & delays
Day 5
Callback Issue RCA
Retry logic, status mismatch, dead letter queue, full investigation lab
Outcome: Resolve callback mismatch
Week 6.5
Monitoring & Observability
Day 1
Monitoring Basics
Metrics vs logs vs traces, 3 pillars of observability, alert levels, dashboard reading
Outcome: Understand observability
Day 2
System Metrics
CPU, Memory, Disk, Network — thresholds, commands, resource spike detection lab
Outcome: Resource troubleshooting
Day 3
Application Monitoring
API latency, error rate, throughput, health check endpoints, find-the-slow-API lab
Outcome: App health checks
Day 5
Alerts & Alertmanager
Alert rules, severity, routing, grouping, silencing, and Alertmanager flow
Outcome: Configure alerts and routing
Day 6
Logs Monitoring
Central logs, search patterns, grep, and hands-on RCA from logs
Outcome: Find root cause using logs
Day 7
Incident Correlation
Correlating alerts across systems, deduplication, and root-cause linking
Outcome: Correlate incidents for faster RCA
Day 8
Monitoring Project
End-to-end monitoring project: dashboards, alerts, runbooks, and handoff
Outcome: Deliver a monitoring implementation