Overview
A 2019 data migration left 15 million user accounts with missing spending limits records — silently allowing unlimited transactions against BSP regulations. This project designed the remediation system.
Architecture
- Multi-threaded batch processor with cache invalidation strategy for zero-disruption database patching
- Concurrent Kubernetes processing: 3 pods + 3 cron jobs running off-peak within strict time windows to avoid production impact
- Cache invalidation designed to prevent stale limit data from being served during the patching window
Key Contributions
- Architected the end-to-end remediation approach with no service downtime
- Designed execution schedule to work within BSP-mandated time windows
- Coordinated with infrastructure team for Kubernetes pod scaling and monitoring
Tech
Java Spring Boot Kubernetes CronJobs Multi-threading Distributed Caching AWS Alibaba Cloud