AWS Architect/ SRE
Location: NYC, NY / Fort Mill, SC
Role Overview:
We are looking for a hands-on, technically strong Resilience, Testability & Scalability Lead to drive engineering excellence across our data platforms and cloud-based applications. This role is critical in ensuring system uptime, test automation maturity, performance under scale, and architectural resilience to meet stringent regulatory and service-level demands.
The ideal candidate will have a deep background in designing highly available systems, implementing robust disaster recovery, managing scalable cloud infrastructure, and building automated, testable, and observable platforms—especially within AWS and Kubernetes environments.
Key Responsibilities:
Required Skills & Experience:
Infrastructure Resilience & DR:
• Multi-AZ deployments, auto-scaling, load balancing, circuit breakers
• Disaster recovery design: backup/restore, cross-region replication, RTO/RPO
Monitoring & Observability:
• Experience with CloudWatch, Prometheus, log aggregators
• Set up alerting for incident response, latency, throughput, and error rates
Application Resilience & Security:
• Error handling, service degradation, exponential backoff
• Security best practices: IAM policies, encryption at rest/transit
• Familiarity with FINRA/SIPC compliance standards (preferred)
Test Automation & Quality:
• Unit testing (e.g., pytest), integration testing, E2E automation
• Test data generation, synthetic data, environment provisioning
• Performance testing using JMeter, Gatling, stress and capacity testing
• Code reviews, static analysis, data validation, anomaly detection
Scalability & Optimization:
• Horizontal scaling using Kubernetes, Docker, service discovery
• API Gateway, caching layers (Redis, Memcached), DB partitioning
• Connection pooling, capacity planning, cost-aware architecture
Data & Stream Processing:
• Spark cluster management, parallel processing, big data optimization
• Kafka-based messaging, windowing, and aggregation for real-time data
Preferred Qualifications:
• Experience in financial services or regulated environments
• Familiarity with enterprise data and platform modernization initiatives
• AWS or Kubernetes certifications
• Strong communication skills and cross-functional collaboration experience