Platform & MLOps · Now hiring
Senior AWS Engineer
Remote (US / UK) · Occasional client travelFull-time6+ years
Architect and operate the AWS foundations our clients run their ML and AI workloads on — from VPC design and IAM boundaries through SageMaker, EKS, and event-driven data pipelines. You will set the standard for how regulated institutions deploy AI on AWS.
What you'll do
- Design multi-account AWS landing zones (Control Tower, Organizations, SCPs) for clients in banking, insurance, and healthcare.
- Build production-grade ML platforms on AWS — SageMaker, Bedrock, EKS, Step Functions, EventBridge — with full CI/CD and infrastructure-as-code.
- Own networking, IAM, and KMS architecture: VPC peering, Transit Gateway, PrivateLink, fine-grained IAM, customer-managed keys, and cross-account access.
- Codify everything in Terraform or AWS CDK; review pull requests against security and cost guardrails.
- Establish observability across the stack — CloudWatch, OpenTelemetry, Prometheus/Grafana — with alerting tied to SLOs.
- Partner with our data science and MRM consultants to harden notebooks, training pipelines, and inference endpoints for production and audit.
- Coach client engineering teams; leave them stronger than you found them.
What we're looking for
- 6+ years of hands-on AWS engineering experience, including 3+ years designing production systems end-to-end.
- Deep fluency with core AWS services: VPC, IAM, KMS, S3, EC2, EKS, Lambda, RDS/Aurora, SageMaker, Step Functions, EventBridge.
- Strong infrastructure-as-code skills with Terraform (preferred) or AWS CDK; comfortable building reusable modules and enforcing policy-as-code.
- Proficient in Python and at least one of TypeScript or Go; comfortable in Bash and the AWS CLI.
- Solid grasp of container and Kubernetes operations on EKS — Helm, ArgoCD/Flux, Karpenter, network policies.
- Experience operating CI/CD pipelines (GitHub Actions, GitLab, or CodePipeline) with automated testing, security scanning, and progressive delivery.
- Working knowledge of compliance frameworks that touch AWS environments — SOC 2, HIPAA, PCI-DSS, FedRAMP, or SR 26-2 model risk controls.
- Excellent written communication; you can explain a trade-off to an engineer, a CISO, and a regulator on the same day.
Nice to have
- AWS Certified Solutions Architect — Professional, DevOps Engineer — Professional, or Security — Specialty.
- Experience deploying LLM and RAG systems on Bedrock, SageMaker JumpStart, or self-hosted on EKS with GPU/Inferentia.
- Background supporting model validation, audit, or regulatory examinations.
- Open-source contributions to Terraform providers, Kubernetes operators, or AWS tooling.
Why Forli
- Senior-only practice — no layers between you and the work.
- Engagements that ship: our deliverables run in production at the institutions that hire us.
- Real ownership of architecture decisions and the discretion to push back when the right answer is harder.
- Competitive base, performance bonus, certification budget, and conference stipend.
Sound like you?
Send a short note and your resume — no cover letter gymnastics required.