Description:
You're the engineer who maintains uptime for 50+ SaaS products while others are still guessing. We need DevOps engineers capable of diving into unknown AWS environments, stabilizing disorder, and driving availability beyond 99.9% through genuine monitoring, automation, and root cause analysis. You'll break down complex projects into single-day tasks, deliver production-quality Python or JavaScript, and leverage AI as your assistant.
Most organizations talk about "cloud" while still tending servers by hand. We're scaling reliability across dozens of acquired products where the original teams have departed and the documentation is incomplete. That's the challenge: you'll apply agents and current tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate the fixes so the same outage can't recur. Rather than judging you on certifications and vendor names, we'll observe how you troubleshoot in real time, produce an actual 5-Whys that identifies one preventable root cause, and create automations that endure production conditions.
This is not a tier-two "execute the runbook" position. In this role, you author the runbooks, architect the deployment path from development to staging to a 10% rollout to full release, with soak periods and rollback criteria, and implement the monitoring that detects corner cases. You block risky changes before anyone deploys them. You distinguish infrastructure failures you own from application bugs Engineering owns, and you route permanent remediation to the correct team.
You'll operate at the engineering center of reliability, managing infrastructure initiatives, incident response and root cause analyses, and change requests with copy-paste-executable documentation. If you've already managed a substantial SaaS product and want to expand that expertise across a portfolio, join us. Bring expert-level AWS knowledge, production-grade development skills, strict scope discipline, and daily, essential use of AI tooling. If you're prepared to maintain operational excellence, please apply.
What You Will Be Doing
What You Won’t Be Doing
DevOps Engineer Key Responsibilities
Basic Requirements