DevOps Engineer, Trilogy (Remote)

Crossover
1 hour ago
Full-time
Remote
Pakistan
$100,000 USD yearly

Description:

You're the engineer who maintains uptime for 50+ SaaS products while others are still guessing. We need DevOps engineers capable of diving into unknown AWS environments, stabilizing disorder, and driving availability beyond 99.9% through genuine monitoring, automation, and root cause analysis. You'll break down complex projects into single-day tasks, deliver production-quality Python or JavaScript, and leverage AI as your assistant.

Most organizations talk about "cloud" while hand-holding servers. We're scaling reliability across dozens of acquired products where the original teams have departed and the documentation is incomplete. That's the challenge: you'll apply agents and current tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate the fixes so the same outage can't recur. Rather than judging you on certifications and vendor names, we'll observe how you troubleshoot in real time, produce an actual 5-Whys that identifies one preventable root cause, and create automations that endure production conditions.

This is not a tier-two "execute the runbook" position. In this capacity, you author the runbooks, architect the deployment from development to staging to 10% to full release with soak periods and rollback criteria, and implement the monitoring that detects corner cases. You block risky changes before anyone deploys. You distinguish infrastructure failures you own from application bugs Engineering owns, and you route permanent remediation to the correct team.

You'll operate at the engineering center of reliability, managing infrastructure initiatives, incident response and root cause analyses, and change requests with copy-paste-executable documentation. If you've already managed a substantial SaaS product and want to expand that expertise across a portfolio, join us. Bring expert-level AWS knowledge, production-grade development skills, strict scope discipline, and daily, essential use of AI tooling. If you're prepared to maintain operational excellence, please apply.

What You Will Be Doing

  • Executing complex infrastructure migrations, consolidations, production-quality automations, and monitoring changes
  • Diagnosing production failures, deploying immediate remediations, and authoring root cause analyses with permanent solutions assigned to the accountable teams
  • Authoring, reviewing, and deploying changes in production environments, including assessing whether a proposed change is safe for execution

What You Won’t Be Doing

  • Spending your time in Jira and perpetual status updates - we value individuals who can execute solutions, not merely document problems
  • Sustaining legacy systems forever - you'll be authorized to pursue substantive improvements
  • Waiting on bureaucratic approval processes - you'll possess the authority to deploy immediate remediations to resolve incidents

DevOps Engineer Key Responsibilities

  • Lead reliability and standardization of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices.

Basic Requirements

  • Deep AWS infrastructure expertise (this is our primary platform - other cloud experience alone won't cut it)
  • Experience managing production infrastructure at a scale of hundreds of containers
  • Experience scripting with Python and Bash for day-to-day administrative operations
  • Experience managing and migrating production databases across multiple engines (including MySQL, PostgreSQL, Oracle, and Microsoft SQL Server)
  • Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)