Chat on WhatsApp
Crossover logo

AWS Architect, Trilogy (Remote)

Crossover
3 hours ago
Full-time
Remote
Pakistan

Description:

You're the engineer who maintains uptime across 50+ SaaS products when nobody else knows where to start. We need DevOps professionals capable of stepping into unknown AWS environments, restoring order from disorder, and driving availability beyond 99.9% through genuine monitoring, automation, and root-cause analysis. You'll break down complex projects into single-day tasks, deliver production-ready Python or JavaScript, and leverage AI as your assistant.

Most organizations talk about "the cloud" while manually nursing servers. We're systematizing reliability across dozens of acquired products whose original creators have left and whose documentation is incomplete. That's the challenge: you'll harness agents and current tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate solutions so identical incidents never recur. Rather than judging you on certifications and vendor badges, we'll observe you troubleshoot in real time, compose an actual 5-Whys that identifies one preventable root cause, and create automations that hold up under production stress.

This is not an L2 "follow the script" position. In this role, you author the scripts, architect the deployment from dev to staged to 10% to full release with soak periods and rollback conditions, and construct the monitors that detect corner cases. You reject risky changes before anyone executes them. You distinguish infrastructure failures you control from application bugs Engineering controls, and you route permanent remediation to the correct team.

You'll operate at the engineering center of reliability, taking ownership of infrastructure initiatives, incident triage and RCAs, and change tickets with copy-paste-ready runbooks. If you've already managed a significant SaaS offering and want to expand that expertise across a portfolio, come forward. Bring expert-grade AWS knowledge, production-quality coding ability, strict scope discipline, and daily, mission-critical use of AI tooling. If you're prepared to maintain uptime, please apply.

What You Will Be Doing

  • Advanced infrastructure migrations, consolidations, production-quality automations, and monitoring updates
  • Investigating production incidents, deploying immediate remediation, and documenting root cause analyses with permanent fixes routed to the accountable teams
  • Authoring, reviewing, and performing production changes, including assessing whether a proposed change is safe to deploy

What You Won’t Be Doing

  • Spending your time in Jira and interminable status calls - we value individuals who can deliver solutions, not simply monitor issues
  • Supporting legacy systems forever - you'll be authorized to pursue substantial improvements
  • Waiting on bureaucratic approval pipelines - you'll hold the authority to implement immediate fixes during incidents

AWS Architect Key Responsibilities

  • Lead reliability and standardization of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices.

Basic Requirements

  • Deep AWS infrastructure expertise (this is our primary platform - other cloud experience alone won't cut it)
  • Experience managing production infrastructure at a scale of hundreds of containers
  • Experience scripting with Python and Bash for day-to-day administration operations
  • Experience managing and migrating production databases with multiple engines (including MySql, Postgres, Oracle, MS-SQL)
  • Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)