About Smart Working
At Smart Working Solutions, we believe your job should not only look right on paper but also feel right every day. This isn’t just another remote opportunity - it’s about finding where you truly belong, no matter where you are. From day one, you’re welcomed into a genuine community that values your growth and well-being.
Our mission is simple: to break down geographic barriers and connect skilled professionals with outstanding global teams and products for full-time, long-term roles. We help you discover meaningful work with teams that invest in your success, where you’re empowered to grow personally and professionally.
Join one of the highest-rated workplaces on Glassdoor and experience what it means to thrive in a truly remote-first world.
About the Role
We are looking for a Senior DevOps Engineer with strong cloud infrastructure expertise (GCP, AWS, or Azure), hands-on Terraform experience, and deep ownership of MongoDB Atlas to design, operate, and scale the cloud-native infrastructure powering a large enterprise SaaS platform.
This is a high-ownership, architecture-level role, not just execution. You will be responsible for designing and running production systems end-to-end, with a particular focus on database infrastructure (MongoDB Atlas) and scalable cloud environments.
You will work in a fully remote, async-first environment, collaborating closely with engineering teams to ensure high availability, performance, and operational excellence across multiple environments.
Responsibilities
- Design, implement, and operate cloud-native infrastructure across GCP, AWS, or Azure using Terraform.
- Take full ownership of MongoDB Atlas in production, including:
  - Cluster architecture and scaling
  - Replication and high availability
  - Backup and disaster recovery strategies
  - Performance tuning and query optimisation
  - Security and access control
- Architect and manage containerised and serverless workloads (e.g., Cloud Run, ECS, Kubernetes, or equivalents).
- Design and operate event-driven systems (e.g., Pub/Sub, SQS/SNS, EventBridge, or equivalents).
- Build and maintain CI/CD pipelines with a strong focus on automation, reliability, and scalability.
- Develop reusable Infrastructure as Code (Terraform) modules and manage multi-environment setups.
- Collaborate with engineering teams on system architecture, scalability, and performance optimisation.
- Implement robust monitoring, alerting, and observability across distributed systems.
- Lead incident response and root cause analysis, driving long-term improvements.
- Own infrastructure decisions end-to-end, including architecture, cost optimisation, and performance.
- Document systems, create runbooks, and establish best practices.
- Mentor engineers and promote DevOps best practices across the organisation.
Requirements
- 6+ years of DevOps / Infrastructure Engineering experience in production environments.
- Strong hands-on experience with at least one major cloud provider (GCP, AWS, or Azure), provisioned with Terraform.
- Advanced experience with Terraform (modularisation, remote state, multi-environment setups).
- Proven experience designing and operating scalable cloud infrastructure.
- Mandatory: Deep MongoDB Atlas experience in production, including:
  - Cluster configuration and scaling
  - Replication and failover
  - Backup and recovery strategies
  - Performance tuning and indexing
  - Security and access management
- Experience with containerised environments (Docker, Kubernetes, or equivalents).
- Experience building and maintaining CI/CD pipelines.
- Solid understanding of event-driven architectures.
- Strong knowledge of monitoring, logging, and observability in distributed systems.
- Ability to operate at an architect/owner level, not just execute tasks.
- Strong communication skills and ability to work in a remote, async-first team.
Nice to Have
- Experience working across multiple cloud providers.
- Experience implementing GitOps practices.
- Familiarity with advanced observability tooling (e.g., Datadog, APM, distributed tracing).
- Experience supporting high-scale SaaS platforms.
- Interest in platform engineering and developer experience.
Be a Smart Worker - valued, empowered, and part of a culture that celebrates integrity, excellence, and ambition.