Posted on 2026/02/07
DevOps Engineer with Deep AWS Experience
AllTrue.ai
Vancouver, BC
Full Description
We are a dynamic, well-funded and innovative startup in the security industry, dedicated to making AI secure.
Our cutting-edge product, backed by substantial funding, is set to make a significant impact in the market.
We are aggressively pursuing our goals and are looking for a highly skilled and motivated individual to join our team as a Site Reliability Engineer.
Key Responsibilities:
• Design, build, and maintain scalable AWS infrastructure with a focus on high availability and fault tolerance.
• Design and configure ECS scaling strategies.
• Optimize, monitor, and automate Amazon RDS (PostgreSQL) performance, backups, and failover strategies.
• Implement disaster recovery plans, backup solutions, and system restoration procedures.
• Develop and maintain infrastructure-as-code (IaC) using Terraform or CloudFormation.
• Create monitoring and alerting systems using CloudWatch, Prometheus, Grafana, or Datadog.
• Enhance CI/CD pipelines to improve deployment automation and system resilience.
• Perform incident management, troubleshoot production issues, and conduct post-mortems.
• Collaborate with engineering teams to ensure best practices in application reliability and performance.
• Stay up to date with AWS services and industry best practices to drive continuous improvement.
Qualifications:
• 3+ years of experience in SRE, DevOps, or Cloud Engineering roles.
• Previous experience in a high-scale, production environment.
• Strong expertise in AWS services, particularly EC2, ECS, RDS, S3, IAM, and VPC.
• Knowledge of event-driven architectures using AWS Lambda and SNS/SQS.
• Hands-on experience managing databases in production environments.
• Proficiency in Terraform, CloudFormation, or CDK for infrastructure automation.
• Experience with containerization (Docker, ECS, Kubernetes).
• Solid understanding of Linux systems, networking, and security best practices.
• Proficiency in scripting (Python or Bash) for automation.
• Strong troubleshooting and incident response skills.
• Experience with monitoring and logging tools like CloudWatch, Prometheus, Grafana, or Datadog.
• Experience working for a startup.
What We Offer:
• An exciting and challenging work environment where you can make a real impact.
• Competitive compensation and benefits package.
• Opportunity to make a huge impact on the industry and have proportionately great upside.
• The chance to work with a passionate and talented team on a groundbreaking product.
If you are a highly technical and hands-on professional with a passion for building secure and scalable SaaS solutions, we want to hear from you.
Join us and be a part of our mission to transform the AI journey.
Job Type: Full-time
Pay: $80,000.00-$120,000.00 per year
Benefits:
• Dental care
• Extended health care
• Paid time off
Work Location: In person
