< More Jobs

Posted on 2026/01/17

AI Devops Engineer - In person interview(Only GC and Citizen, Gc-EAD)

Canvendor

Westlake Village, CA, United States

Contractor

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)
  • 5+ years of experience in a DevOps or Site Reliability Engineering (SRE) role
  • 1+ year(s) of experience with AI services & LLMs
  • Extensive hands-on experience with Amazon Web Services (AWS)
  • Solid understanding of Windows/Linux Server administration and integration with cloud environments
  • Proven experience with infrastructure-as-code tools, specifically AWS CDK and Terraform
  • Strong experience designing and implementing CI/CD pipelines using GitLab CI/CD
  • Experience deploying and managing .NET applications in cloud environments
  • Deep understanding of security best practices and their implementation in cloud infrastructure and CI/CD pipelines
  • Solid understanding of networking principles (TCP/IP, DNS, load balancing, firewalls) in cloud environments
  • Experience with monitoring and logging tools (e.g., NewRelic, CloudWatch)
  • Strong scripting skills (e.g., PowerShell, Python, Ruby, Bash)
  • Excellent problem-solving and troubleshooting skills
  • Strong communication and collaboration skills
  • Experience with the configuration management tool Chef
  • Extensive hands-on experience with Amazon Web Services (AWS)

Responsibilities

  • Design, implement, and manage scalable and resilient infrastructure on AWS
  • Architect and maintain Windows/Linux based environments, ensuring seamless integration with cloud platforms
  • Develop and maintain infrastructure-as-code(IaC) using both AWS Cloudformation/CDK and Terraform/OpenTofu
  • Develop and maintain Configuration Management for Windows & Linux servers using Chef
  • Design, build, and optimize CI/CD pipelines using GitLab CI/CD for .

NET applications

  • Integrate and support AI services, including orchestration with AWS Bedrock, Google Agentspace, and other generative AI frameworks, ensuring they can be securely and efficiently consumed by platform services
  • Enable AI/ML workflows by building and optimizing infrastructure pipelines that support large-scale model training, inference, and deployment across AWS and GCP environments
  • Automate model lifecycle management (training, deployment, monitoring) through CI/CD pipelines, ensuring reproducibility and seamless integration with development workflows
  • Collaborate with AI engineering teams to deliver scalable environments, standardized APIs, and infrastructure that accelerate AI adoption at the platform level
  • Implement observability, security, data privacy and cost-optimization strategies specifically for AI workloads, including monitoring and resource scaling for inference services
  • Implement and enforce security best practices across the infrastructure and deployment processes
  • Collaborate closely with development teams to understand their needs and provide DevOps expertise
  • Troubleshoot and resolve infrastructure and application deployment issues
  • Implement and manage monitoring and logging solutions to ensure system visibility and proactive issue detection
  • Clearly and concisely contribute to the development and documentation of DevOps standards and best practices
  • Stay up-to-date with the latest industry trends and technologies in cloud computing, DevOps, and security
  • Provide mentorship and guidance to junior team members

Full Description

We do have an AI Devops Engineer role in Westlake Village CA(Onsite).

Please find the Job Description below and kindly respond back with your updated resume.

Job Title : AI Devops Engineer

Job Location : Westlake Village CA(Onsite)

Duration : 12+ Months

Job Description:

• Design, implement, and manage scalable and resilient infrastructure on AWS.

• Architect and maintain Windows/Linux based environments, ensuring seamless integration with cloud platforms.

• Develop and maintain infrastructure-as-code(IaC) using both AWS Cloudformation/CDK and Terraform/OpenTofu.

• Develop and maintain Configuration Management for Windows & Linux servers using Chef.

• Design, build, and optimize CI/CD pipelines using GitLab CI/CD for .NET applications.

• Integrate and support AI services, including orchestration with AWS Bedrock, Google Agentspace, and other generative AI frameworks, ensuring they can be securely and efficiently consumed by platform services.

• Enable AI/ML workflows by building and optimizing infrastructure pipelines that support large-scale model training, inference, and deployment across AWS and GCP environments.

• Automate model lifecycle management (training, deployment, monitoring) through CI/CD pipelines, ensuring reproducibility and seamless integration with development workflows.

• Collaborate with AI engineering teams to deliver scalable environments, standardized APIs, and infrastructure that accelerate AI adoption at the platform level.

• Implement observability, security, data privacy and cost-optimization strategies specifically for AI workloads, including monitoring and resource scaling for inference services.

• Implement and enforce security best practices across the infrastructure and deployment processes.

• Collaborate closely with development teams to understand their needs and provide DevOps expertise.

• Troubleshoot and resolve infrastructure and application deployment issues.

• Implement and manage monitoring and logging solutions to ensure system visibility and proactive issue detection.

• Clearly and concisely contribute to the development and documentation of DevOps standards and best practices.

• Stay up-to-date with the latest industry trends and technologies in cloud computing, DevOps, and security.

• Provide mentorship and guidance to junior team members.

Qualifications:

• Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).

• 5+ years of experience in a DevOps or Site Reliability Engineering (SRE) role.

• 1+ year(s) of experience with AI services & LLMs.

• Extensive hands-on experience with Amazon Web Services (AWS)

• Solid understanding of Windows/Linux Server administration and integration with cloud environments.

• Proven experience with infrastructure-as-code tools, specifically AWS CDK and Terraform.

• Strong experience designing and implementing CI/CD pipelines using GitLab CI/CD.

• Experience deploying and managing .NET applications in cloud environments.

• Deep understanding of security best practices and their implementation in cloud infrastructure and CI/CD pipelines.

• Solid understanding of networking principles (TCP/IP, DNS, load balancing, firewalls) in cloud environments.

• Experience with monitoring and logging tools (e.g., NewRelic, CloudWatch).

• Strong scripting skills (e.g., PowerShell, Python, Ruby, Bash).

• Excellent problem-solving and troubleshooting skills.

• Strong communication and collaboration skills.

• Experience with containerization technologies (e.g., Docker, Kubernetes) is a plus.

• Relevant AWS and/or GCP certifications are a plus.

• Experience with the configuration management tool Chef

Preferred Qualifications:

• Knowledge of and a strong understanding of PowerShell and Python Scripting

• Strong background with AWS EC2 features and Services (Autoscaling and WarmPools)

• Understanding of Windows server Build process using tools like Chocolaty for packages and Packer for AMI/Image generation.

Extensive hands-on experience with Amazon Web Services (AWS)

Zero to AI Engineer Program

Zero to AI Engineer

Skip the degree. Learn real-world AI skills used by AI researchers and engineers. Get certified in 8 weeks or less. No experience required.