< More Jobs

Posted on 2025/03/30

Platform Infrastructure O&M Engineer

Pro5.ai

India

Full-time

Full Description

As a Platform Infrastructure O&M Engineer, you will be responsible for the operation

and maintenance of Univers PaaS/SaaS platform infrastructure, ensuring high

availability, reliability, and performance. You will work closely with development, operations, and security teams to optimize platform architecture, enhance system stability, and promote automation in operations.

Responsibilities:

• Manage the daily operation, monitoring, and optimization of the PaaS/SaaS

platform infrastructure to ensure high availability and stability.

• Design and implement automation tools to improve operational efficiency and

reduce manual intervention.

• Manage and optimize cloud computing resources (such as Azure, AWS, or

other cloud platforms) to ensure cost efficiency and resource utilization.

• Conduct system capacity planning, performance tuning, and troubleshooting

to enhance overall system efficiency.

• Participate in CI/CD process optimization to support DevOps teams in

continuous delivery and rapid deployment.

• Ensure platform security by collaborating with security teams to perform

vulnerability scanning, compliance checks, and security policy

implementation.

• Write and maintain operational documentation, troubleshooting guides, and

related technical materials.

Requirements:

• Bachelor’s degree or above in Computer Science, Information Technology,

Electronic Engineering, or related fields.

• 3+ years of experience in operations and infrastructure management,

preferably in PaaS/SaaS environments.

• Proficiency in Linux/Unix system administration, with scripting skills in Shell,

Python, or other automation languages.

• Hands-on experience with container technologies such as Kubernetes and

Docker.

• Familiarity with cloud computing architectures (Azure, AWS, GCP) and related

operational tools and best practices.

• Knowledge of database management (e.g., MySQL, PostgreSQL, MongoDB)

with optimization and troubleshooting capabilities.

• Experience in monitoring and log analysis tools such as Prometheus,

Grafana, ELK, Datadog.

• Understanding of DevOps culture and CI/CD tools (e.g., Jenkins, GitLab CI,

ArgoCD).

• Strong collaboration and communication skills, with the ability to work

efficiently with development, operations, and security teams.

• Excellent troubleshooting and problem-solving skills, with the ability to

respond quickly in high-pressure situations.

Preferred Qualifications:

• Azure certifications (e.g., Azure Administrator, Azure DevOps Engineer) are a

plus.

• Experience with large-scale distributed system operations.

• Knowledge of networking concepts including VPN, DNS, CDN, and load

balancing.

Zero to AI Engineer Program

Zero to AI Engineer

Skip the degree. Learn real-world AI skills used by AI researchers and engineers. Get certified in 8 weeks or less. No experience required.