Posted on 2025/03/30
Platform Infrastructure O&M Engineer
Pro5.ai
India
Full Description
As a Platform Infrastructure O&M Engineer, you will be responsible for the operation
and maintenance of Univers PaaS/SaaS platform infrastructure, ensuring high
availability, reliability, and performance. You will work closely with development, operations, and security teams to optimize platform architecture, enhance system stability, and promote automation in operations.
Responsibilities:
• Manage the daily operation, monitoring, and optimization of the PaaS/SaaS
platform infrastructure to ensure high availability and stability.
• Design and implement automation tools to improve operational efficiency and
reduce manual intervention.
• Manage and optimize cloud computing resources (such as Azure, AWS, or
other cloud platforms) to ensure cost efficiency and resource utilization.
• Conduct system capacity planning, performance tuning, and troubleshooting
to enhance overall system efficiency.
• Participate in CI/CD process optimization to support DevOps teams in
continuous delivery and rapid deployment.
• Ensure platform security by collaborating with security teams to perform
vulnerability scanning, compliance checks, and security policy
implementation.
• Write and maintain operational documentation, troubleshooting guides, and
related technical materials.
Requirements:
• Bachelor’s degree or above in Computer Science, Information Technology,
Electronic Engineering, or related fields.
• 3+ years of experience in operations and infrastructure management,
preferably in PaaS/SaaS environments.
• Proficiency in Linux/Unix system administration, with scripting skills in Shell,
Python, or other automation languages.
• Hands-on experience with container technologies such as Kubernetes and
Docker.
• Familiarity with cloud computing architectures (Azure, AWS, GCP) and related
operational tools and best practices.
• Knowledge of database management (e.g., MySQL, PostgreSQL, MongoDB)
with optimization and troubleshooting capabilities.
• Experience in monitoring and log analysis tools such as Prometheus,
Grafana, ELK, Datadog.
• Understanding of DevOps culture and CI/CD tools (e.g., Jenkins, GitLab CI,
ArgoCD).
• Strong collaboration and communication skills, with the ability to work
efficiently with development, operations, and security teams.
• Excellent troubleshooting and problem-solving skills, with the ability to
respond quickly in high-pressure situations.
Preferred Qualifications:
• Azure certifications (e.g., Azure Administrator, Azure DevOps Engineer) are a
plus.
• Experience with large-scale distributed system operations.
• Knowledge of networking concepts including VPN, DNS, CDN, and load
balancing.

Zero to AI Engineer
Skip the degree. Learn real-world AI skills used by AI researchers and engineers. Get certified in 8 weeks or less. No experience required.
Find AI, ML, Data Science Jobs By Location
Find Jobs By Position