Senior SRE (English)
An exciting opportunity awaits a Senior Site Reliability Engineer ready to join a forward-thinking technology organisation in Taipei. This role suits someone who thrives on building and maintaining robust, scalable infrastructure across multiple cloud platforms, with a particular focus on AWS and Terraform.
As a Senior Site Reliability Engineer based in Taipei, you will play a pivotal role in shaping the technical foundation of the organisation's most critical systems. Day to day, you will architect resilient infrastructure across multiple clouds, primarily AWS, using Terraform to automate deployments and streamline operations. You will work hand-in-hand with development teams to embed reliability into every stage of the software lifecycle, and you will monitor performance metrics proactively to anticipate problems before they arise. By developing comprehensive incident response plans and participating in on-call duties, you will help safeguard uptime for millions of users. Your expertise will also guide architectural choices for new projects from their earliest stages (0-1), ensuring that every solution is built with scalability and sustainability in mind. Through thorough documentation and active participation in team discussions, you will foster an environment of shared learning and continuous improvement.
- Design, build, and maintain scalable infrastructure solutions across AWS and other cloud providers to support business-critical applications.
- Implement Infrastructure as Code practices using Terraform to automate deployment processes and improve system reliability.
- Collaborate closely with software engineering teams to ensure seamless integration of new features into production environments while maintaining system stability.
- Monitor system performance proactively, identifying potential issues before they impact users and implementing effective solutions.
- Develop robust incident response procedures and participate in on-call rotations to ensure rapid resolution of operational issues.
- Drive improvements in system observability by enhancing monitoring, logging, and alerting capabilities across all services.
- Champion best practices for security, compliance, and cost optimisation within cloud environments.
- Contribute to architectural decisions for new projects, providing guidance on scalability, reliability, and maintainability from inception (0-1) through production.
- Document processes thoroughly to facilitate knowledge transfer within the team and support ongoing training initiatives.
About the job
Contract Type: Perm
Specialism: IT & Digital Transformation
Focus: Infrastructure, Network & System
Industry: IT
Salary: Negotiable
Workplace Type: Hybrid
Experience Level: Mid Management
Location: Taipei
Employment Type: Full-time
Job Reference: EC7CH9-FC0FF932
Date posted: 25 December 2025
Consultant: Amy Lin
Robert Walters: https://www.robertwalters.com.tw