Deskripsi Pekerjaan
Are you obsessed with system scalability, high availability, and automating the mundane? Join NexusCloud Systems, a leader in cloud-native infrastructure, to help us architect the next generation of our global platform. As a Senior SRE, you will bridge the gap between development and operations, ensuring our services are resilient, performant, and secure.
We are looking for a platform-minded engineer who thrives in complex, distributed systems environments. You will work alongside world-class DevOps and Software Engineering teams to influence the architectural direction of our products.
Tanggung Jawab
- Design, build, and maintain scalable, highly available cloud infrastructure on AWS/GCP.
- Drive capacity planning, performance analysis, and optimization of distributed systems.
- Implement Infrastructure as Code (IaC) using Terraform and Pulumi to automate deployment lifecycles.
- Proactively identify system bottlenecks and execute automated remediation strategies.
- Lead incident response efforts and conduct blameless post-mortems to improve system reliability.
- Mentor junior engineers and promote a culture of operational excellence and SRE best practices.
- Manage CI/CD pipelines to ensure seamless, reliable software delivery for global teams.
Kualifikasi
- Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience.
- 5+ years of experience in SRE, DevOps, or Software Engineering roles.
- Deep expertise in Kubernetes, Docker, and container orchestration at scale.
- Proficiency in programming with Go, Python, or Ruby for automation and tool development.
- Strong background in Linux system internals, networking, and security best practices.
- Proven experience with observability tools like Prometheus, Grafana, ELK stack, or Datadog.
- Excellent communication skills with the ability to bridge technical and business requirements.