Information Technology 🏢 Full Time ⭐️ Verified

Senior Site Reliability Engineer (SRE)

Nexus Cloud Systems
San Francisco
Estimated Salary
USD 175,000 – USD 225,000
Live Update
2 June 2026
Deadline
2 June 2027

Job Description

Are you obsessed with system uptime, performance at scale, and the art of automation? Nexus Cloud Systems is seeking a high-impact Senior Site Reliability Engineer to join our core infrastructure team. In this role, you will be the architect of our reliability strategy, bridging the gap between development and operations to build highly resilient, cloud-native environments that support millions of global users.

We foster a culture of blameless post-mortems, continuous learning, and cutting-edge engineering. If you are passionate about observability, infrastructure-as-code, and eliminating toil, we want to hear from you.

Responsibilities

  • Architect and maintain highly scalable, distributed systems within AWS/GCP cloud environments.
  • Automate infrastructure provisioning and configuration management using Terraform and Ansible.
  • Define and implement Service Level Objectives (SLOs) and Error Budgets to maintain system health.
  • Lead incident response efforts and conduct blameless post-mortems to improve system resilience.
  • Optimize production monitoring, logging, and alerting systems for proactive issue detection.
  • Collaborate with software engineering teams to ensure service architecture follows reliability best practices.
  • Manage CI/CD pipelines to ensure rapid, safe, and automated deployment cycles.

Qualifications

  • 5+ years of experience in SRE, DevOps, or large-scale Systems Engineering.
  • Expert-level proficiency with container orchestration (Kubernetes) and cloud infrastructure (AWS preferred).
  • Strong programming skills in Go, Python, or Ruby for automation and tool development.
  • Deep understanding of observability stacks (Prometheus, Grafana, ELK, or Datadog).
  • Proven experience managing high-traffic distributed systems and microservices architectures.
  • Solid foundation in networking protocols (TCP/IP, DNS, TLS) and security best practices.
  • Exceptional problem-solving skills and ability to thrive in a fast-paced, collaborative environment.

Required Skills

Kubernetes · AWS · Terraform · Go · Python · Observability · CI/CD · Distributed Systems · SRE

Ready to Take On This Challenge?

Make sure your resume is ready, and submit your application before the deadline.
