Cette offre de poste n'est plus disponible

Site Reliability Engineer

Site Reliability Engineer (SRE) 🧑‍💻

We are currently looking for a talented and motivated Site Reliability Engineer (SRE) to join our dynamic team of five other SREs. If you are a curious problem-solver with experience in multi-region deployments, Linux, and Kubernetes, you'll fit right in!

Location: Rennes, Paris, or fully remote.

Your missions:

  • Build and maintain a scalable and highly available SOC platform over four regions.
  • Improve performance or availability issues using your expertise.
  • Automate the lifecycle of dozens of microservices on Kubernetes.
  • Automate backups and restorations for our databases.
  • Resolve production incidents in a 2-level on-call rotation.
  • Ensure we meet our SLA by providing observability and resilience at all levels.

Our technical stack:

  • Kubernetes: k3s, Traefik, Cilium, Ceph, ArgoCD, Helm, Rancher
  • Observability: Thanos, Prometheus, Grafana, Loki
  • Tools: Python, Ansible, SaltStack, Terraform
  • Databases: Elasticsearch (> 300 nodes), Kafka (> 3M rps), Clickhouse (> 10 TB), Redis, KeyDB, PostgreSQL, ArangoDB
  • CI/CD: GitHub Actions, Harbor
  • Cloud providers: OVH, Akamai, Azure, Scaleway

What we're looking for:

  • Strong experience in Linux systems administration and networking.
  • Experience with Kubernetes, Docker, and container orchestration.
  • Experience with cloud computing platforms (AWS, Azure, GCP).
  • Experience with monitoring and alerting systems (Prometheus, Grafana, etc.).
  • Experience with automation tools (Ansible, SaltStack, etc.).
  • Strong problem-solving and analytical skills.
  • Excellent communication and teamwork skills.

Benefits:

  • Competitive salary and benefits.
  • Opportunity to work with a talented team of engineers.
  • Chance to work on cutting-edge technology.
  • Flexible work environment.

To apply:
Please send your resume and cover letter to [email protected]

Référence :SSREEIEN

Skills

Ops
Kubernetes
Ansible
ArgoCD
Docker
Terraform
Traefik
Data
Grafana
ArangoDB
Elasticsearch
Kafka
PostgreSQL
Redis
Cloud
Azure
Prometheus
Cloud Computing
Helm
Tooling
Github
Inconnu
Harbor
Back-end
Python