This job offer is no longer available

SRE Platform Engineer

> 5 years of experience
Permanent contract (CDI)
Site Reliability Engineer (SRE)
€55k ➞ €70k/year
Kubernetes
Go
ArgoCD

About Agicap

Founded in Lyon in 2016 by three French entrepreneurs, Agicap is one of the fastest-growing scale-ups in Europe, with over 8,000 clients, 600 employees, and revenue that grew sevenfold between 2021 and 2024. Agicap is part of the French Tech 120, which recognizes France's most promising startups.

Agicap empowers the financial departments of medium-sized businesses to effectively manage and forecast their cash flow, pay their suppliers, and get paid. By combining treasury management with accounts payable and receivable solutions on a single platform, Agicap aims to become the new benchmark for treasury management for mid-sized companies.

We have raised €145 million since our inception from prestigious venture capital funds such as AVP, Greenoaks, Partech, and BlackFin, with our latest round in November 2024.

These additional resources further fuel our product innovation, enabling us to grow significantly and accelerate our international expansion.

We are convinced that our success lies in our passionate and dedicated teams, and we are committed to building a company where everyone can evolve and build their career. We are constantly looking for new talent who aim for excellence and are ready to join our ambitious adventure!



At Agicap, we have an SRE Platform team that builds and operates the low-level infrastructure layers (Kubernetes clusters, networking, etc.) on which our products run.
The infrastructure is already well-established, stable, and running at scale. The focus now is on consolidating, maintaining, and optimizing what already exists, with tooling decisions driven by concrete needs (and not by the "tool of the moment" effect).
We are therefore looking for someone who enjoys deeply understanding what exists, making it reliable, optimizing it, and evolving it properly.


🚀 Missions:

As an SRE Platform Engineer, you will join the Platform team to:

  • Ensure the reliability, maintainability, and performance of the low-level infrastructure layers (Kubernetes, networking, IAM, Kubernetes operators).
  • Pragmatically improve the existing platform (architecture, costs, tooling, observability).
  • Help tech and product teams use the infrastructure soundly and autonomously.

🛠️ Responsibilities:

Reliability & Run

  • Operate and evolve our Kubernetes clusters (200+ nodes per cluster).
  • Improve the platform's resilience, capacity, and performance.
  • Participate in on-call duties and manage incidents with a post-mortem / continuous improvement approach.

Platform Engineering

  • Maintain and develop our Kubernetes operators (Go) that provision a significant part of the infrastructure.
  • Automate platform workflows (provisioning, audit, compliance, lifecycle).
  • Contribute to distributed architecture patterns and platform networking.

Observability / Costs

  • Evolve our metrics / logs / traces stack, balancing cost, performance, and simplicity.
  • Example of a recent decision: migrating from Prometheus/Thanos to VictoriaMetrics to reduce complexity and run costs.

Communication / Enablement

  • Clearly explain the "how / why" to internal teams.
  • Produce useful documentation, share best practices, and provide training if needed.
  • The goal is not just to fix things, but to make teams autonomous.

💻 Stack & Technical Environment:

  • Kubernetes (GKE), large scale
  • Internal Kubernetes operators in Go
  • ArgoCD, GitOps
  • Istio, HAProxy, advanced networking
  • Prometheus / VictoriaMetrics, Loki
  • eBPF (low-level diagnostics / observability)
  • Automations / audits with MCP
  • GCP cloud ecosystem

🎯 Profile Sought:

Must-Have

  • Concrete experience with Kubernetes in production (not just local development).
  • Good foundations in networking and distributed systems.
  • Hands-on Go experience, at least for infrastructure, operators, or tooling (no need to be a language expert).
  • Comfortable digging into existing, complex systems, understanding them, and optimizing them without breaking them.
  • Reliability / monitoring / post-mortem culture.

Highly Valued

  • GitOps / ArgoCD experience.
  • Istio / service mesh knowledge.
  • Hands-on experience with Prometheus / VictoriaMetrics / Loki.
  • FinOps awareness (run costs, sizing, architectural choices).
  • eBPF or low-level debugging experience.

Soft Skills

  • Pragmatic and impact-oriented: tooling is a means, not an end.
  • Curious, enjoys understanding details when necessary.
  • Good communicator: explains, shares, documents.
  • Capable of managing incidents calmly and deriving improvements from them.

On-Call

  • Participation in the SRE on-call rotation.
  • The goal is a sustainable, well-supported rotation (solid alerting, runbooks, useful post-mortems).

✨ Why Join Us:

  • A large-scale Kubernetes platform with real platform challenges.
  • Many internal building blocks to understand and evolve (operators, automations, SRE patterns).
  • Technical decisions made for good reasons (simplicity / cost / reliability).
  • A mature SRE team that values quality, impact, and sharing.

🔍 Recruitment Process:

  1. First interview with Théo, our Talent Acquisition Partner - 45 min
  2. Meeting with Matthis, our Head of SRE: culture fit / motivation - 45 min
  3. Technical interview with one or more SRE team members - 2 h
  4. Team / culture / ways of working meeting - informal - 15 to 30 min

€55,000 - €70,000 a year

AGICAP is committed to offering equal opportunities to all; we cultivate an inclusive work environment that values diversity!

Reference: agicap-lever+Agicap-SRE-Platform-Engineer

Skills

Ops
Kubernetes
ArgoCD
Back-end
Go
Project management
Management
Cloud
Prometheus
Tooling
Flow
No code
Make
Others
MCP