Site Reliability Engineer Object Storage

Scaleway

Lille, Nord Paris
CDI
Temps-plein

Il y a 1 mois

Fondée en 1999, Scaleway est la filiale cloud du groupe Iliad, l'un des leaders des télécommunications en Europe. Notre mission est de favoriser une industrie numérique plus responsable en aidant les développeurs et les entreprises à créer, déployer et adapter des applications à n'importe quelle infrastructure.Depuis nos bureaux situés à Paris et à Lille, nous perfectionnons quotidiennement l'écosystème cloud de Scaleway, dont nous sommes les premiers utilisateurs.Nos quelques 25 000 clients nous choisissent pour notre redondance multi-AZ, notre expérience-utilisateur fluide, nos datacenters neutres en carbone ainsi que nos outils natifs de gestion d'architectures multi-cloud. Nos produits incluent des solutions entièrement gérées pour le bare metal, la conteneurisation et les architectures serverless, offrant ainsi un choix responsable dans le domaine du cloud computing.Rejoignez notre équipe dynamique de près de 600 collaborateurs venant de divers horizons, dans un environnement stimulant et international alliant excellence technique, créativité et partage.About the JobThe Object Storage team is a cornerstone of Scaleway. Our mission is to provide S3-compatible Object Storage to our clients but also to all the other Scaleway Elements products that rely on it (Instances, Databases, Registry, and more). In a challenging environment, we (and hopefully you soon) manage hundreds of Object Storage servers across various regions, dealing with petabytes of data while ensuring high availability. As a Site Reliability Engineer in our team, you will be responsible for developing, automating, and enhancing Scaleway's Object Storage solution. On top of your daily activities within the team, you will need to interact with all of Scaleway's teams, especially Instance, Network, Hardware, and Platform.\nResponsibilities

Design, implement, and maintain highly available and resilient Object Storage solutions to ensure scalability, availability and performance.
Develop automation tools and workflows to streamline provisioning, monitoring, and management of Object Storage infrastructure, ensuring that it scales effectively.
React to incident and troubleshooting activities in collaboration with other teams
Design technical solutions that address market defined challenges
Present your work during tech meetings

Technical Stack

Linux (Ubuntu servers)
Go, C
gRPC, Protobuf
PostgreSQL, Redis
Vector, ElasticSearch, Kibana
VictoriaMetrics, Prometheus, Grafana
Ansible
GitLab CI/CD, Git
HAProxy, ExaBGP

Minimum qualifications

Strong Linux knowledge
Good system-level programming skills
Good understanding of C
Basic understanding of Go
Experience with Git and CI/CD.
Proactive mindset with a focus on identifying and addressing issues before they impact scalability and reliability.
Great oral and written communication skills

Preferred qualifications

Experience in designing, implementing, or maintaining storage infrastructure in production environments
Experience with (and love for) distributed systems
Experience with incident management and on-call support in a production environment.
Passion for automation and tooling
Infrastructure deployment with Ansible
Strong problem-solving skills
Experience with the S3 API
Logging and monitoring (Vector, VictoriaMetrics, Grafana, …)
Able to work efficiently in written English

\nLocationThis position is based in our offices in Paris or Lille (France).Recruitment ProcessScreening call - 30 mins with the recruiterManager Interview - 45 minsTechnical InterviewsHR Interview - 45 minsHead of Interview - 45 minsOffer sentSi vous ne vous voyez pas cocher toutes les cases, n'hésitez pas à postuler tout de même. Ne vous limitez pas à une description de poste - on ne sait jamais !🌐 | |

Scaleway

Postuler