Site Reliability Engineer Object Storage
Scaleway
- Lille, Nord Paris
- CDI
- Temps-plein
- Design, implement, and maintain highly available and resilient Object Storage solutions to ensure scalability, availability and performance.
- Develop automation tools and workflows to streamline provisioning, monitoring, and management of Object Storage infrastructure, ensuring that it scales effectively.
- React to incident and troubleshooting activities in collaboration with other teams
- Design technical solutions that address market defined challenges
- Present your work during tech meetings
- Linux (Ubuntu servers)
- Go, C
- gRPC, Protobuf
- PostgreSQL, Redis
- Vector, ElasticSearch, Kibana
- VictoriaMetrics, Prometheus, Grafana
- Ansible
- GitLab CI/CD, Git
- HAProxy, ExaBGP
- Strong Linux knowledge
- Good system-level programming skills
- Good understanding of C
- Basic understanding of Go
- Experience with Git and CI/CD.
- Proactive mindset with a focus on identifying and addressing issues before they impact scalability and reliability.
- Great oral and written communication skills
- Experience in designing, implementing, or maintaining storage infrastructure in production environments
- Experience with (and love for) distributed systems
- Experience with incident management and on-call support in a production environment.
- Passion for automation and tooling
- Infrastructure deployment with Ansible
- Strong problem-solving skills
- Experience with the S3 API
- Logging and monitoring (Vector, VictoriaMetrics, Grafana, …)
- Able to work efficiently in written English