Site Reliability Engineer

Notino

  • Brno, South Moravian Region
  • Permanent employment
  • Full-time
  • Posted 1 month ago
What is it about?

You will be part of the team that participates in application management and is in constant contact with our application architects, technical leads, developers, and other parties. Your main goal will be to improve our e-commerce platform and to keep looking for new technologies and better approaches.

What will your job be about?

  • Maintaining production services by measuring and monitoring availability, latency, and overall system health
  • Increasing our observability by identifying gaps, serving as an expert in discussions around logging, metadata, and response codes, and creating dashboards and custom alerts/metrics in our APM tools
  • Implementing further automation and reducing toil
  • Collaborating with cross-functional teams to improve the reliability and performance of systems
  • Understanding service level indicators and using service level objectives to proactively resolve issues before they impact customers

How do we imagine you?

  • Strong drive for process automation
  • Urge to apply the most pragmatic technical approach at hand
  • Openness to discussion to reach a consensus with the rest of the team
  • Ability to solve complex tasks independently
  • Expert knowledge of at least one high-level programming language
  • Advanced knowledge of Kubernetes
  • Advanced knowledge of Linux
  • Experience with NoSQL databases
  • Knowledge of code version control culture (git, merging strategies, code build & deploy automation)
  • Advanced communication skills in English

Extra mile for us:

  • Experience with observability tools like Prometheus / New Relic / Grafana
  • Knowledge of .NET / Node.js / Python
  • Experience with GitLab & GitLab CI
  • E-commerce platform development experience
  • Experience with a big production / high-traffic website (ideally a transactional retail website)
  • Experience with a complex infrastructure setup
  • Knowledge of cloud technologies and infrastructure
