Senior SRE Engineer

Moonpig

Manchester London
Permanent
Full-time

21 days ago

Our Ways of Working Principles:We believe that most of us do our best work when we work together, but we know that everyone works in different ways, and quite frankly, has other commitments and responsibilities outside of work.As we further adjust to hybrid working, we want to take what we've learnt from working remotely and keep the flexibility that's enabled us to thrive and keep driving our business forward.We have some core principles which support us in this:Do what's rightTrust & give permissionDelivery mattersWe understand ways of working can look different based on your role, team and you as an individual so we are here to support and discuss this with you during the interview process.Please note, there is no mandatory office working days and only occasional travel required to London or Manchester.Work with usAt Moonpig Group our mission is to help people connect and create moments that matter. We're an international group made up of three brilliant brands - Moonpig in the UK, Ireland, US and Australia, and Greetz in the Netherlands - with our newest addition Buyagift joining us in 2022.We were founded with a goal to disrupt the traditional greetings industry. Two decades on, we're an established leader within the online gifting market, offering a wide range of products to customers across the world.Moonpig is an iconic brand and innovator, with clear values (read more about our values ). These values set our teams and our business up for success in an environment that's fun, supportive and challenging. They're the glue that binds us together and we think of them as a platform to help us deliver our best work. You have every chance to drive impact here at Moonpig, and most importantly, we genuinely want you!Our architecture is built for scale and flexibility which will allow us to quickly innovate and launch new propositions - coupling that with the wealth of data we have on our customers, the sky's the limit in the world of experimenting with cutting edge ideas.About the role:We are currently looking for a Senior Site Reliability Engineer to join our Enablement Infrastructure team, who are natural problem solvers and solve cross cutting problems across our entire technology estate.Site reliability engineers are key to running our platform at scale, so the position requires solid experience running operational systems in the wild and working closely with our engineering teams to identify root causes of stability and performance issues so they can be resolved by the teams.What you'll be doing:

Optimising our cloud infrastructure to balance platform stability vs cost.
Coding infrastructure automation using Terraform, deployed by CI/CD
Supporting & patching our infrastructure as necessary to ensure we maintain high-levels of system security.
Improving our Monitoring and logging stack so we can identify issues early
Develop a relationship with our engineering teams, helping them define their SLAs and improve their overall system reliability

Our expectations of our Senior Site Reliability Engineers:

Good engineering comes first - You'll have a great technical knowledge base and the experience to know what works and what doesn't. We expect you to apply these skills in making the right decisions and applying best practices wherever possible.
Technical mentoring and leadership - You'll be collaborative, inclusive and spreading knowledge wherever possible. People will be looking up to you for technical guidance and part of your role will be to help them on that journey. You will also be responsible for creating the right forums to drive engineering principles and practices across all of engineering. You have the autonomy to drive decisions, but it's your responsibility to ensure everyone is involved.
Culture and advocacy - You will be supporting a growth culture (e.g. running lunch & learns, brown bags, etc.) as well as advocating the organisation externally through meetups, blogging, hackathons etc. This is important to us as we are all in this together.

What you can expect:

Be part of a cross-functional team of SREs and Software Engineers implementing platform tooling and helping us maximise our uptime.
Work in an environment that cares about operational concerns.
Deliver value by jumping feet first into a wide range of problems to be solved. These could be internally facing within our technology organisation or for our external customers.
Be on an on-call PagerDuty rotation to respond to platform availability issues and provide support for engineers with incidents.
Be challenged to learn new skills and techniques; the range of problems you'll be solving means it's almost impossible not to learn something new.
Work in a fun and social environment!

You'll be a good fit for the role if you:

Designing infrastructure using code with Terraform
You have worked with highly available, high transactional websites and applications within a microservice architecture, clustered systems, automated deployments, disaster recovery and business continuity.
Hands-on expertise in designing, analysing and troubleshooting large-scale distributed systems.
Have a good understanding of network and operational security concepts
Used AWS and its associated products, including API Gateway, Lambda, EC2, S3, VPC, CloudWatch and ALB.
Monitoring production systems using industry standard tooling including Grafana and Opensearch
Good communication skills; you are able to share status updates clearly and to ask timely and relevant questions when working with your peers.

Our Tech Environment

AWS, Serverless, Terraform, TypeScript, C#, .NET, GraphQL and React.
GitHub for SCM and CI/CD
Robust and performant cloud/serverless applications, with a focus on user experience and business growth.
Full-stack, cross-functional teams, working closely with people of different specialisms within your team and across the business.

How we get there:

Kanban
Jira / Confluence
Grafana and AWS Cloudwatch
Google Analytics
Clean Architecture
TDD
Pair Programming
Focus on experimentation to validate our hypothesis

Want to hear more?Find out more about Moonpig Group and what it has to offer !Moonpig's Commitment to Equality, Diversity and InclusivityAt Moonpig Group, we're committed to creating an inclusive and caring culture with brilliant people who feel a real sense of belonging. We welcome and celebrate all diverse backgrounds to Moonpig Group, from working parents who need flexibility with their hours to individuals who are neurodiverse and prefer to work a certain way.We're proud to have several employee-led committees within our organisation, including the LGBTQ+, Gender Balance, Neurodiversity and our EMBRACE (Educating Myself for Better Racial Awareness and Cultural Enrichment) Committees.We'll continue to push for diversity and that sense of belonging so that all Moonpig Group employees feel safe and comfortable to be their true authentic self at work.

Moonpig

Apply Now