Lead Infrastructure Engineer - Supporting Customer Facing Applications
JPMorgan Chase
- Jersey City, NJ
- Permanent
- Full-time
- Ensure user visible uptime and quality, providing operational and development expertise in making our systems have proactive monitoring, fail rarely and automatically fix when they do fail
- Minimize manual involvement by driving solutions, automation and implementing continuous improvements that creates an operating environment, including development & configuration for dynamic monitoring, alerting & recovery
- Own day-to-day health, uptime, monitoring, reliability of services & server infrastructure, performance improvements, change management and capacity management of the services supported
- Build strong relationship & Engage in with the development team throughout the life cycle to help build for reliability
- Identify and/or analyze patterns of incidents/problem, conduct flawless post-mortems, develop permanent remediation plans, implement automation to prevent future incidents from re-occurring again
- Facilitate maximum speed of delivery by objectively binding to disruptions of the service
- Applies technical expertise and problem-solving methodologies to projects of moderate scope
- Drives a work stream or project consisting of one or more infrastructure engineering technologies
- Works with other platforms to architect and implement changes required to resolve issues and modernize the organization and its technology processes
- Executes creative solutions for the design, development, and technical troubleshooting complex problems
- Strongly considers upstream/downstream data and systems or technical implications and advises on mitigation actions
- Adds to team culture of diversity, equity, inclusion, and respect
- Formal training or certification on Infrastructure Engineering concepts and 5+ years applied experience
- Strong experience with Automation and Configuration tools like Ansible
- Deep knowledge of one or more areas of infrastructure engineering such as hardware, networking terminology, databases, storage engineering, deployment practices, integration, automation, scaling, resilience, or performance assessments
- Deep knowledge of one specific infrastructure technology and scripting languages (e.g., Scripting, Python, etc.)
- Drives to continue to develop technical and cross-functional knowledge outside of the product
- Experience with instrumentation, monitoring, alerting and responding - relative to performance and availability of application, using tools such as Dynatrace, Splunk, Grafana etc.
- Deep knowledge of cloud infrastructure and multiple cloud technologies with the ability to operate in and migrate across public and private clouds
- Good to have - knowledge of Cloud Engineering. Understanding of private cloud principles and exposure to public cloud offerings such as AWS, Azure, Google cloud or similar technology is preferred