Sr. Site Reliability Engineer - Monitoring
Marriott
- Bethesda, MD Montgomery, AL
- $96,038-209,169 per year
- Permanent
- Full-time
Job Category Information Technology
Location Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States
Schedule Full-Time
Located Remotely? Y
Relocation? N
Position Type ManagementJOB SUMMARYLead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of applications and infrastructure in support of incident and problem investigation and application release management. Develops solutions to automate IT operations tasks. Develops custom monitoring solutions when standard tools are not available. Partners closely with Product Teams, Applications teams, Infrastructure, and the broader Applications and Infrastructure Delivery teams to develop key metrics and KPIs to improve applications stability, availability and performance. Develop and delivers reports on metrics about the operations of the IT environment's hardware and software to ensure everything functions as expected to support applications and services. Drives adoption of monitoring tools and metrics across IT and partners closely with Applications team, Infrastructure, and the broader Application Production Support team to better monitor the performance of applications.CANDIDATE PROFILEEducation and ExperienceRequired:
- 7+ years’ experience in information technology process and/or technical project management across diverse areas and technologies
- 3+ years’ experience in the following:
- Application Performance Management (APM) and in the use of related tools such as Dynatrace.
- Site reliability engineering
- Automation of self-healing solutions via tooling
- Undergraduate degree or equivalent experience/certification
- 5+ years experience in a technical discipline role with experience in planning, implementing and evaluating processes, systems and/or initiatives
- Broad technical acumen across multiple disciplines applications with a solid understanding of current technologies
- Familiar with Site Reliability Engineering principles and concepts. Engineers automation to perform routine IT tasks.
- Experience applying measurement processes/methods for assessing program outputs and outcomes or progress toward goals and objectives.
- Extremely high level of analytical ability with complex problems
- Ability to work across organizational boundaries, to help lead and influence change
- Ability to command the process across all levels to ensure customer focus; including being assertive and self-starting
- Demonstrated leadership experience in influence and garnering alignment from external organizations
- Ability to align change management strategies with project
- Skilled in conceptualizing creative solutions, documenting them, and presenting/selling them to senior management
- Familiar with ITIL processes. ITIL v4 certifications a plus.
- Very high level of interpersonal skills to work effectively with others, motivate employees, and elicit work output in a team environment
- Proven experience, knowledge and demonstration of continuous process improvement
- Assess application architectures to identify key monitoring points
- Identify Key Performance Indicators, apply monitoring, and report out on compliance.
- Own the application availability tiering definition and assignment for every inventoried application
- Gather information to develop reporting metrics and KPIs
- Ensure that all applications adhere to appropriate monitoring standards based on their technology/business process
- Ensures that all monitors are active and accurate
- Partner with Incident Management and Problem Management teams to augment monitoring as required to avoid future Incidents/Problems
- Partner with Infrastructure team on application monitoring requirements for monitoring platform selection
- Determine forums and cadence to provide regular monitoring updates
- Collaborates with Enterprise Application and Architecture and Infrastructure teams to continuously improve processes and procedures.
- Liaises with vendors and Service Providers to select services and tools that best meet company goals
- Functions as a strategic senior technical expert within the department.
- Develops specific goals and plans to prioritize, organize, and accomplish work.
- Champions leaders’ vision for product and service delivery.
- Makes and executes the necessary decisions to keep moving forward toward achievement of goals.
- Provides direction and assistance to other teams regarding projects.
- Determines priorities, schedules, plans and necessary resources to promote completion of any projects on schedule.
- Analyzes information and evaluates results to choose the best solution and solve problems.
- Reviews vendor proposals and selects appropriate vendor for services/technologies/hardware.
- Thinks creatively and practically to develop, execute and implement new project plans.
- Generates and provides accurate and timely results in the form of reports, presentations, etc.
- Plans, develops, implements, and evaluates the quality of operations.
- Understands and meets the needs of key stakeholders.
- Communicates concepts in a clear and persuasive manner that is easy to understand.
- Demonstrates an understanding of business priorities.
- Supports achievement of performance goals, budget goals, team goals, etc.
- Provides technical expertise and technical leadership within own and other teams.
- Provides recommendations to improve the effectiveness of processes and programs.
- Demonstrates advanced knowledge of job-relevant issues, products, systems, and processes.
- Demonstrates advanced knowledge of function-specific procedures.
- Applies knowledge/judgment to achieve business goals.
- Foresees, identifies and resolves problems.
- Keeps up-to-date technically and applies new knowledge to job.
- Performs other reasonable duties as required for this position.