Site Reliability Engineer
Impact Tech
- Columbus, OH
- $100,000-110,000 per year
- Permanent
- Full-time
- Become an expert of our observability tools, such as Elastic APM, Splunk, Logic Monitor, Grafana. Build and enhance monitoring and alerting solutions with these tools to ensure we are aware of the health of customer-facing product features
- Leverage data analyst tools such as DataBricks, Python, Machine Learning, Anomaly Detection to merge and augment observability datasets
- Ensure performance and metrics of our product features are within acceptable thresholds. Examples include: response latency, cpu and memory consumption, cloud costs, error rates, uptime
- Analyze and understand client-generated workloads - ensure resource consumption is consistent with their contract (e.g. Enterprise clients generate larger requests than Small Businesses, not the other way around).
- Configure alerts and document associated run books when thresholds are exceeded
- Drive and contribute to root-cause analysis when issues are identified
- Troubleshoot across the entire stack: hardware, software, database, network, applications, customer-generated workloads
- This role will have a particular focus on database performance and query optimization
- Remediations may include software or database optimizations, changes to rate limit configurations, or client outreach to reduce heavy requests.
- Identify targets for optimization, and collaborate with appropriate teams and stakeholders to ensure issues are resolved
- Analyze platform usage data to calculate costs generated by customers using our platform, inform capacity planning decisions, and find area for improvement
- Solid understanding of systems and application design.
- Experience building and using observability features with tools such as ElasticSearch, Grafana, LogicMonitor, Splunk.
- Experience database monitoring and SQL tuning
- Proficient in at least one high-level programming language and shell scripting.
- Ability to prioritize tasks and work independently.
- Ability to adapt and focus on the simplest, most efficient and reliable solutions.
- B.S. in Computer Science or similar field or equivalent experience.
- Medical, Dental and Vision insurance
- Unlimited responsible PTO
- Flexible work hours
- Continued access to
- Catered lunch every Thursday, a healthy snack bar, and great coffee to keep you fueled.
- Flexible spending accounts and 401(k)
- An employee-led culture team that plans inclusive events- meaning time together and other events to celebrate our many successes!
- An established company with a cool, high-velocity work ethos, where each person can make a difference!