Role Summary
This position is for an Observability Engineer specializing in deploying and managing leading monitoring tools to achieve full-stack observability. The role focuses on leveraging Dynatrace, AppDynamics, and Splunk across applications, infrastructure, and cloud environments. The engineer is crucial in building proactive monitoring solutions like dashboards, alerts, and SLOs to maintain system health, detect performance bottlenecks, and support timely incident resolution.
Responsibilities
Deploy and manage Dynatrace, AppDynamics, and Splunk to deliver full-stack observability.
Build dashboards, alerts, and SLOs for proactive system health monitoring.
Detect and analyze performance issues to support swift incident resolution.
Work closely with development, operations, and SRE teams to instrument services.
Analyze telemetry data and optimize system reliability.
Perform hands-on keyboard implementation, configuration, and troubleshooting for the tools and Operating System.
Required Skills
Dynatrace APM : Minimum 5 years of hands-on experience in implementation, configuration, and troubleshooting.
AppDynamics APM : Minimum 5 years of hands-on experience in implementation, configuration, and troubleshooting.
Splunk : Minimum 5 years of hands-on experience in implementation, configuration, and troubleshooting.
Requires a Bachelor's degree.
Must be able to perform hand-on keyboard work related to the tool and Operating System.
Job Responsibilities
Deploy, administer, and manage observability platforms including Dynatrace, AppDynamics, and Splunk .
Establish full-stack observability across applications, underlying infrastructure, and cloud platforms.
Design and implement effective monitoring dashboards, alerts, and Service Level Objectives (SLOs).
Proactively identify performance bottlenecks and system health issues.
Collaborate with Development, Operations, and Site Reliability Engineering (SRE) teams to instrument services.
Execute hands-on configuration and troubleshooting tasks related to the monitoring tools and Operating System.