"Alerted.org

Job Title, Industry, Employer
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Advanced Search

Advanced Search

Cancel
Remove
+ Add search criteria
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Related to

  • Monitoring Engineer

    SAIC (Springfield, VA)



    Apply Now

    Description

    The Senior Monitoring Engineer in Springfield, VA is a senior-level technical expert who is accountable for the advanced troubleshooting, performance analysis, and optimization of enterprise monitoring platforms. This position is responsible for the design, implementation, and ongoing enhancement of observability solutions in hybrid environments, including on-premises, cloud, and virtual infrastructure. The engineer is responsible for the final escalation point for complex monitoring issues, collaborates with other teams to guarantee system reliability, and promotes best practices in observability.

    Key Responsibilities:

    + Serve as the Tier 3 escalation point for issues related to any of the monitoring/observability platforms and tools.

    + Lead root cause analysis (RCA) for major incidents and recurring performance issues.

    + Maintain, configure, and optimize monitoring tool deployments across cloud (e.g., AWS, Azure), on-premises, and VMware environments.

    + Design and implement custom dashboards, synthetic monitoring, and service-level objectives (SLOs).

    + Develop and maintain alerting strategies that reduce noise and ensure actionable notifications.

    + Work closely with application, infrastructure, DevOps, and security teams to define monitoring requirements and integrate observability into CI/CD pipelines.

    + Analyze metrics, logs, and traces to ensure end-to-end service visibility and performance optimization.

    + Assist in onboarding applications and teams into the observability platform.

    + Provide training and mentorship to Tier 1 and Tier 2 support teams.

    + Ensure platform resilience, availability, and compliance with internal standards and SLAs.

    + Participate in on-call rotations for high-priority incidents as needed.

    Qualifications

    Required Education & Experience:

    + BS an 9 years experience; MS and 7 years experience; may accept additional experience in lieu of degree.

    + 5+ years of experience in IT infrastructure, application performance monitoring, or site reliability engineering (SRE).

    + 2+ years of hands-on experience using platforms such as Dynatrace, Zabbix, and monitoring tools in VMware Cloud Foundation (VCF).

    + Solid understanding of observability concepts including metrics, logs, traces, and user experience monitoring.

    + Experience supporting complex, distributed systems in cloud and hybrid environments.

    + Proficient with scripting and automation (e.g., PowerShell, Python, Bash, or Ansible).

    + Strong understanding of networking, Linux/Windows systems, containers, and application architectures (microservices, APIs, etc.).

    + Experience curating and implementing dashboards.

    + Excellent troubleshooting and problem-solving skills.

    + Strong written and verbal communication.

    + Ability to work independently and collaboratively across teams.

    + Customer-focused mindset and attention to detail.

    + Continuous learning and adaptability in a fast-paced environment.

    Required Clearance:

    + US Citizenship.

    + Active secret security clearance with the ability to obtain a top secret clearance.

    Preferred Qualifications:

    + Dynatrace Associate or Professional Certification.

    + Experience with Dynatrace, including OneAgent deployment, Smartscape, PurePath, and Davis AI.

    + Experience with integration of Dynatrace with tools such as ServiceNow, Splunk, Jira, or CI/CD pipelines.

    + Experience with other observability tools (e.g., Prometheus, Grafana, New Relic, AppDynamics, Splunk, Elastic).

    + Familiarity with DevOps practices and Infrastructure-as-Code (e.g., Terraform).

    + Understanding of ITIL framework and change management processes.

    REQNUMBER: 2508991

    SAIC is a premier technology integrator, solving our nation's most complex modernization and systems engineering challenges across the defense, space, federal civilian, and intelligence markets. Our robust portfolio of offerings includes high-end solutions in systems engineering and integration; enterprise IT, including cloud services; cyber; software; advanced analytics and simulation; and training. We are a team of 23,000 strong driven by mission, united purpose, and inspired by opportunity. Headquartered in Reston, Virginia, SAIC has annual revenues of approximately $6.5 billion. For more information, visit saic.com. For information on the benefits SAIC offers, see Working at SAIC. EOE AA M/F/Vet/Disability

     


    Apply Now



Recent Searches

  • Sr Staff Engineer Systems (United States)
  • Advanced Practice Provider NP (Tennessee)
  • mass spectrometry scientist reach (United States)
  • Access Optimization Analyst (United States)
[X] Clear History

Recent Jobs

  • Monitoring Engineer
    SAIC (Springfield, VA)
  • Mechanical Engineer
    Actalent (Riverside, MI)
[X] Clear History

Account Login

Cancel
 
Forgot your password?

Not a member? Sign up

Sign Up

Cancel
 

Already have an account? Log in
Forgot your password?

Forgot your password?

Cancel
 
Enter the email associated with your account.

Already have an account? Sign in
Not a member? Sign up

© 2025 Alerted.org