- Microsoft Corporation (Reston, VA)
- …no further than the Microsoft Defender engineering team. We are looking for a Site Reliability Engineer II who will be building and delivering cloud solutions to ... as a Designated Responsible Individual (DRI) in a 24x7 on-call rotation, monitoring service health and responding to incidents within SLA timelines. + Automation &… more
- Coinbase (Richmond, VA)
- …Q3 2023. *What you'll be doing (i.e., job duties):* * Improve observability, reliability and availability by defining and measuring key metrics * Build automation and ... Kubernetes, EC2, etc.) * Collaborate with Coinbase product teams to reduce service disruptions and automate incident response * Proactively find and analyze … more
- Microsoft Corporation (Reston, VA)
- …+ 24x7 On-Call Rotation: Participate in a regular on-call schedule to monitor service health, respond to incidents, and escalate complex issues as needed. + Support ... Development & Design: Make basic code changes to improve reliability, security, and observability, and engage in design/code reviews with guidance from senior… more
- Rubrik (Richmond, VA)
- …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
- Amazon (Arlington, VA)
- Description Join AWS Region Reliability and help revolutionize how AWS operates at scale! We're building innovative solutions that redefine and optimize AWS ... - from foundational services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
- Nutanix (Richmond, VA)
- …or specific events. **Your Role** + Ensure the 24/7 availability and reliability of Nutanix's cloud services and infrastructure. + Respond promptly to alerts ... + Participate in on-call rotation to provide after-hours support and maintain service level agreements (SLAs). + Develop and enhance automation scripts using… more
- Palo Alto Networks (Reston, VA)
- …automation, architecture, performance, observability, troubleshooting, security, and reliability. Our Infrastructure Platform stack includes Terraform, Kubernetes, ... PKI concepts + Expertise in troubleshooting and resolving cloud infrastructure and service issues, identifying root cause and devising effective solutions for high… more
- Amazon (Arlington, VA)
- …this goal, we are continually striving to innovate and provide best in class service levels through the introduction of pioneering new products and services. To that ... end, Amazon is seeking an engineer over maintenance, repair and vendor management operations to support grocery expansion across the North American market. Key job… more
- SitusAMC (Richmond, VA)
- …+ Experience writing IAM Policies, Permission sets and creating roles for service-to-service communication + Experience with Scrum or Agile methodologies + ... Extensive experience in Scripting and Automation using any Scripting Language, preferably Python + Knowledge and development experience with Terraform or CloudFormation + In-depth knowledge of any of the Git platforms such as Azure DevOps, GitHub etc. +… more
- Dominion Energy (Glen Allen, VA)
- Electric Transmission - Engineer / Senior Engineer Dominion Energy is committed to providing reliable, affordable, and increasingly clean energy that powers our ... office, two days of teleworking) to accommodate the need for flexibility. Military service members and veterans with ranks from E5-E9, W1-CW5, or O3-O6, plus… more