- Northrop Grumman (Los Angeles, CA)
- …be a part of our mission! We are looking for you to join our team as a Systems Safety & Reliability Engineer based out of Woodland Hills, CA. Our success is ... DO-178B/C, DO-254, ARP-4754, ARP-4761, etc. A fundamental knowledge of Reliability Engineering principles and System Safety Program conduct in...Sr. Principal Level. **Basic Qualifications for a Principal Systems Safety Engineer :** +… more
- Lumen (Sacramento, CA)
- …the world and shape the future. **The Role** We are looking for a Senior Site Reliability Engineer (SRE)/ Platform Engineer / DevOps Engineer with deep ... expertise in Kubernetes to design, implement, and manage high-availability, scalable systems primarily on AWS EKS. In this role, you will leverage tools like… more
- JPMorgan Chase (Palo Alto, CA)
- …projects. You've discovered the perfect environment to have a major impact. As a ** Principal Site Reliability Engineer ** at JPMorgan Chase within the ... involve overseeing, designing, and deploying infrastructure components to enhance reliability and ensure operational efficiency. **Job responsibilities** + Architect… more
- Palo Alto Networks (Santa Clara, CA)
- …insights into our systems ' performance and health. **Your Impact** As a Principal SRE with the Cortex Cloud Security Posture Management team, you will: + Cloud ... including the design, implementation, and continuous enhancement of our comprehensive observability systems . To meet the opportunities that such a role provides, you… more
- NVIDIA (Santa Clara, CA)
- …on the world. NVIDIA is looking to hire a deeply technical and creative Site Reliability Engineer to build, support and maintain the next generation AI powered ... challenges, automate processes, and iterate for efficiency + Tackle systemic reliability issues with multi-functional teams. + Monitor, optimize, and manage system… more
- Palo Alto Networks (Santa Clara, CA)
- …a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer , you will be part of a team supporting the services running ... This includes automation, architecture, performance, metrics, troubleshooting, security, and reliability . Our stack includes Kubernetes, Docker, GCP, AWS, Ansible,… more
- NVIDIA (Santa Clara, CA)
- …Will Be Doing: + Architect, lead, and scale globally distributed production systems supporting AI/ML, HPC, and critical engineering platforms across hybrid and ... change safety, and release velocity. + Define and evolve platform-wide reliability metrics, capacity forecasting strategies, and uncertainty testing approaches for… more
- Palo Alto Networks (Santa Clara, CA)
- …a large infrastructure and is one of the biggest GCP customers. As a Principal SRE, you'll be at the forefront of building and maintaining highly reliable, scalable, ... champion SRE best practices, and work collaboratively to ensure our systems are robust and performant. This includes automation, architecture, performance,… more
- Verint Systems, Inc. (Sacramento, CA)
- … Reliability Engineer ensures the scalability, availability, performance, and reliability of cloud-based systems and participates in and leads the design, ... at www.verint.com . **Overview of Job Function:** Verint's Sr. Reliability Engineer is responsible for all aspects...team of engineers to build robust, observable, and resilient systems that meet business objectives, following DevOps and SRE… more
- City and County of San Francisco (San Francisco, CA)
- …the minimum qualifications. Application Deadline: Continuous How to Apply: Applications for Principal Information Systems Engineer - Networks Specialty are ... that integrate these systems together as an enterprise networking backbone. The 1044 Principal Networks Engineer is the highest level in the Engineer … more