- Sacramento Municipal Utility District (Sacramento, CA)
- Title: Senior Reliability Risk Engineer Department: Reliability Risk & Internal Controls Location: Sacramento, CA, US, 95817-1899 Category: Miscellaneous ... to apply early to ensure they are considered.** **This posting is for the Senior Reliability Risk Engineer . If you would also like to be considered for… more
- TP-Link North America, Inc. (Irvine, CA)
- …simpler, smarter, and more reliable connectivity. We're looking for a passionate and experienced Senior Site Reliability Engineer to join our team and play ... and tools + Help to mentor and train less senior members of the team + Ability to be...field. + 5+ years of experience as a Site Reliability Engineer . + Proficiency in programming and… more
- NVIDIA (Santa Clara, CA)
- Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: + Own the solutions you build, collaborating… more
- Celonis (Redwood City, CA)
- …that, we need you to join us. **The Team** As a member of our Reliability Engineering team, you will play a critical role in ensuring the health, performance, and ... of our platform. The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability , scalability, and… more
- NVIDIA (Santa Clara, CA)
- …efficient and reliable systems is an imperative. We are looking for a System Reliability Engineer to join NVIDIA's existing Reliability Engineering team, ... Servers. What you'll be doing: + Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, rack, cluster) from… more
- NVIDIA (Santa Clara, CA)
- …drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big picture of ... workflows. You will design, implement and support operational and reliability aspects of large scale distributed systems with focus...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
- Ford Motor Company (Irvine, CA)
- …this highly interdisciplinary role, you will work with multiple teams and help set reliability targets at the system and subsystem level. You will oversee the design ... an entire vehicle. **What you'll do ** + Define reliability targets for different systems and subsystems by cascading...or Chemistry with 5+ years of relevant experience in reliability engineering, test design, and failure analysis or equivalent… more
- NVIDIA (Santa Clara, CA)
- …foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are responsible for the big picture of how ... comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency. + Develop, define and document standard methodologies… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
- Insight Global (Novato, CA)
- …Enablement Work closely with backend and DevOps teams. Contribute to system reliability standards and documentation. Mentor engineers on Unix system performance and ... Hands-on experience with observability tools. Ability to troubleshoot complex reliability issues. Nice to Have Experience with live game infrastructure.… more
Recent Jobs
-
Beef Room Operator, AM, Deli
- Publix (Lakeland, FL)
-
Direct Support Professional Relief - PDM086 - On-Call
- WellLife Network (Brooklyn, NY)
-
QA/QC Inspection Specialist
- ICF (Newark, NJ)
-
PMO Manager, Information Systems
- Community Health Systems (Franklin, TN)