- SpaceX (Hawthorne, CA)
- Site Reliability Engineer, Hardware and Infrastructure (Starshield) Hawthorne, CA SpaceX was founded under the belief that a future where humanity ... this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, HARDWARE AND INFRASTRUCTURE (STARSHIELD) At SpaceX we're…
- Amazon (Cupertino, CA)
- …AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced cross-disciplinary staff to ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna...functions as a vertically integrated team including software, firmware, hardware, and silicon design in a single organization. We…
- SpaceX (Hawthorne, CA)
- Site Reliability Engineer, GNC (Falcon) Hawthorne, CA SpaceX was founded under the belief that a future where humanity is out exploring the stars is ... the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, GNC (FALCON)...deployment, operation and refinement + Make recommendations for future hardware purchases + Practice sustainable incident response and postmortems…
- SpaceX (Hawthorne, CA)
- Site Reliability Engineer (Special Programs) Hawthorne, CA SpaceX was founded under the belief that a future where humanity is out exploring the stars ... the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER (SPECIAL PROGRAMS)...in the cloud + Build, maintain, and scale on-premises hardware systems designed to host GPU-accelerated machine learning workloads…
- Google (Sunnyvale, CA)
- …platforms, and low level software/operating systems We are looking for a Principal Engineer to join Infrastructure Site Reliability Engineering. You will ... Experience developing and implementing technical programs in compute infrastructure or hardware platform design + Experience defining system or process requirements…
- ServiceNow, Inc. (San Diego, CA)
- It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... technical engineers who are tasked with maintaining and developing the reliability, scalability and performance of the ServiceNow cloud infrastructure. Our SRE's…
- Google (Sunnyvale, CA)
- …**Preferred qualifications:** + Master's degree in Computer Science or Engineering. Site Reliability Engineering (SRE) combines software and systems engineering ... Google Cloud's services - both our internally critical and our externally-visible systems - have reliability, uptime appropriate to customer's needs and a fast rate of…
- NVIDIA (Santa Clara, CA)
- …frameworks like PyTorch, TensorFlow, JAX, and Ray. + A strong background in hardware health monitoring and system reliability. + Hands-on expertise in operating ... large-scale systems supporting critical use cases for AI Infrastructure, driving reliability, operability, and scalability across global public and private clouds. +…
- Amazon (Cupertino, CA)
- …culture that welcomes bold ideas and empowers you to own them to completion. Server Hardware Engineer (aka Lead Engineer (LE)). Amazon Web Services (AWS) ... key factors such as total cost of ownership, quality, reliability, performance, and serviceability. You will be an end-to-end...implementation of your work. Key job responsibilities As Server Hardware Engineer you will be responsible for…
- Amazon (Cupertino, CA)
- …culture that welcomes bold ideas and empowers you to own them to completion. Server Hardware Engineer (aka Lead Power Design Engineer). Amazon Web Services ... ownership for the implementation of your work. Key job responsibilities As Senior Hardware Engineer you will be responsible for the architecture, design,…