- PennyMac (Westlake Village, CA)
- …quickly and accurately, is critical to the success of anyone in this role. The Engineer III, Site Reliability Operations will: + Monitoring - Oversee 24/7 health ... A Typical Day As a member of the Site Reliability Operations (SRO) team, you will help provide 24/7...Pennymac is now almost completely migrated into the AWS cloud . Individuals in this role should be comfortable working… more
- Nutanix (Sacramento, CA)
- …Python and Bash. + Experience in a 24/7 NOC environment, preferably with a cloud service provider. + Solid understanding of cloud infrastructure components ... collaboration or specific events. **Your Role** + Ensure the 24/7 availability and reliability of Nutanix's cloud services and infrastructure. + Respond promptly… more
- Coinbase (Sacramento, CA)
- …less customer impact . *Role* - We would like to add a Senior Software Engineer to help promote reliability culture across Coinbase. You would be helping ... you'll be doing (ie. job duties):* *Team* - Core Reliability team is a vital part of Infrastructure(Platform) org...to scale the system by 10-20x and help secure service configurations & secrets by building/enhancing world class … more
- Nelnet (Sacramento, CA)
- …range for this role is $110,000-$155,000 **What you'll do:** **As a System Reliability Engineer at Nelnet, you will:** * Ensure solutions are running ... company committed to enriching lives through the power of service as a student loan servicer, professional services company,...to standards defined by the SDLC. **As a System Reliability Engineer , a typical day might include:**… more
- JPMorgan Chase (Palo Alto, CA)
- …the perfect environment to have a major impact. As a **Principal Site Reliability Engineer ** at JPMorgan Chase within the **Enterprise Technology, AI/ML & ... involve overseeing, designing, and deploying infrastructure components to enhance reliability and ensure operational efficiency. **Job responsibilities** + Architect… more
- Palo Alto Networks (Santa Clara, CA)
- …perspective. + Design, review and enhance software architecture to improve scalability, service reliability , cost, and performance - you've helped create ... CDSS group is looking for a seasoned platformization and cloud automation engineer to design, develop, and...development teams to ensure that applications have scalability and reliability built-in from day one- agile is second nature… more
- Palo Alto Networks (Santa Clara, CA)
- …perspective + Design, review and enhance software architecture to improve scalability, service reliability , cost, and performance - you've helped create services ... CDSS group is looking for a seasoned platformization and cloud automation engineer to design, develop and...development teams to ensure that applications have scalability and reliability built-in from day one - agile is second… more
- Palo Alto Networks (Santa Clara, CA)
- …win with precision. **Your Career** We are looking for an exceptional Sr Principal Software Engineer to enhance our ATP Cloud team. This role is central to our ... of backend services, with a keen eye for scalability, reliability , and performance. The ideal candidate will possess a...The ideal candidate will possess a deep understanding of cloud computing, particularly within the Google Cloud … more
- Palo Alto Networks (Santa Clara, CA)
- …enhance software architecture to improve scalability in networking like BGP, OSPF, service reliability , capacity, and performance + Collaborate with development ... help our customers in their journey to the public cloud by ensuring they have the best in class...in this space. We are seeking development heavy Site Reliability Engineers to design, build, maintain, and scale production… more
- NVIDIA (Santa Clara, CA)
- …next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role in designing, ... High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed… more