- NVIDIA (Santa Clara, CA)
- …SOL quality and efficiency. The DFP team is looking for a Speed and Reliability Lead. You will be leading and crafting testability features related to Speed, Timing ... and Reliability from ground up as you help turbocharge NVIDIA's...bringup and tuning a plus, related to timing, speed, reliability and power. + Familiarity with STA timing closure,… more
- ServiceNow, Inc. (San Diego, CA)
- …technical engineers who are tasked with maintaining and developing the reliability , scalability and performance of the ServiceNow cloud infrastructure. Our SRE's ... repeatable issues. + Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design. + Drive… more
- NVIDIA (Santa Clara, CA)
- …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... architectures and identify opportunities for containerization to improve scalability, reliability , and efficiency. + Strong analytical skills with the ability… more
- Rubrik (Sacramento, CA)
- …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
- Electric Power Research Institute (Palo Alto, CA)
- **Job Title:** Fuel Reliability Principal Team Lead **Location:** Charlotte, NC, Palo Alto, CA **Job Summary and Description:** The position is for an individual ... demonstrated with experience and activities related to nuclear fuel operation, reliability and performance + Understanding of current fuel operation technical issues… more
- PennyMac (Westlake Village, CA)
- …the complete mortgage journey. A Typical Day As a member of the Site Reliability Operations (SRO) team, you will help provide 24/7 monitoring and support of the ... critical to the success of anyone in this role. The Engineer III, Site Reliability Operations will: + Monitoring - Oversee 24/7 health monitoring of the company's IT… more
- NVIDIA (Santa Clara, CA)
- …aspect of the network infrastructure, ensuring its high availability and reliability . + Partnering with architecture and deployment teams to guarantee that ... + Minimum of 8 years of industry experience in network site reliability engineering, network automation, network operations, or related areas. Experience on both… more
- Palo Alto Networks (Santa Clara, CA)
- …team to influence the operability of the product and ensure the reliability and availability of our services **Your Experience** + DevOps/SRE Expertise: 5+ ... engineer with a passion for technology and a strong motivation for high reliability at the service level + Observability Tools: High proficiency with Thanos,… more
- Walmart (Stockton, CA)
- …spending plans and expense budgets with maintenance operations driving reliability and improved maintenance practices equipment enhancements and replacements ... auditing facility asset base for example building structure grounds parking lots material handling equipment energy center pump house retention ponds advising on condition of assets for future replacements and maintenance needs developing annual overhaul and… more
- PennyMac (Westlake Village, CA)
- …through the complete mortgage journey. Job Overview As a member of the Site Reliability Operations (SRO) team, you will help provide 24/7 monitoring and support of ... the company's IT Infrastructure. Ideal candidates should have experience in Windows and Linux administration, in addition to experience working in AWS, as Pennymac is now almost completely migrated into the AWS cloud. Individuals in this role should be… more