  • Reliability Testing Intern, Operations…

    NVIDIA (Santa Clara, CA)
    …advancement. Our team is searching for a Graduate Intern to join our Board Level Reliability team to evaluate PCBA/Module level reliability. You will work in ... the Board Level Reliability lab environment testing for various NV products including large server systems and perform various functional tests for GPU/Tegra products.… more
    NVIDIA (01/10/26)
  • Senior Site Reliability Engineer - Cloud…

    Oracle (Sacramento, CA)
    …management, and postmortems + Solid grasp of networking, security fundamentals, and performance engineering **Nice to have** + Experience in regulated or ... **Job Description** Senior Site Reliability Engineer - Cloud Automation (Oracle Health |...operations, and on-call excellence + Build automation and self-healing systems using IaC (e.g., Terraform) and CI/CD + Design,… more
    Oracle (12/03/25)
  • Site Reliability Engineer Intern

    IBM (San Jose, CA)
    …knowledge in monitoring/observability, issue response, and troubleshooting for optimal system performance. * Automation: knowledge in automation for ... systems and services around the clock, ensuring continuous reliability and optimal customer experience. * Cross-Functional Troubleshooting: Collaborate with … more
    IBM (11/22/25)
  • Sr. Staff Software Engineer, Reliability

    LinkedIn (Mountain View, CA)
    …1 billion members and the application layer. We do this with a focus on performance, security, and reliability. As a Sr. Staff Software Engineer, you will fill ... will be based in Mountain View, CA. LinkedIn's Edge Engineering team builds and operates the infrastructure that resolves,...L4/7 proxies, CDN, RUM, WAF and DDoS, and web performance. + Experience with Linux operating systems… more
    LinkedIn (10/16/25)
  • Senior Site Reliability Engineer, BCM - DGX…

    NVIDIA (Santa Clara, CA)
    …in Computer Science or related field. + 8+ years of experience in site reliability engineering and/or software development roles. + Fluency in Python + In-depth ... and networking Ways to stand out from the crowd: + Experience with C++, high-performance computing, Kubernetes and/or system administration would be an asset +… more
    NVIDIA (10/28/25)
  • Principal Network Reliability Engineer…

    Oracle (Sacramento, CA)
    …services deployed in more than 40 regions worldwide. The mission of our Network Reliability Engineering team is to provide services that allow our customers to ... Scrum & Agile Methodologies + Strong technical knowledge in cloud networking, high performance computing, and GPU systems. #LI-KR4 Oracle is an Equal Employment… more
    Oracle (12/21/25)
  • Sr. Network Reliability Developer

    Oracle (Sacramento, CA)
    **Job Description** The mission of our Network Reliability Engineering team is to provide exceptional network reliability and automation services that enable ... minimize downtime, quickly resolve incidents, and continuously enhance network performance through automation, advanced monitoring, and a customer-centric approach.… more
    Oracle (12/01/25)
  • Principal Network Reliability Engineer

    Oracle (Sacramento, CA)
    **Job Description** The mission of our Network Reliability Engineering team is to provide exceptional network reliability and automation services that enable ... minimize downtime, quickly resolve incidents, and continuously enhance network performance through automation, advanced monitoring, and a customer-centric approach.… more
    Oracle (12/01/25)
  • Data Center Design Lead, System

    Google (Sunnyvale, CA)
    …on machine learning systems. + Experience with GPU/TPU architectures, AI system integration, and performance techniques. + Experience with data center ... Data Center Design Lead, System Engineering and Architecture Google...infrastructure, including power, networking, storage, and cooling systems. + Experience with cost and performance… more
    Google (12/17/25)
  • Sr Site Reliability Engineer (Prisma…

    Palo Alto Networks (Santa Clara, CA)
    …are robust and performant. This includes automation, architecture, performance, observability, troubleshooting, security, and reliability. Our Infrastructure ... and weekend, to support critical business operations and production systems and for incident response. + **Lead root cause...align cloud operations with business goals. **The Team** Our engineering team is at the core of our products… more
    Palo Alto Networks (12/12/25)