• Distributed Systems Engineer, Cassandra,…

    DoorDash (San Francisco, CA)
    …have experience managing large fleets of database clusters, including building tools, monitoring, capacity planning, backups and disaster recovery strategies. + ... the Role The Storage team is building and operating a high performance, scalable, reliable data abstraction layer that optimizes reliability and efficiency. You…
    DoorDash (07/04/25)
  • Linux Site Reliability Engineer

    Nutanix (Sacramento, CA)
    …Nutanix team, where your skills will directly impact the availability and performance of our cutting-edge cloud technology while collaborating with a global network ... QA, Development, and Infrastructure teams to design and implement robust monitoring solutions. + Manage deployment of software patches, upgrades, and administrative…
    Nutanix (09/24/25)
  • Engineer, Software & Information Platform

    Cardinal Health (Sacramento, CA)
    …and takes timely corrective action. + Works with the Unix/Linux/Windows Administrator on capacity planning and performance tuning of servers hosted on GCP cloud. ... SCM, Portal, PO, BI, Netweaver, SLT, SideCar) + Daily administration, monitoring, troubleshooting and tuning of a complex SAP production...+ System Copy and Refreshes + ABAP and Java performance Tuning + SNC Encryption/Cryptography + SAP SSO and…
    Cardinal Health (09/18/25)
  • Sr. Hardware Development Engineer/Signal…

    Amazon (Cupertino, CA)
    …the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience ... will be crucial for optimizing high-speed interfaces and ensuring robust system performance. You will interact with an interdisciplinary team of engineers to design,…
    Amazon (09/16/25)
  • Sr. Hardware Dev Engineer (AWS Generative…

    Amazon (Cupertino, CA)
    …training and inference? Want to do industry leading work delivering continuous price performance improvements in the cloud for AI model training for multi billion ... in designing, delivering and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. AWS Infrastructure Services owns…
    Amazon (10/01/25)
  • Hardware Development Engineer II, AWS…

    Amazon (Cupertino, CA)
    …the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience ... creative and new designs that set the standards on performance, quality, cost, and operational excellence. What you will...you will oversee the fleet of servers you develop, monitoring their quality and how they are meeting the…
    Amazon (09/23/25)
  • Senior GPU and HPC Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    …and excellent communication and planning abilities. Experience working with High Performance Computing (HPC), GPUs, and high-performance networking (RDMA, ... the better. You and other engineers on this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of…
    NVIDIA (07/10/25)
  • Staff Engineer Commercial Technologies

    Cardinal Health (Sacramento, CA)
    …to production outages. + Analyze production system operations using tools such as monitoring, capacity analysis and outage root cause analysis to identify and ... process improvements and back-end solutions for commercial technologies to maximize performance and suitability for business needs. This job family manages…
    Cardinal Health (09/03/25)
  • Senior Software Engineer, Bare Metal…

    NVIDIA (Santa Clara, CA)
    …challenged, improving, and evolving for the better. You will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of ... software related to managing fleets of GPU nodes. + Implementing monitoring and health management capabilities that enable industry leading reliability,…
    NVIDIA (09/29/25)
  • Principal Site Reliability Engineer (Sase)

    Palo Alto Networks (Santa Clara, CA)
    …architecture to improve scalability in networking like BGP, OSPF, service reliability, capacity, and performance + Collaborate with development teams to ensure ... work follow using Python or Go code + Build BGP and networking monitoring/remediation tools + Engage with customers on escalations to provide remediation +…
    Palo Alto Networks (09/25/25)