- DoorDash (San Francisco, CA)
- …have experience managing large fleets of database clusters, including building tools, monitoring, capacity planning, backups and disaster recovery strategies. + ... the Role The Storage team is building and operating a high performance, scalable, reliable data abstraction layer that optimizes reliability and efficiency. You… more
- Nutanix (Sacramento, CA)
- …Nutanix team, where your skills will directly impact the availability and performance of our cutting-edge cloud technology while collaborating with a global network ... QA, Development, and Infrastructure teams to design and implement robust monitoring solutions. + Manage deployment of software patches, upgrades, and administrative… more
- Cardinal Health (Sacramento, CA)
- …and takes timely corrective action. + Works with the Unix/Linux/Windows Administrator on capacity planning and performance tuning of servers hosted on GCP cloud. ... SCM, Portal, PO, BI, NetWeaver, SLT, SideCar) + Daily administration, monitoring, troubleshooting and tuning of a complex SAP production...+ System Copy and Refreshes + ABAP and Java performance Tuning + SNC Encryption/Cryptography + SAP SSO and… more
- Amazon (Cupertino, CA)
- …the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience ... will be crucial for optimizing high-speed interfaces and ensuring robust system performance. You will interact with an interdisciplinary team of engineers to design,… more
- Amazon (Cupertino, CA)
- …training and inference? Want to do industry leading work delivering continuous price performance improvements in the cloud for AI model training for multi billion ... in designing, delivering and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. AWS Infrastructure Services owns… more
- Amazon (Cupertino, CA)
- …the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience ... creative and new designs that set the standards on performance, quality, cost, and operational excellence. What you will...you will oversee the fleet of servers you develop, monitoring their quality and how they are meeting the… more
- NVIDIA (Santa Clara, CA)
- …and excellent communication and planning abilities. Experience working with High Performance Computing (HPC), GPUs, and high-performance networking (RDMA, ... the better. You and other engineers on this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of… more
- Cardinal Health (Sacramento, CA)
- …to production outages. + Analyze production system operations using tools such as monitoring, capacity analysis and outage root cause analysis to identify and ... process improvements and back-end solutions for commercial technologies to maximize performance and suitability for business needs. This job family manages… more
- NVIDIA (Santa Clara, CA)
- …challenged, improving, and evolving for the better. You will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of ... software related to managing fleets of GPU nodes. + Implementing monitoring and health management capabilities that enable industry leading reliability,… more
- Palo Alto Networks (Santa Clara, CA)
- …architecture to improve scalability in networking like BGP, OSPF, service reliability, capacity, and performance + Collaborate with development teams to ensure ... work follow using Python or Go code + Build BGP and networking monitoring/remediation tools + Engage with customers on escalations to provide remediation +… more