- Google (San Francisco, CA)
- …Scenario Planning , Monte Carlo simulation, Sensitivity Analysis). + Experience in cloud capacity planning . **Preferred qualifications:** + Master's or ... equivalent practical experience across Cloud Capacity Planning and Operations....cross-functional partners. + Experience with Product, Technical expertise with Infrastructure , Cloud , and Enterprise B2B businesses. +… more
- NVIDIA (Santa Clara, CA)
- …We are specifically looking for a TPM with extensive experience in cloud infrastructure bring-up and relationship management. You'll be instrumental in ... of continuous improvement, consistently finding opportunities for process improvements within our cloud infrastructure operations. What we need to see: + 10+… more
- Google (Sunnyvale, CA)
- …and unifying demand and capacity planning across all of Google's infrastructure , including ML, Cloud , and Standard Fleets. + Partner with Google's ML, ... + Experience in developing and implementing large-scale demand and capacity planning systems + Experience with enterprise...demand and capacity across all of Google's infrastructure , encompassing Machine Learning, Cloud , and Standard… more
- NVIDIA (Santa Clara, CA)
- …for large scale AI training and Inferencing platform built on top of cloud infrastructure + Conduct in-depth performance characterization and analysis on large ... looking for an outstanding, passionate, and talented Senior AI Infrastructure Engineer to join our DGX Cloud ...changes to the existing system through careful preparation and planning while managing capacity and performance. NVIDIA's… more
- Google (Sunnyvale, CA)
- …deliver capacity plans and roadmaps. + Define and prioritize infrastructure features with researchers, engineers, and stakeholders across various workstreams. + ... Program Manager (TPM), you will manage and deploy large scale distributed training infrastructure and Hardware (HW) New Product Introductions (NPIs). You will play a… more
- NVIDIA (Santa Clara, CA)
- …developers to make changes to the existing system through careful preparation and planning while keeping an eye on capacity , latency and performance. What ... empowering your team to leverage and contribute to both foundational infrastructure and pioneering AI/ML tools for smarter debugging, automation, knowledge sharing,… more
- NVIDIA (Santa Clara, CA)
- …for the better. You and other engineers on this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range ... NVIDIA is hiring engineers to scale up its AI Infrastructure . We expect you to have a strong programming background, knowledge of datacenter hardware, operations,… more
- Meta (Menlo Park, CA)
- …Qualifications: 17. Experience in developing and implementing large-scale demand and capacity planning systems 18. Experience running large scale program ... **Summary:** The Developer Infrastructure (DevInfra) team at Meta is on a...(TPM) to lead high-impact initiatives focused on scaling and capacity across DevInfra. This role is critical in ensuring… more
- NVIDIA (Santa Clara, CA)
- …for Technical Program Management team to lead a high-impact team within our DGX Cloud Infrastructure organization. You will play a critical role in driving ... innovation powering breakthroughs in research, autonomous vehicles, robotics, and more. The DGX Cloud team builds and operates the AI infrastructure that fuels… more
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- …and system integration. + Engage in long-term planning to advance cloud systems and infrastructure evolution. **To be successful in this position ... within SLAC IT, playing a vital role in designing, deploying, and maintaining cloud infrastructure to enable scalable, secure, and efficient operations. This… more