- NVIDIA (Santa Clara, CA)
- …data processing and model post-training. + Deep understanding of distributed systems for large- scale model inference and serving. Your base salary will be ... efficient serving of LLMs, VLMs, and WFMs at datacenter scale , leveraging technologies like Dynamo. + Collaborate with research...the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for… more
- NVIDIA (Santa Clara, CA)
- …era of machine learning innovation. In this role, you will architect, scale , and optimize high-performance ML infrastructure used across NVIDIA's AI research and ... maintain scalable ML platforms and infrastructure for training and inference on large- scale , distributed GPU clusters. + Develop internal tools and automation for ML… more
- NVIDIA (Santa Clara, CA)
- …be doing: + Provide leadership and strategic mentorship on the management of large- scale HPC systems including the deployment of compute, networking, and storage. + ... to accelerate researchers' velocity, troubleshooting, and software performance at scale . What we need to see: + Bachelor's degree...If you're a tech enthusiast, apply now! Your base salary will be determined based on your location, experience,… more
- NVIDIA (Santa Clara, CA)
- …Engineer to design, deploy, and manage high speed storage offering in our large- scale GPU clusters. These clusters will power AI workloads across multiple teams and ... Infrastructure teams to ensure our GPU clusters perform efficiently, scale well, and remain reliable. The ideal candidate has...technology, we want to hear from you. Your base salary will be determined based on your location, experience,… more
- NVIDIA (Santa Clara, CA)
- …Contribute to product roadmap decisions by synthesizing findings from large- scale model training and inference environments. Identify cross-industry patterns and ... + Extensive experience working with or developing platforms that facilitate large- scale AI/ML training and inference workloads. This includes distributed systems,… more
- Palo Alto Networks (Santa Clara, CA)
- …service architecture. **Your Impact** + Lead a team of engineers building high- scale backend services, deployed in Kubernetes and leveraging GCP's data and compute ... SRE teams to ensure performance, reliability, and security at scale + Encourage engineering best practices in CI/CD, observability,...an offer at the posted level, the starting base salary (for non-sales roles) or base salary … more
- NVIDIA (Santa Clara, CA)
- …necessary frameworks. Your responsibilities will include defining and implementing large- scale telemetry pipelines, ensuring data integrity, and designing simulation ... to defining the architecture for next-generation monitoring and analytics solutions for large- scale Data Centers. + Expand the capabilities of existing solutions and… more
- DoorDash (San Francisco, CA)
- …database agnostic abstractions. In this role, you will design, optimize, and scale distributed data access layers that power DoorDash's most critical systems, ... + Lead data modeling, performance tuning, and capacity planning for large- scale , mission-critical storage workloads. + Partner with product engineering and… more
- NVIDIA (Santa Clara, CA)
- …are purpose-built to process data, refine it into models, and produce tokens with scale and efficiency. In the AI industrial revolution, data is the raw material, ... integrates NVIDIA's GPUs, CPUs and Networking platforms to deliver unmatched data center- scale performance. What you'll be doing: + Lead execution and technical… more
- NVIDIA (Santa Clara, CA)
- …language models, and their application in agentic and reasoning use cases. As the scale and complexity of these LLM systems continues to increase, we are seeking ... seamlessly integrating improvements to ensure NVIDIA's solutions can efficiently handle large- scale , sophisticated tasks. What you'll be doing: + Research and… more