- Amazon (Cupertino, CA)
- …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and… more
- Amazon (Cupertino, CA)
- …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and… more
- Amazon (Cupertino, CA)
- …and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to ... and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a… more
- Amazon (Sunnyvale, CA)
- Description The Sensory Inference team at AGI is a group of innovative developers working on groundbreaking multi-modal inference solutions that revolutionize ... interact with the world. We push the limits of inference performance to provide the best possible experience for...2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
- Amazon (Palo Alto, CA)
- …the future of advertising. Key job responsibilities As a Software Development Engineer in Machine Learning, you will: * Enhance the scalability, automation, and ... efficiency of large-scale training and real-time inference systems. * Pioneer the development of LLM ...for action. We are looking for a talented Software Engineer with a strong background in machine learning engineering… more
- The Walt Disney Company (Nicasio, CA)
- …a related field. Master's Degree is preferred + 5+ years of experience in DevOps, Site Reliability Engineering, or a related role, with at least 2+ years ... The Skywalker Sound Development Group is seeking a highly skilled Sr ML Ops Engineer to build and maintain the infrastructure powering our machine learning and AI… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual contributor role ... clusters, storage solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + Cloud AI Service Integration:… more
- PennyMac (Westlake Village, CA)
- …equivalent experience). + 5+ years of experience in a Platform Engineering, DevOps or Site Reliability Engineering (SRE) role. + 1+ year(s) of experience with AI ... through the complete mortgage journey. A Typical Day The Senior AI Platform Engineer will: + Design, implement, and manage scalable and resilient infrastructure on… more
- Amazon (San Francisco, CA)
- …breakthrough foundation models run at production scale. As a Software Development Engineer embedded in our science team, you'll be instrumental in transforming novel ... applications, leveraging your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll balance deep… more
- Amazon (San Francisco, CA)
- …foundation models run at production scale. As a Senior Machine Learning Engineer embedded in our science team, you'll be instrumental in transforming cutting-edge ... your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more