- Oracle (San Juan, PR)
- …the AI/ML Stack + Explore and incorporate contemporary research on generative AI, agents, and inference systems into the LLM software stack. + Lead ... initiatives in Generative AI systems design, including... + Expertise in orchestrating, running, and optimizing large-scale distributed training/inference workloads + Have deep understanding of AI…
- NVIDIA (Santa Clara, CA)
- …+ Research and Development: Explore and incorporate contemporary research on generative AI, agents, and inference systems into the NVIDIA LLM software ... + Experience in building large-scale LLM inference systems, especially those involving compound AI. + Experience with processor and system-level performance modeling.…
- Amazon (Seattle, WA)
- …of applied scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your work ... ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement... expertise to push the boundaries of what's possible in AI acceleration. As part of the broader Neuron organization,…
- Amazon (Seattle, WA)
- …of applied scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your work ... ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement... expertise to push the boundaries of what's possible in AI acceleration. As part of the broader Neuron organization,…
- Amazon (Cupertino, CA)
- …Key job responsibilities * Architect and lead the design of distributed ML serving systems optimized for generative AI workloads * Drive technical excellence ... and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to…
- Amazon (Seattle, WA)
- …Inference Technology building blocks team, you will guide your expert AI engineers to build fundamental inference technology building blocks and libraries ... to enable AI developers to optimize model for inference on Trainium and Inferentia devices. We're currently focusing on MoE models such as GPT OSS for Trainium 2…
- NVIDIA (Santa Clara, CA)
- …training and inference frameworks. + Hands-on experience training or fine-tuning generative AI models on large-scale GPU clusters. + Proficient in GPU ... NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model... NVIDIA platform integration and expand market adoption across the AI inference ecosystem. What we need to…
- NVIDIA (Santa Clara, CA)
- …most sophisticated AI systems - from large language models to multimodal generative AI - all accelerated on NVIDIA GPUs. The Deep Learning Inference ... and optimization of large-scale models for LLM, multimodal, and generative AI applications. + Guide engineers in... passionate technical leader ready to shape the future of AI inference frameworks - and build the…
- Microsoft Corporation (Mountain View, CA)
- …programming. + Experience benchmarking, profiling, and optimizing PyTorch generative AI models. + Experience with open source inference frameworks like ... great match for you if you: + Understand modern generative AI architectures and how to optimize... frontier AI research ideas. + Introduce new systems, tools, and techniques to improve model inference…
- GE Vernova (Niskayuna, NY)
- …and delivering scalable, innovative AI solutions using cutting-edge generative AI models. This role entails architecting systems that leverage advanced ... + Architect and oversee the development of robust, scalable systems for deploying generative AI ... inference, and monitoring in production environments. + Ensure systems meet high standards for performance, scalability, and security…