- NVIDIA (Santa Clara, CA)
- …efficient, resilient deployment of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and roadmap for memory ... inference. + Architect and implement deep integrations with leading LLM serving engines (such as vLLM, SGLang,...are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Principal Software Engineer , TensorRT- LLM ! NVIDIA is hiring experienced principal software engineer for its TensorRT- LLM ... an "iPhone moment". Join the team building the AI serving software which is foundational to product lines within...in deep learning like LLMs + Experience working with LLM inference frameworks like vLLM, SGLang, etc. + Experience… more
- Palo Alto Networks (Santa Clara, CA)
- …ensuring a formidable security posture from development through runtime. As a Senior Principal Machine Learning Engineer , you will drive research on cutting-edge ... areas, including AI-Native Security ( LLM , AI Agent, Model Supply-Chain, Runtime AI) and the...scalable, low-latency, and resilient ML inference platform capable of serving a diverse range of models for real-time security… more
- NVIDIA (Santa Clara, CA)
- …engineers enthusiastic about building the next generation of scalable AI systems. As a Principal Software Engineer on the Dynamo project, you will address some ... supporting a variety of LLM frameworks (eg, TensorRT- LLM , vLLM, SGLang). + Disaggregated Serving : Architect...are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want… more
- Microsoft Corporation (Mountain View, CA)
- …major Microsoft products, including Office, Windows, Bing, SQL Server, and Dynamics. As a Principal Engineer on the team, you will have the opportunity to work ... performance of OpenAI and other state of the art LLM models and work directly with OpenAI on the...locations considered for very strong candidates. **Responsibilities** As a Principal Software Engineer on the team the… more
- Oracle (Sacramento, CA)
- …Preferred Qualifications - MS in Computer Science. - Experience working with LLM serving frameworks like vLLM, DeepSpeed, or FasterTransformer. - Exposure ... applications and agents that integrate seamlessly with cloud services. Role Summary As a Principal Software Engineer (IC4), you will contribute to the design and… more
- Palo Alto Networks (Santa Clara, CA)
- …while ensuring a formidable security posture from development through runtime. As a Principal Machine Learning Inference Engineer , you will serve as a technical ... scalable, low-latency, and resilient ML inference platform capable of serving a diverse range of models for real-time security...Vision: Act as a key technical liaison to other principal engineers, architects, and product leaders to shape the… more
- Oracle (Sacramento, CA)
- **Job Description** The Senior Principal AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI/ML ... Infra offerings + Design and implement scalable orchestration for serving and training AI/ML models, Model Parallelism & Performance...on generative AI, agents, and inference systems into the LLM software stack. + Lead initiatives in Generative AI… more
- Autodesk (San Francisco, CA)
- …systems, eCommerce personalization engines, and intelligent search capabilities. As a Principal Engineer , you will set technical direction, establish best ... **Job Requisition ID #** 25WD90545 **Position Overview** We are seeking a Principal Data Engineer to provide technical leadership in designing, building, and… more
- Microsoft Corporation (Mountain View, CA)
- **Overview** As a Principal Software Engineer on the Azure Artificial Intelligence Core team at Microsoft, you will design, build, and maintain AI systems that ... Our work spans the entire stack-from API interfaces to inference backends serving AI models-delivering end-to-end solutions that drive innovation at scale. As part… more