- Ally (Raleigh, NC)
- …**The Opportunity** Join Ally's Generative AI journey as a Principal Software Test Engineer on the Ally.ai platform. You'll define the quality strategy and test ... architecture that ensure our LLM-powered APIs and platform services are secure, reliable, and compliant. This role leads the design of automated API, performance,…
- Red Hat (Raleigh, NC)
- …Adoption and Innovation (CAI) team is looking for a **Forward Deployed AI Engineer** to join our rapidly growing AI Business Unit. As inference technologies become ... with the customer, helping them navigate the complexities of LLM inference in their specific clusters. + **Optimization & ... with inference technologies such as KServe, vLLM, and potentially llm-d. + **Functional Python Skills:** You are capable of…
- Insight Global (Alpharetta, GA)
- Job Description We are seeking a highly technical and hands-on Lead AI Engineer to drive the integration of Large Language Model (LLM) capabilities into our ... and execute rigorous, data-driven evaluation methodologies (e.g., A/B testing, LLM-as-a-Judge, human-in-the-loop validation) to quantify the business impact and…
- Amazon (Seattle, WA)
- …cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible ... for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.…
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize ... will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA accelerators, from datacenter GPUs…
- NVIDIA (Santa Clara, CA)
- …Senior High Performance Computing (HPC) and AI Networking Performance Research and Analysis Engineer to join our Performance group. In this exciting role, you will ... on large GPUs and CPUs scale clusters for distributed Deep Learning LLM training focused on collectives communication and networking. You will interact with…
- Palo Alto Networks (Santa Clara, CA)
- …developing multi-tiered applications in a rapidly growing company. As a Principal AI Engineer, you will leverage your extensive experience to act as the trailblazer, ... a Proxy-First "AI Gateway" intermediary layer to ensure seamless LLM vendor independence. + Design and build a unified API endpoint to abstract LLM complexities and provide centralized control for access and…
- Microsoft Corporation (Redmond, WA)
- …Visual Studio, VS Code and beyond. This position is for a **Senior Software Engineer - CoreAI** eager to work on developer scenarios and who is passionate about ... coding language teams, data scientists, and internal and external LLM provider partners across the globe and time zones. ... equivalent experience. + 1+ year of experience with AI LLM models, such as OpenAI, Azure AI, ML **Other…
- GE Vernova (Niskayuna, NY)
- **Job Description Summary** As a Sr. Staff Software Engineer, you'll lead enterprise full stack and AI platform architecture at scale for multiple initiatives across ... ship microservices and REST APIs, build RAG and multimodal LLM features, and set standards for AI-assisted development. You'll ... REST APIs and microservices. + Build RAG and multimodal LLM features. + Design agentic workflows for autonomous tasks.…
- Oracle (San Juan, PR)
- …that integrate seamlessly with cloud services. Role Summary As a Principal Software Engineer (IC4), you will contribute to the design and implementation of scalable, ... Preferred Qualifications - MS in Computer Science. - Experience working with LLM serving frameworks like vLLM, DeepSpeed, or FasterTransformer. - Exposure to…