- Microsoft Corporation (Mountain View, CA)
- **Overview** The Artificial Intelligence Performance team at Microsoft develops AI software that enables running AI models everywhere, from world's fastest AI ... on a collaborative and inclusive culture. We own inference performance of OpenAI and other state of the art...Bing, SQL Server, and Dynamics. As a Senior Software Engineer on the team, you will have the opportunity… more
- NVIDIA (Santa Clara, CA)
- …AI inference frameworks (eg, vLLM, TensorRT-LLM, SGLang). + Experience with GPU resource scheduling, cache management, or high- performance networking. + ... scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture,… more
- NVIDIA (Santa Clara, CA)
- …and reasoning models across multi-node distributed environments. Built in Rust for performance and Python for extensibility, Dynamo orchestrates GPU shards, ... outgrow the memory and compute budget of any single GPU , this platform enables efficient, resilient deployment of cutting-edge...cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and roadmap for memory… more
- NVIDIA (Santa Clara, CA)
- …NVIDIA AI and HPC software stack. We are searching for a highly motivated engineer to lead performance benchmarking and optimization efforts for our data center ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...instrumental in ensuring our data center solutions deliver industry-leading performance for accelerated computing workloads. What you will be… more
- NVIDIA (Santa Clara, CA)
- …NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research and Analysis Engineer to join our Performance group. ... and interact with people are no longer science fiction. GPU Deep Learning has provided the foundation for machines...HCAs, Switches, CPUs, GPUs, and Systems. You will develop performance analysis tools and methodologies to dive deeply into… more
- NVIDIA (Santa Clara, CA)
- …define the next era of computing. An era in which our tightly coupled CPU, GPU and DPU technology acts as the brains of computers, robots, and self-driving cars that ... impact on the world! NVIDIA is searching for a highly motivated, technical engineer to join the Tegra system-on-chip (SoC) software organization. You will work on… more
- NVIDIA (Santa Clara, CA)
- …real-time sensor, imaging, and multimodal data processing-balancing developer usability with peak GPU performance . + Prototype GPU -accelerated algorithms for ... framework for sensor AI, enabling developers to build, optimize, and deploy GPU -accelerated pipelines that process multimodal sensor data in real time. Originally… more
- NVIDIA (Santa Clara, CA)
- … GPU and driving breakthroughs in gaming, computer graphics, high- performance computing, and artificial intelligence. Our technology powers everything from ... and build the next-generation observability platform for large-scale AI workloads, GPU clusters, and high- performance computing environments. This role blends… more
- LinkedIn (Mountain View, CA)
- …product and infrastructure engineers to optimize their models and deliver the best performance possible. As a Senior Software Engineer , you will have first-hand ... large language models, to computer vision models. We optimize performance across algorithms, AI frameworks, data infra, compute software,...software, and hardware to harness the power of our GPU fleet with thousands of latest GPU … more
- LinkedIn (Mountain View, CA)
- …product and infrastructure engineers to optimize their models and deliver the best performance possible. As a Software Engineer , you will have first-hand ... large language models, to computer vision models. We optimize performance across algorithms, AI frameworks, data infra, compute software,...software, and hardware to harness the power of our GPU fleet with thousands of latest GPU … more