- Walmart (Sunnyvale, CA)
- …do:** + Lead product strategy for Wally's LLM reasoning and decisioning layer for improved contextual understanding, multi-turn memory, and dynamic tool invocation. ... + Develop the integration strategy between Wally Core LLM and Squiggly, establishing a shared orchestration and reasoning fabric with seamless hand-offs + Partner with merch sub-agent PMs to design cross-agent execution + Lead roadmap for multi-tenant… more
- Microsoft Corporation (Mountain View, CA)
- …an AI Networking Engineer, you'll shape the end-to-end networking architecture, link- layer to fabric-wide systems for hyperscale AI training clusters. design, bring ... up, and scale the distributed Ethernet and InfiniBand fabrics that connect hundreds of thousands of GPUs across multi-megawatt data halls. You'll benchmark, profile, debug and tune the training and inference of AI workloads running in the production clusters.… more
- Microsoft Corporation (Mountain View, CA)
- …would be responsible for designing and building our compute orchestration and scheduling layer on top of Kubernetes and Ray, working on everything from workload ... placement and scaling to reliability and developer experience. You'll work closely with research and framework teams to turn their requirements into scalable abstractions, improve cluster efficiency, and ensure our compute platform is observable, and easy to… more
- NVIDIA (Santa Clara, CA)
- …with networking control plane CPU subsystems, PCIE, I2C, PSUs, SMBus, PHY Layer technologies and hardware platform bringups. + Excellent knowledge of Linux systems ... administration, Linux internals and tools. + Experience driving projects from concept to production + Excellent written and verbal communication and interpersonal skills. Comfortable articulating value propositions to customers and influencing internal teams +… more
- General Atomics (Poway, CA)
- …the network to the physical including the medium access controller (MAC) layer + Demonstrates a broad understanding of engineering principles, concepts, theory, and ... practice with the ability to organize, plan, schedule, conduct, and coordinate workloads to meet established deadlines or milestones with some experience in project leadership. + Must be able to understand and apply new concepts quickly in a fast-paced,… more
- Google (Sunnyvale, CA)
- …hardware. You will work on a vertically integrated system, contributing to every layer of the stack. Your work will include developing services in Go/Python to ... automate the entire life cycle of our hardware testbeds, from initial provisioning to ongoing fleet management. You will build automation for OS installation and software deployment using custom installers like Subiquity and will solve unique engineering… more
- Oracle (Sacramento, CA)
- …data center networking concepts. + Demonstrated experience with network or transport- layer features and best practices. + Proficiency in systems programming ... languages such as C, C++, Go, or Rust, and experience with modern network monitoring and diagnostic tools. + Proven ability to analyze, troubleshoot, and optimize network performance at scale. + Excellent collaboration and communication skills; able to work… more
- Amazon (Cupertino, CA)
- …Solve challenging technical problems, often ones not solved before, at every layer of the stack. Design, implement, test, deploy and maintain innovative software ... solutions to transform service performance, durability, cost, and security. Research implementations that deliver the best possible experiences for customers. A day in the life As you design and code solutions to help our team drive efficiencies in software… more
- Herbalife (Torrance, CA)
- …teams to ensure reliability, scalability, and cost-efficiency are built into every layer of the stack. * Mentor and influence engineering teams to adopt ... modern SRE practices and drive a culture of operational excellence. **WHAT'S SPECIAL ABOUT THE TEAM:** The SRE team is evolving to expand its scope beyond traditional operations, embedding observability, automation, and cloud-native practices across… more
- Leidos (San Diego, CA)
- …(ATDD). Experience with Behavior Driven Development (BDD). Secure Software development (ie, Layer 7 Policy). Experience with the Scrum, Scaled Agile Framework (SAFe) ... methodology, SAFe Agilest Certification, or experience as a member of an Agile team. If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo - because the mission demands it. We're not hiring followers. We're… more