- Google (Sunnyvale, CA)
- …+ 2 years of experience with developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage or hardware ... and mobile; the list goes on and is growing every day. As a software engineer , you will work on a specific project critical to Google's needs with opportunities to… more
- Meta (Menlo Park, CA)
- …are seeking for engineers to work on the space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - AI Networking ... SW stacks around NCCL and PyTorch to improve the full-stack distributed ML reliability and performance (eg Large-Scale GenAI/LLM training) from the trainer down to… more
- Palo Alto Networks (Santa Clara, CA)
- …and application development. We are looking for a Senior Staff IT Data Engineer with extensive experience in Data engineering, SQL, Cloud engineering and business ... and implementing GenAI-driven solutions to achieve measurable improvements in the reliability and performance of data pipelines or to optimize key processes… more
- NVIDIA (Santa Clara, CA)
- …yield, and quality to define groundbreaking products as a product definition engineer for NVIDIA's family of chips and products. + Architect crucial next-generation ... chip and board designers, software/firmware engineers, HW/SW applications engineering, process/ reliability specialists, ATE engineers, product managers, sales, and operations… more
- Amazon (San Francisco, CA)
- …language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience ... for the app architecture, developer onboarding, mobile app releases, reliability , and ensuring production issues gets routed to the...and resolved in a timely manner. As an Android Engineer , you will be developing and expanding the next… more
- Google (Sunnyvale, CA)
- …Physics, or a related field. + Experience with Linux or other Unix operating systems and shell scripts. + Experience in more than one discipline related to hardware ... datasheets, and written work instructions. + Understanding of computer systems : physical, functional, logical, mechanical, electrical, software, thermal etc. +… more
- Microsoft Corporation (Mountain View, CA)
- …all data inputs and outputs. + Design and maintain comprehensive monitoring and alerting systems to ensure the reliability and performance of data pipelines and ... in revenue annually. We are seeking a highly skilled and experienced **Principal Software Engineer ** to join our team in Mountain View, CA or Redmond, WA. In this… more
- Google (Sunnyvale, CA)
- …and mobile; the list goes on and is growing every day. As a software engineer , you will work on a specific project critical to Google's needs with opportunities to ... analysis of ML workloads such as Gemini on future TPU systems , evaluate performance/cost trade-offs of Hardware features and Software optimization techniques… more
- Lenovo (San Jose, CA)
- …Architecture:** Design and maintain the overall architecture of our AI systems , ensuring scalability, reliability , and performance. + **Model Expertise:** ... Principal Engineer , AI Architecture **General Information** Req # WD00086176...lead the design and implementation of our next-generation AI systems . This is a pivotal role responsible for the… more
- Walmart (Sunnyvale, CA)
- …Summary ** **What you'll do ** **About the Role:** We're seeking a **Staff Software Engineer ** to lead the design and evolution of our backend systems built on ... cross-functional teams, solving complex architectural problems, and ensuring scalability, reliability , and developer efficiency across the platform. **Key Responsibilities:**… more