- General Motors (Sunnyvale, CA)
- …experience. This is a hands-on engineering role that requires a strong background in distributed systems , infrastructure, and a product mindset with a keen eye ... **Job Description** **Staff ML Engineer , ML Compute Platform** **About the Team:** The...kubernetes at scale + Relevant experience building large-scale with distributed systems + Experience leading and driving… more
- The Walt Disney Company (Nicasio, CA)
- …project requirements and supports iterative experimentation. + Manage compute resources ( cloud and on-premises) to enable large-scale distributed training and ... Group is seeking a highly skilled Sr ML Ops Engineer to build and maintain the infrastructure powering our...research to production. + Implement robust monitoring and logging systems to track model performance and identify potential issues… more
- NVIDIA (Santa Clara, CA)
- …building, search and chatbots + Proven expertise of performance, reliability in sophisticated distributed systems and the teams that build them. + Strong ... generation AI platforms and products that improve business efficiency and productivity. This engineer is expected to be familiar with concepts of RAG, agentic AI to… more
- Oracle (Pleasanton, CA)
- …Do?** + Design, develop, and maintain scalable, resilient, and highly available data systems . + Build large-scale distributed data pipelines that power automated ... of Oracle Fusion Analytics Warehouse. Purpose-built for Oracle Fusion Cloud Applications, bringing together business data, ready-to-use analytics, and...We are looking for a passionate and skilled Data Engineer to join the FDI team. In this role,… more
- Google (Sunnyvale, CA)
- …key teams working on the development of our TPUs, Vertex AI for Google Cloud , Google Global Networking, Data Center operations, systems research, and much more. ... Staff Software Engineer , AI/ML Infrastructure _corporate_fare_ Google _place_ Kirkland, WA,...systems for standalone deployment. + Build and integrate Cloud Compute software to bootstrap TPU AI Infrastructure. +… more
- Google (Sunnyvale, CA)
- …key teams working on the development of our TPUs, Vertex AI for Google Cloud , Google Global Networking, Data Center operations, systems research, and much more. ... Software Engineer III, Performance, AI Platforms _corporate_fare_ Google _place_...languages. + 2 years of experience with performance, large-scale systems data analysis, visualization tools, or debugging. + 2… more
- Amazon (East Palo Alto, CA)
- …base. You'll bring a passion for innovation, data, search, analytics, and distributed systems . You'll also: Solve challenging technical problems, often ones ... Description Amazon Aurora DSQL is a serverless, distributed SQL database with virtually unlimited scale, highest availability, and zero infrastructure management.… more
- Meta (Menlo Park, CA)
- …infrastructure for AI/ML (GenAI, LLMs, multimodal models), Distributed storage systems , data lakes, or cloud object stores, High-performance data pipelines ... across the organization, fostering best practices in large-scale data engineering and distributed systems 7. Stay abreast of industry and Meta-wide trends… more
- Walmart (Sunnyvale, CA)
- …deployment automation tools (Docker, Kubernetes, Helm, Nomad, etc.) + Extensive knowledge in Distributed Systems and the challenges that come with them + Strong ... and physical. We are looking for a Staff Software Engineer on our Streaming and Messaging Systems ...to remove the complexity of provisioning, operating, and maintaining cloud -based streaming services by providing an automated, lights- out,… more
- General Motors (Sacramento, CA)
- …+ Building tools to enable engineers to collect and act on observability signals from distributed cloud systems and on-vehicle sensors + Influence the team's ... systems (eg. Kubernetes) + Proficient in designing and developing sophisticated distributed systems , with expertise in one or more high-level programming… more