- Bosch (Sunnyvale, CA)
- …journals such as CVPR, ICRA, IROS, RSS, NeurIPS and CoRL. **Job Description** As the Distributed Embodied AI Systems intern, you will perform research on ... that take advantage of technologies in the field of reliable distributed computing. We work with internal...future prediction for latency mitigation in distributed embodied AI systems. A…
- Cisco (San Jose, CA)
- …platforms, such as AWS, Azure, or Google Cloud. + Understanding of distributed systems concepts, including scalability, reliability, fault tolerance, and data ... Team** Our dedicated team members are building the future of Cisco's AI-driven platforms and data infrastructure, supporting innovation across the globe. You will…
- NVIDIA (Santa Clara, CA)
- …and inference more reliable, scalable, and efficient. If you're passionate about AI, distributed systems, and high-performance computing, we want to hear ... driving down cluster downtime towards zero, ensuring that our AI systems remain robust and reliable...detection. + Hands-On Coding & Optimization: Contribute to large-scale distributed systems with high-quality, production-level C++ and…
- Oracle (Sacramento, CA)
- …learning, LLM applications, and agentic AI. Our team builds real-world AI systems and deploys scalable, production-ready solutions across Oracle's enterprise ... engineer to contribute to the design and deployment of advanced AI systems, including LLM-powered agents, Retrieval-Augmented Generation (RAG) pipelines,…
- Oracle (Redwood City, CA)
- …data architectures (data mesh, lakehouse, etc.). + Expertise in **data modeling, distributed systems, and performance optimization.** + Proven ability to ... you ready to shape the future of intelligent data systems? We're seeking an **AI and Data...Collaborate with engineers, product teams, and researchers to build systems that are **reliable, scalable, and production-ready.**…
- Oracle (Sacramento, CA)
- …. This is a highly technical, hands-on role where you'll build large-scale distributed systems, optimize AI/ML workflows, and collaborate with ... observability, CI/CD pipelines, and operational excellence. Troubleshoot complex issues in distributed systems and participate in on-call rotations as needed.…
- Oracle (Sacramento, CA)
- …Work closely with a collaborative and experienced global team. - Expand your knowledge in AI, cloud computing, and distributed systems. - Contribute to one ... tools to operationalize Large Language Models (LLMs) and agentic AI systems. Our goal is to empower...will contribute to the design and implementation of scalable, distributed systems that serve LLMs and support…
- Charles Schwab (San Francisco, CA)
- …+ Champion reliability, monitoring, observability, and operational best practices for AI systems and data pipelines. + Collaborate with cross-functional ... in the development process. You will ensure that the systems we build are robust, reliable, and...troubleshoot complex problems with ambiguous or incomplete data in distributed systems. + Curiosity about new technologies…
- Walmart (Sunnyvale, CA)
- …build dynamic, context-aware systems. 2. **Architecture & Scalability:** + Architect scalable, distributed AI systems with a focus on performance, fault ... to lead the design, development, and deployment of advanced AI systems. This role involves architecting scalable...Walmart GTP, you will be building highly scalable and reliable APIs, services and applications which will drive the…
- Charles Schwab (San Francisco, CA)
- …bring curiosity, creativity, and technical depth to help shape the next generation of reliable AI at Schwab. **What you have** **Required Qualifications** + 8+ ... you will play a key role in ensuring our AI solutions are reliable, scalable, and resilient, enabling...Experience implementing monitoring, alerting, and incident response for large-scale distributed systems. + Proven track record in…