- Terra Quantum AG (San Francisco, CA)
- …are opened up every day. Hybrid computer systems that combine classic high - performance computing with quantum computers are already being used to ... performance , and scalability best practices. Troubleshoot and optimize system performance , bottlenecks, and data pipelines. Everyday Iterative updates… more
- SingleStore, Inc. (San Francisco, CA)
- …system level and implementation detail) in at least one of: storage scientific or high - performance computing Natural curiosity or drive to learn about new ... initiatives. The historical deployment primarily supported our database test system : A system that runs over 7.25...the design and development of major infrastructure components and systems , focusing on fault tolerance and performance .… more
- IFS (San Francisco, CA)
- …scalable, resilient, and observable by design . If you're passionate about high - performance computing , resilient architecture, and enabling real-time ... pressure. Define and implement metrics, tracing, and observability for end-to-end system behavior and performance . Collaborate closely with infrastructure, SRE,… more
- IFS (San Francisco, CA)
- …scalable, resilient, and observable by design . If you're passionate about high - performance computing , resilient architecture, and enabling real-time ... pressure. Define and implement metrics, tracing, and observability for end-to‑end system behavior and performance . Collaborate closely with infrastructure, SRE,… more
- Hamilton Barnes ? (San Francisco, CA)
- …stacks (Prometheus, Grafana, Loki) and incident response frameworks. Familiarity with high - performance computing (HPC) or AI/ML training infrastructure ... across Slurm and Kubernetes environments. Develop observability, alerting, and auto-healing systems for high -availability GPU workloads. Collaborate with ML,… more
- Menlo Ventures (San Francisco, CA)
- …and sandboxed code execution environments Experience with Kubernetes Experience with distributed systems or high - performance computing Experience with ... and engineering excellence, with a deep commitment to building high -quality, scalable systems that push the boundaries...love to pair!) Care about code quality, testing, and performance Have strong systems design and communication… more
- Anthropic (San Francisco, CA)
- …and sandboxed code execution environments Experience with Kubernetes Experience with distributed systems or high - performance computing Experience with ... at the intersection of cutting-edge research and engineering excellence, focusing on building high -quality, scalable systems that push the boundaries of what AI… more
- GEICO (Palo Alto, CA)
- …Staff Engineer or Tech Lead roles in ML/AI organizations* Background in distributed systems and high - performance computing * Open-source contributions to ... partnerships* Deep understanding of GPU optimization, memory management, and high -throughput systems **Annual Salary**$105,000.00 - $300,000.00The above annual… more
- Hamilton Barnes Associates Limited (San Francisco, CA)
- …stacks (Prometheus, Grafana, Loki) and incident response frameworks. Familiarity with high ‑ performance computing (HPC) or AI/ML training infrastructure ... across Slurm and Kubernetes environments. Develop observability, alerting, and auto-healing systems for high -availability GPU workloads. Collaborate with ML,… more
- Oracle (Seattle, WA)
- …network architectures. Strong experience and detailed technical knowledge in high ‑ performance computing and GPU systems . Other Information Disclaimer: ... customers we're building provisioning, repair, monitoring, maintenance, configuration and validation systems that enable us to deliver high ‑quality GPU clusters… more