- NVIDIA (Santa Clara, CA)
- …expertise in different domains, such as storage architecture, high-performance distributed storage, data management, systems, networking, coding, database ... storage solutions, optimizing data placement and access patterns, managing large-scale distributed storage systems, and ensuring low-latency data access for…
- MongoDB (Denver, CO)
- …tools like Grafana, Prometheus, Kibana, etc + Knowledge of infrastructure deployments for distributed systems such as Cloud Providers (AWS, GCP, Azure) and ... guiding our customers and users to design and build reliable, scalable systems using our data platform....teams + Experience designing with scalable and highly available distributed systems in the cloud and on-prem…
- Amazon (Sunnyvale, CA)
- …of computer science fundamentals, and practical experience building efficient large-scale distributed systems. This person is comfortable delivering quality ... that manage and orchestrate the IR system infrastructure, including the distributed compute clusters, storage systems, and networking components. * Design…
- Palo Alto Networks (Santa Clara, CA)
- … (Git), and modern DevOps practices. + Experience testing complex, distributed systems and microservices architectures. + Excellent analytical, debugging, ... a senior, hands-on technical role focused on driving our AI-First quality strategy. You will be instrumental in architecting,...the development process and ensure the delivery of high-quality, reliable products. + Good hold on DevOps & non-prod…
- Microsoft Corporation (Redmond, WA)
- …implement long-term fixes. + Mentor and guide engineers by sharing expertise in distributed systems, fostering technical growth, and promoting a culture of ... you will lead from the front in driving an AI-first culture, leveraging AI technologies to improve...experience. + 3+ years of experience developing high scale, distributed systems on a cloud platform. +…
- Coinbase (Des Moines, IA)
- …managers across North America, LATAM, and India, guiding them to deliver high-quality, reliable, and intelligent systems that support over 100 million customers ... In this role, you'll oversee Coinbase's both custom-built and SaaS-based AI-powered customer self-service and automation capabilities, as well as applications…
- MongoDB (Raleigh, NC)
- …As an SRE on the Fabric team, you will leverage your expertise in networking, distributed systems, and automation to ensure our systems are resilient, ... plays a crucial role in developing and maintaining the reliable and globally connected multi-cloud network that supports MongoDB...6+ years of experience working on software and operating distributed systems, with deep expertise in networking…
- DoorDash (San Francisco, CA)
- …the bar of technical excellence. + Operate hands-on: Dig into complex distributed systems challenges, from scaling infrastructure to optimizing end-to-end ... tools that help them solve problems faster. By combining reliable platforms with the latest AI, we're...large-scale, high-impact systems. + Deep expertise in distributed systems, real-time data pipelines, or large-scale…
- Meta (Menlo Park, CA)
- …leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI/GPU communication stack. ... learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure, high performance computing, performance optimizations, or…
- DoorDash (San Francisco, CA)
- …expertise in IT controls, ITGC, data governance, cybersecurity, cloud architecture, systems implementation, and emerging tech risks (AI/ML, privacy, automation). ... a broad portfolio of audits, including IT SOX, cybersecurity, data governance, AI governance, and operational technology. You will report directly to the Chief…