- NVIDIA (Santa Clara, CA)
- …AI/ML datacenters with NVIDIA GB200, and upcoming GB300 GPUs. NVIDIA seeks a Senior Software Engineer for our CSP (Cloud Service Provider) Engagements team ... + Drive joint architecture reviews and "whiteboard" sessions with CSP and internal platform teams; convert findings into RFCs and upstream pull requests. + Create…
- NVIDIA (Santa Clara, CA)
- …Drive platform stability and performance through strategic integration of observability frameworks. + Own technical strategy for broad and complex challenges. ... databases, including aspects pertaining to query optimization, scaling strategies and observability metrics. + Strong fundamentals in file handling and scripting for…
- Zoom (San Jose, CA)
- …end to end, partner with product, frontend and SRE, improve performance, security and observability, use data to resolve issues. About the team We are the Zoom ... via CI/CD in the cloud. + Guiding performance tuning, monitoring, and observability for Spring-based services. + Writing automated tests and maintaining high code…
- LinkedIn (Mountain View, CA)
- …community while making a real impact within our company. As a Sr. Staff Software Engineer, you will be a key technical leader and role model within the organization. ... transformation, collaboration and results driving key technology adoption and seamless platform migrations. You will work closely with technical leadership and…
- Coinbase (Sacramento, CA)
- …that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're ... experiments into enterprise-grade tools, making AI an indispensable partner for every engineer. *What you'll do:* * Build cutting-edge infrastructure and tools that…
- NVIDIA (Santa Clara, CA)
- …can make a lasting impact on the world. Join as a Data Engineer Cloud Gaming to develop and maintain Kubernetes-based GPU-accelerated data processing services, ... optimizing query response times and driving platform engagement with innovative solutions. What you'll be doing: + Optimize distributed computing infrastructure by…
- NVIDIA (Santa Clara, CA)
- …and troubleshooting of compute hardware and networking equipment. As a software engineer, you will work with other software engineers, product architects, and ... with the broad architectural vision for the NVIDIA Cloud Platform, working with other teams to develop a robust ... (mutual-TLS, IPsec, or similar). + Knowledge of SRE principles (observability, SLOs, logging, etc.) Ways to stand out from…
- Rubrik (Sacramento, CA)
- …and reliability goals * Manage and streamline monitoring systems to enhance observability and enable proactive identification of issues. * Coordinate and manage ... data management best practices and strong experience in any logging and/or SIEM platform * Experience with Vault, Terraform, Puppet, Jenkins and GitHub * Proficiency…
- LiveRamp (San Francisco, CA)
- **LiveRamp is the data collaboration platform of choice for the world's most innovative companies. A groundbreaking leader in consumer privacy, data ethics, and ... Go programming language.** + **Experience with SRE best practices, working knowledge of observability principles is a big plus** + **Ability to lead and mentor other…**
- NVIDIA (Santa Clara, CA)
- …compute infrastructure and codify reliability best practices in the broader DGX Cloud platform ecosystem. What you'll be doing: + Design, build, and run cloud ... internal-facing service level objectives and error budgets as part of our overall observability strategy. + Eliminate toil or automate it where the ROI of building…