• Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …or public cloud system in Production + 8+ years experience delivering foundational infrastructure and observability platforms. + Experience in one or more of ... production systems with high efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline… more
    NVIDIA (11/01/25)
    - Related Jobs
  • Senior Engineering Manager, ML Optimization Tools…

    Google (Sunnyvale, CA)
    …ease of use. **About the job** Like Google's own ambitions, the work of a Software Engineer goes beyond just Search. Software Engineering Managers have not ... infrastructure has dramatically boosted the productivity of Google Software Engineers. We are at the beginning of that...developers. The primary focus of our ML Performance and Observability Services team is to build the infrastructure more
    Google (10/25/25)
    - Related Jobs
  • Senior EDA Observability Architect

    NVIDIA (Santa Clara, CA)
    NVIDIA's Hardware Infrastructure organization is seeking a Senior EDA Observability Engineer to help architect and implement distributed observability ... Collaborate with HW, and SW engineering teams to deliver observability solutions that meet their needs in EDA clusters.... technologies, and GPU technology. + Prior experience in infrastructure software , production application software more
    NVIDIA (11/04/25)
    - Related Jobs
  • Infrastructure Software

    Amazon (Pasadena, CA)
    …on a mission to develop a fault-tolerant quantum computer. We are looking to hire an Infrastructure Software Engineer to join our growing software team. ... in initiating and driving projects to completion. As an Infrastructure Software Engineer , this role...offered by our team. - Implement best practices for observability and monitoring to enable rapid debugging and high… more
    Amazon (10/22/25)
    - Related Jobs
  • Senior DGX Cloud AI Infrastructure

    NVIDIA (Santa Clara, CA)
    …with the necessary resources and scale to foster innovation. We are seeking an AI infrastructure software engineer to join our team. You'll be instrumental ... of AI systems. As a senior DGX Cloud AI Infrastructure software engineer at NVIDIA,...+ Experience with AI training and inferencing and data infrastructure services. + Familiar in operating large-scale observability more
    NVIDIA (11/01/25)
    - Related Jobs
  • Software Engineer , Systems…

    Matroid (Palo Alto, CA)
    …industrial IoT, government and security. We're looking for a talented Software Engineer to help develop the systems & infrastructure that powers Matroid's ... and automation + Apply modern best practices such as infrastructure -as-code and observability How you'll be doing...Bonus points if + Significant work experience as a software engineer in backend infrastructure more
    Matroid (09/07/25)
    - Related Jobs
  • Senior AI Infrastructure Software

    NVIDIA (Santa Clara, CA)
    …developing scalable AI infrastructure services globally. We are seeking an AI infrastructure software engineer to join our team. You'll be instrumental ... AI in production. As a senior DGX Cloud AI Infrastructure software engineer at NVIDIA,... services. + Familiar in Kubernetes and operating large-scale observability platforms for monitoring and logging (eg, ELK, Prometheus,… more
    NVIDIA (11/01/25)
    - Related Jobs
  • Principal Staff Software Engineer

    LinkedIn (Mountain View, CA)
    …and scalability while reducing complexity and operational cost. As a Principal Staff Engineer in Service Infrastructure , you will be the primary domain expert ... or equivalent practical experience. + 7+ years of industry experience in software design, distributed systems, or infrastructure engineering. + 7+ years… more
    LinkedIn (10/08/25)
    - Related Jobs
  • Senior Systems Software Engineer

    NVIDIA (Santa Clara, CA)
    …center of this revolution. We are seeking a motivated Senior Systems Software Engineer to join our AV Infrastructure organization and become a key driver in ... design, and implement distributed infrastructure solutions to support AV software builds, large-scale simulation testing, and real-time observability . +… more
    NVIDIA (09/11/25)
    - Related Jobs
  • Sr. Staff Software Engineer

    LinkedIn (Mountain View, CA)
    …do this with a focus on performance, security, and reliability. As a Sr. Staff Software Engineer , you will fill the mission-critical role of ensuring that our ... CA. LinkedIn's Edge Engineering team builds and operates the infrastructure that resolves, routes, and delivers all traffic between...of the edge domain, and will also believe that software automation is a key component to operating large-scale… more
    LinkedIn (10/16/25)
    - Related Jobs