• Distinguished Engineer

    NVIDIA (Santa Clara, CA)
    …at NVIDIA, you will own the development of DGX Cloud strategy for observability, monitoring, and remediation across all layers of infrastructure, IaaS, platforms ... define and drive the technical implementation for DGX Cloud offerings in the observability, monitoring, and remediation practice. + Collaborate on Cross Domain…
    NVIDIA (08/25/25)
  • Senior Staff Software Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a skilled and motivated Sr. Staff Software Engineer to join our dynamic team. You will contribute to innovative solutions and optimize our ... operations using brand new technology. This role offers an outstanding opportunity to work with groundbreaking technologies and contribute to NVIDIA's innovation. The ideal candidate will have extensive experience understanding complex engineering challenges,…
    NVIDIA (09/25/25)
  • Senior Staff Site Reliability Engineer

    Palo Alto Networks (Santa Clara, CA)
    …that such a role provides, you will have a deep knowledge of modern observability and monitoring tools and practices, having managed high cardinality metrics, ... Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: Utilize your expertise in monitoring cloud platforms, particularly GCP, to optimize our infrastructure,…
    Palo Alto Networks (07/15/25)
  • JR- Senior Software Engineer - Site…

    General Motors (Sunnyvale, CA)
    …live and deliver a better future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements of the infrastructure health and ... reliability monitoring for GM's commercial fleet. We are an innovation...You'll Do** + Implement scalable, reliable, secure SRE and Observability platform to monitor health of our production system…
    General Motors (09/13/25)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …Design, implement and support operational and reliability aspects of large scale Observability & Telemetry collection platform with a focus on performance at scale, ... real time monitoring, logging and alerting + Engage in and improve...Production + 8+ years experience delivering foundational infrastructure and observability platforms. + Experience in one or more of…
    NVIDIA (08/02/25)
  • Principal DevOps Engineer (Cortex…

    Palo Alto Networks (Santa Clara, CA)
    engineer who is passionate about automation, cloud infrastructure, observability, and continuous integration/deployment. You will contribute to the evolution of ... of XSIAM, XSOAR, and XPANSE. As a Senior DevOps Engineer, you will be responsible for designing, building, and...(GKE preferred) and Docker, ensuring resilience and security + Monitoring & Performance Optimization - Implement monitoring,…
    Palo Alto Networks (08/28/25)
  • Senior Site Reliability Engineer

    Eliassen Group (Concord, CA)
    …or similar service mesh technologies, including gateway configuration, traffic routing, and observability. . Monitoring & Observability: Design and implement ... Monitoring Tools: Proficiency with AppDynamics, Splunk Cloud, Splunk Observability, Prometheus, Grafana, or similar. . Distributed Tracing: Experience using…
    Eliassen Group (09/09/25)
  • Staff Enterprise Engineer - IT Service…

    LinkedIn (Mountain View, CA)
    …is to drive reliability for employee facing systems, advocate for effective observability and monitoring, and leverage automation to reduce employee-impacting ... the team. LinkedIn is looking for a Staff Enterprise Engineer to transform and lead our Service Management team...as a primary advocate and key customer for Engineering-led observability initiatives. + Use monitoring and telemetry…
    LinkedIn (09/22/25)
  • Senior Enterprise Operations Engineer

    Mastercard (San Francisco, CA)
    …concepts, advanced DevOps practices, CI/CD pipelines, cloud security, and proficiency with monitoring and observability tools like Splunk and Dynatrace. The ... PCI DSS, GDPR). o Conduct regular security assessments and audits to mitigate risks. * Monitoring and Observability: o Design and implement monitoring and…
    Mastercard (09/06/25)
  • Distinguished Software Engineer

    LinkedIn (Mountain View, CA)
    …excellence and incident response + Define and build frameworks to improve monitoring, alerting, and observability across hundreds of services and systems ... as a senior technical leader driving the long-term reliability and observability strategy across LinkedIn's infrastructure + Re-architect LinkedIn's backend systems…
    LinkedIn (09/24/25)