• Distinguished Engineer

    NVIDIA (Santa Clara, CA)
    …at NVIDIA, you will own the development of DGX Cloud strategy for observability , monitoring , and remediation across all layers of infrastructure, IaaS, platforms ... define and drive the technical implementation for DGX Cloud offerings in the observability , monitoring , and remediation practice. + Collaborate on Cross Domain… more
    NVIDIA (11/24/25)
    - Related Jobs
  • Observability Architect Engineer

    TEKsystems (Los Angeles, CA)
    Description Summary We are seeking a senior engineer /architect to lead the design, implementation, and optimization of full stack observability solutions ... solutions and mentor engineering teams. Key Responsibilities * Own end-to-end observability architecture: Design and implement integrated monitoring solutions… more
    TEKsystems (11/24/25)
    - Related Jobs
  • Sr. Staff Software Engineer - Network…

    LinkedIn (Mountain View, CA)
    Lead the architectural design and implementation of large-scale observability platforms, including telemetry ingestion, real-time analytics, network health ... in hyperscale or large distributed cloud environments. + Background in building observability stacks (metrics, logs, traces) or network monitoring platforms. +… more
    LinkedIn (11/20/25)
    - Related Jobs
  • Distinguished, Software Engineer

    Walmart (Sunnyvale, CA)
    **Position Summary ** **What you'll do ** As an observability Distinguished Engineer , you will be a key researcher and technical lead expert in the ... architecture and development of cloud native observability designs, managed services, and real-time telemetry software systems. You will use your depth of… more
    Walmart (10/21/25)
    - Related Jobs
  • Lead Site Reliability Engineer (SRE)

    EPAM Systems (Los Angeles, CA)
    …not just building software - we're engineering excellence. We're looking for a ** Lead Site Reliability Engineer (SRE)** with a passion for performance, ... Troubleshoot mission-critical systems and implement preventative problem management solutions + Lead on promoting observability , scalability, and resiliency best… more
    EPAM Systems (11/01/25)
    - Related Jobs
  • Lead Engineer , Inference Platform

    MongoDB (Palo Alto, CA)
    We're looking for a Lead Engineer , Inference Platform to join our team building the inference platform for embedding models that power semantic search, ... all deeply integrated into Atlas and optimized for developer experience. As a Lead Engineer , Inference Platform, you'll be hands-on with design and… more
    MongoDB (09/27/25)
    - Related Jobs
  • Lead Data Engineer

    AbbVie (Irvine, CA)
    …Aesthetics on LinkedIn. Allergan Aesthetics | An AbbVie Company Job Description As the Lead Data Engineer ,you will report to the Engineer Manager (Data ... expose and integrate data products with software systems Implement monitoring , logging, and alerting systems to proactively identify and...field + 7+ years of experience as a Data Engineer or Software Engineer developing and maintaining… more
    AbbVie (09/04/25)
    - Related Jobs
  • Lead Software Engineer - Back…

    The Walt Disney Company (Santa Monica, CA)
    …behind personalization, commerce, lifecycle, and identity. **Job Summary:** As a Tech Lead Software Engineer , you will collaborate closely with engineers, ... and scaling within a cloud infrastructure. + Experience with observability tools for metrics, logging, and monitoring ...with observability tools for metrics, logging, and monitoring (such as Datadog, Grafana). + Strong communication skills… more
    The Walt Disney Company (11/13/25)
    - Related Jobs
  • Distinguished Software Engineer

    LinkedIn (Mountain View, CA)
    …excellence and incident response + Define and build frameworks to improve monitoring , alerting, and observability across hundreds of services and systems ... as a senior technical leader driving the long-term reliability and observability strategy across LinkedIn's infrastructure + Re-architect LinkedIn's backend systems… more
    LinkedIn (09/24/25)
    - Related Jobs
  • Sr. Lab Software Engineer , Control…

    Amazon (Pasadena, CA)
    …include the following: - Develop and integrate telemetry systems to enable real-time observability and monitoring of our laboratory software and hardware. - ... web development (eg client-server architectures, databases) - Experience working with observability and monitoring tools (eg OpenTelemetry, Grafana) - Experience… more
    Amazon (11/14/25)
    - Related Jobs