• Senior System Design Engineer , SSD…

    SanDisk (Milpitas, CA)
    …new memory system firmware designs + Development and validation of data observability features and tools for validation purpose + Perform system-level verification ... tests of NAND management FW features + Perform end-of-life (EOL) reliability verification tests + Perform failure analysis on EOL and other FW maturity test failures + Data analysis based on drive logs and statistical data * Collaborate with FW and other… more
    SanDisk (07/31/25)
    - Related Jobs
  • Senior DGX Cloud Software Engineer

    NVIDIA (Santa Clara, CA)
    …internal facing service level objectives and error budgets as part of our overall observability strategy. + Eliminate toil or automate it where the ROI of building ... and maintaining automation is worth it. + Practice sustainable blameless incident prevention and incident response while being a member of an on-call rotation. + Consult with and provide consultation for peer teams on systems design best practices. +… more
    NVIDIA (07/26/25)
    - Related Jobs
  • Senior Software Engineer

    Aeris Communications (San Jose, CA)
    …non-functional requirements such as high-availability, scalability, security and observability . Plan development activities, develop accurate schedule estimates and ... provide daily progress updates in a stand-up meeting. Deliver high quality software in a predictable and reliable manner. Collaborate actively with other developers and other cross-functional teams like QA, SRE, and Operations. Assist in support of the… more
    Aeris Communications (07/24/25)
    - Related Jobs
  • Distinguished Software Engineer

    LinkedIn (Mountain View, CA)
    senior technical leader driving the long-term reliability and observability strategy across LinkedIn's infrastructureRe-architect LinkedIn's backend systems to ... and incident responseDefine and build frameworks to improve monitoring, alerting, and observability across hundreds of services and systemsDefine and own the roadmap… more
    LinkedIn (06/04/25)
    - Related Jobs
  • Data Engineer II

    The Walt Disney Company (Santa Monica, CA)
    …with other parts of The Walt Disney Company. **Job Summary:** The Data Reliability Engineer II will help us in the ongoing mission of delivering outstanding services ... be more data-driven. You will work closely with the senior members of our team to monitor and drive...team to monitor and drive improvements for reliability and observability of critical data pipelines and deliverables. This is… more
    The Walt Disney Company (07/25/25)
    - Related Jobs
  • Principal Platform Engineer

    The Walt Disney Company (Glendale, CA)
    …across all media platforms. **Job Summary:** We're looking for a Principal Platform Engineer , Infrastructure & Tooling for Services, Data, and GenAI to help shape ... scalable, and built to accelerate delivery without compromising quality. As a senior technical expert, you'll drive the architecture and evolution of our AWS-based… more
    The Walt Disney Company (07/17/25)
    - Related Jobs
  • Solutions Engineer - Pacific NW

    Cisco (CA)
    …a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers ... technical and domain expertise to introduce and operationalize Security and Observability use-cases and solutions. The Solutions Engineers (SEs) are Splunk's… more
    Cisco (08/15/25)
    - Related Jobs
  • Lead Software Engineer

    ADP (Pasadena, CA)
    **ADP is hiring a** **Lead Software Engineer ** **.** + _Are you empathetic to client needs and inspired by transformation and impacting the lives of millions of ... for you. Ready to design what's next?_ We are seeking a Lead Software Engineer to lead the technology transformation of our Tax Compliance platform into a modern,… more
    ADP (08/15/25)
    - Related Jobs
  • Sr. Platform Engineer (Hadoop Admin)

    Hyundai Autoever America (Fountain Valley, CA)
    Purpose: Hyundai AutoEver America is seeking a highly experienced Senior or Lead Platform Engineer /Site Reliability Engineer (SRE)/Hadoop Admin to manage and ... performance tuning, and troubleshooting across platform components + Optimize observability and telemetry using tools like Prometheus, Grafana, and OpenTelemetry… more
    Hyundai Autoever America (07/03/25)
    - Related Jobs
  • Staff, Software Engineer - NodeJS, GraphQL,…

    Walmart (Sunnyvale, CA)
    …** **What you'll do ** **About the Role:** We're seeking a **Staff Software Engineer ** to lead the design and evolution of our backend systems built on **Node.js** ... + Define and enforce engineering best practices in code quality, CI/CD, observability , and system resilience. + Partner with product, platform, and infrastructure… more
    Walmart (08/08/25)
    - Related Jobs