  • Senior Software Development Engineer

    Zoom (San Jose, CA)
    …APIs with Spring MVC and related technologies. + Guiding performance tuning, monitoring, and observability of Spring-based services. + Collaborating with product ... What you can expect You will work as a hands-on backend software engineer with system-level thinking, using Java to extend functionality, lead development and…
    Zoom (09/11/25)
  • Senior Systems Software Engineer

    NVIDIA (Santa Clara, CA)
    …the center of this revolution. We are seeking a motivated Senior Systems Software Engineer to join our AV Infrastructure organization and become a key driver in ... to support AV software builds, large-scale simulation testing, and real-time observability. + Innovate developer tooling and automation frameworks to mitigate…
    NVIDIA (09/11/25)
  • Sr. Engineer

    IBM (San Jose, CA)
    …of DevOps principles in a cloud environment. * Familiarity with cloud monitoring tools to implement robust observability practices that prioritize metrics, ... matter expert on quality development with an emphasis on Golang development * Lead and execute large-scale projects, ensuring the reliable delivery of key features…
    IBM (09/03/25)
  • Senior Data Processing Platform Engineer

    NVIDIA (Santa Clara, CA)
    …data systems like Ray, Spark Rapids + Familiarity with metrics collection, health monitoring, and observability tools + Building, operating and maintaining full ... data scientists to use. As a data processing platform engineer, you will design, implement and operate Kubernetes based...at scale, with high availability and reliability. You will lead and encourage adoption of the data processing service,…
    NVIDIA (08/09/25)
  • Principal, Software Engineer

    Walmart (Sunnyvale, CA)
    …adaptive security frameworks across the enterprise. **What you'll do:** As a **Principal Engineer** at Walmart, you will serve as a key technical thought leader ... enterprise. You will influence strategic technology decisions, mentor teams, and lead by example in building high-scale, intelligent systems that integrate…
    Walmart (07/12/25)
  • Principal Site Reliability Engineer (Prisma…

    Palo Alto Networks (Santa Clara, CA)
    …robust and performant. This includes automation, architecture, performance, observability, troubleshooting, security, and reliability. Our Infrastructure Platform ... tools and automation frameworks, championing Infrastructure as Code (IaC) and Monitoring as Code (MaC) principles + Automate robust deployments and orchestrate…
    Palo Alto Networks (09/06/25)
  • Principal Staff Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …cloud. Join us in this exciting endeavor! What You Will Be Doing: + Lead initiatives to transform IT Compute Core Team, architecture to build new service offerings ... building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity planning, and lifecycle management. + Define…
    NVIDIA (08/21/25)
  • Senior Site Reliability Engineer

    LiveRamp (San Francisco, CA)
    …with Engineering teams** + **Set up and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and Terraform ... clouds (GCP or AWS)** + **Experience with deployment and monitoring of highly scalable products.** + **Hands-on experience...+ **Experience with SRE best practices, working knowledge of observability principles is a big plus** + **Ability to…
    LiveRamp (08/07/25)
  • Senior, Data Engineer

    Walmart (Sunnyvale, CA)
    …automation. + Deploy and monitor products on **cloud platforms with agent observability**, telemetry, and auditability in mind. + Develop and implement ... best-in-class **data health monitoring, traceability, and context enrichment** processes to ensure data...data used by agents is reliable and governed. + Lead technical solutioning for full-lifecycle projects, with an eye…
    Walmart (09/13/25)
  • AVP, Technology Operations

    PennyMac (Westlake Village, CA)
    …maintaining service level agreements (SLAs) that meet or exceed business requirements. + Monitoring & Observability - Lead the development and implementation ... operational efficiency and system reliability. + Team Management - Lead, mentor, and develop a team of Site Reliability...of comprehensive monitoring and observability practices using New Relic and other tools to…
    PennyMac (08/07/25)