• (USA) Principal, Software Engineer

    Walmart (Sunnyvale, CA)
    …(including LLMs) and agentic frameworks (eg, RAG, Crew AI) for performance monitoring , anomaly detection, and automated remediation. + Develop and optimize LLM-based ... and disaster recovery. + Integrate external data sources (vector databases, observability stacks) to build dynamic, context-aware, and self-healing systems. + Lead… more
    Walmart (12/24/25)
    - Related Jobs
  • Senior Machine Learning Engineer

    Cisco (San Jose, CA)
    …the team** Join the engineering team building the intelligent backbone of Splunk Observability Cloud. We are committed to leveraging the latest advancements in data ... and Agentic AI to enable AI features in Splunk Observability + Collaborate across engineering and product teams to...in AWS environment with cloud native solutions. + Experience monitoring and analyzing metrics, trace, span, and log content… more
    Cisco (12/16/25)
    - Related Jobs
  • Site Reliability Engineer - Platform

    Coinbase (Sacramento, CA)
    …systems capable of handling high throughput and low latency * Experience with observability and monitoring systems such as Kibana, Datadog, etc. * Familiarity ... projects within the context of strong support and mentorship. * Improve observability , reliability and availability by defining and measuring key metrics * Build… more
    Coinbase (11/14/25)
    - Related Jobs
  • Senior AI Platform Engineer

    PennyMac (Westlake Village, CA)
    …through the complete mortgage journey. A Typical Day The Senior AI Platform Engineer will: + Design, implement, and manage scalable and resilient infrastructure on ... GCP environments. + Automate model lifecycle management (training, deployment, monitoring ) through CI/CD pipelines, ensuring reproducibility and seamless integration… more
    PennyMac (01/07/26)
    - Related Jobs
  • Software Engineer , Cloud Dataproc, Open…

    Google (Sunnyvale, CA)
    Software Engineer , Cloud Dataproc, Open Source _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and ... on and is growing every day. As a software engineer , you will work on a specific project critical...+ Enhance Apache Spark for performance, reliability, security, and monitoring , and simultaneously enhance Lake House technologies like Iceberg,… more
    Google (01/21/26)
    - Related Jobs
  • Site Reliability Engineer

    IBM (San Jose, CA)
    …Kubernetes). Knowledge of CI/CD pipelines and automation tools. Exposure to monitoring and observability tools (Prometheus, Grafana, ELK). Strong analytical ... talk **Your role and responsibilities** As a Site Reliability Engineer at IBM, you'll get to work on the...such as: * Design and implement automation for deployment, monitoring , and incident response. * Maintain and improve system… more
    IBM (01/17/26)
    - Related Jobs
  • Site Reliability Engineer Intern

    IBM (San Jose, CA)
    …problems? If so, lets talk **Your role and responsibilities** As a Site Reliability Engineer Interm at IBM, you'll get to work on the systems that are driving ... today's market. IBM has an opening for Site Reliability Engineer Intern to join our team and help build...test, and support process, such as: * Assist in monitoring and maintaining system reliability across production environments. *… more
    IBM (01/17/26)
    - Related Jobs
  • Principal Salesforce Development Engineer

    CVS Health (CA)
    …manage Java logging frameworks such as Grafana and Prometheus to ensure effective monitoring and observability of applications, services and error handling. * ... is on the lookout for a highly skilled and experienced Principal Engineer to lead the architectural vision, design and implementation of innovative, patient-focused… more
    CVS Health (01/16/26)
    - Related Jobs
  • Senior, Software Engineer

    Walmart (Sunnyvale, CA)
    …automation, AI-driven risk analysis, and tight integration with DevOps, cloud, and observability platforms. We are looking for a **Senior Software Engineer ** ... lifecycle, including data ingestion, feature engineering, model evaluation, deployment, monitoring , and iteration, ensuring solutions are reliable, explainable, and… more
    Walmart (01/13/26)
    - Related Jobs
  • DevOps Engineer (Cortex)

    Palo Alto Networks (Santa Clara, CA)
    …advanced SecOps platform, including XSIAM, XSOAR, and XPANSE. As a Staff DevOps Engineer , you will help build, operate, and evolve the infrastructure and automation ... of large-scale production systems. This position is ideal for an experienced DevOps engineer who enjoys automation, cloud infrastructure, and CI/CD, and is ready to… more
    Palo Alto Networks (12/15/25)
    - Related Jobs