  • Distinguished, Software Engineer

    Walmart (Bentonville, AR)
    …Engineers, Product Managers, and MLOps/DevOps teams to streamline model deployment, monitoring, and lifecycle management **What You Will Bring** + Expert-level ... systems, including data ingestion, feature engineering, model training, deployment, and monitoring. + A solid understanding of core machine learning principles,… more
    Walmart (08/23/25)
  • Principal Staff Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity planning, and lifecycle management. + Define ... optimizations (SR-IOV/DPU) + Experience with technologies like eBPF and XDP for observability and DDoS mitigation + Collect and review system data for capacity and… more
    NVIDIA (08/21/25)
  • Staff, Software Engineer - MLE, People.AI

    Walmart (Bentonville, AR)
    …Engineers, Product Managers, and MLOps/DevOps teams to streamline model deployment, monitoring, and lifecycle management. **What You Will Bring** + Expert-level ... , including data ingestion, feature engineering, model training, deployment, and monitoring. + A solid understanding of **core machine learning principles**,… more
    Walmart (08/15/25)
  • Senior Software Engineer

    Microsoft Corporation (Redmond, WA)
    …solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in ... monitoring and operations at scale. **Qualifications** **Required/Minimum Qualifications** + Bachelor's Degree in Computer Science or related technical field AND 4+… more
    Microsoft Corporation (10/23/25)
  • Intl AI Engineer - AOR

    Insight Global (Woonsocket, RI)
    …retraining workflows using Vertex AI or Kubeflow on GCP. o Implement observability and reliability for AI decisions by logging predictions, confidence scores, and ... fallbacks into data lakes or monitoring tools. * Performance and Scalability o Ensure high availability, scalability, and security of AI services. o Optimize… more
    Insight Global (10/23/25)
  • Software Engineer, Edge Platforms

    New York Times (New York, NY)
    …Improve performance and reliability of primary systems by improving upon software observability, monitoring, logging, and instrumentation + You will design and ... implement automation to reduce operational toil for the team + You will use cloud native technology and design patterns such as Kubernetes and Pub/Sub + This role may require limited on-call hours. An on-call schedule will be determined when you join, taking… more
    New York Times (10/23/25)
  • Principal Engineer

    Microsoft Corporation (Redmond, WA)
    …solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in ... monitoring and operations at scale and shares knowledge with other engineers. **Qualifications** **Required/Minimum Qualifications (RQs/MQs)** + Bachelor's Degree in… more
    Microsoft Corporation (10/22/25)
  • Senior Software Engineer, OneDrive…

    Microsoft Corporation (Redmond, WA)
    …of current developments that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in ... monitoring and operations at scale. + Reviews work items to deepen knowledge of product features in partnership with appropriate stakeholders (e.g., project managers)… more
    Microsoft Corporation (10/21/25)
  • Software Engineer: Azure Data Intern…

    Microsoft Corporation (Austin, TX)
    …new knowledge that will improve the availability, reliability, efficiency, observability, and performance of query execution systems while driving consistency in ... monitoring and operations at scale. **Qualifications** **Required Qualifications** + Enrolled in a full time bachelor's or master's program in Computer Science,… more
    Microsoft Corporation (10/18/25)
  • Senior DevSecOps Engineer

    Lockheed Martin (Highlands Ranch, CO)
    …around container orchestration, package management, admission control, logging and monitoring, service mesh and traffic observability. - Familiarity with ... test-driven development. - Experience contributing to open source. - Familiarity with Government DevSecOps initiatives. - Self-motivated with strong teamwork and organizational skills. - Comfortable working with different customers in varying work… more
    Lockheed Martin (10/16/25)