  • Senior Platform Telemetry Engineer

    NVIDIA (Santa Clara, CA)
    …product development. + Bring up clarity on architecture for fleet health monitoring and fault-remediation solution at scale. Work with customers and other ... architects, understand their requirements on health monitoring, making best use of available capabilities in-band as well as out of band. Detailed architecture, do… more
    NVIDIA (08/15/25)
  • Senior DevOps Engineer

    Zoom (San Jose, CA)
    …availability and performance optimization; + Operate and maintain an in-house monitoring system to proactively identify and resolve system and application issues; ... enhancing operational efficiency and reliability; + Manage deployment and continuous monitoring of the async search platform, ensuring scalability and responsiveness… more
    Zoom (08/13/25)
  • Senior, Software Engineer

    Walmart (Sunnyvale, CA)
    …to the roadmap of Walmart's core machine learning capabilities. + Create monitoring dashboards; perform latency tuning of deep learning models, scaling solutions to ... compare models, features, and hyperparameters; utilize A/B testing and continuous monitoring to validate and adjust models. + Possess excellent communication skills… more
    Walmart (08/08/25)
  • Senior Site Reliability Engineer

    LiveRamp (San Francisco, CA)
    …with Engineering teams** + **Set up and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and Terraform ... Containers and public clouds (GCP or AWS)** + **Experience with deployment and monitoring of highly scalable products.** + **Hands-on experience with FinOps and… more
    LiveRamp (08/07/25)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …of large-scale Kubernetes clusters with focus on performance at scale, real-time monitoring, logging and alerting + Engage in and improve the whole lifecycle of ... reviews. + Maintain services once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through… more
    NVIDIA (08/01/25)
  • Senior RF Electrical Engineer

    Teledyne (Rancho Cordova, CA)
    …and defense, factory automation, air and water quality environmental monitoring, electronics design and development, oceanographic research, deepwater oil and ... and defense, factory automation, air and water quality environmental monitoring, electronics design and development, oceanographic research, energy, medical imaging… more
    Teledyne (07/29/25)
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    …with Kubernetes including cluster operations, operator development, node health monitoring and working with GPU resource scheduling. We welcome out-of-the-box ... software related to scheduling GPU resources on Kubernetes. + Implementing monitoring and health management capabilities that enable industry-leading reliability,… more
    NVIDIA (07/02/25)
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …of NVIDIA Server platforms. + Designing and developing performance-optimized active monitoring BMC solutions using DMTF Standards including MCTP, Redfish, SPDM and ... BMC-BIOS communication, thermal management, power management, firmware update, device monitoring, firmware security, etc. + Board Bring-up expertise with hands-on… more
    NVIDIA (07/01/25)
  • Senior, Software Engineer (Machine…

    Walmart (Sunnyvale, CA)
    …expertise with Cloud Technologies like Azure and GCP. + Experience in monitoring production system and using different systems like Grafana, Prometheus. + Strong ... inclination towards exploring and learning new technologies. + You have strong written and oral communication skills. + Experience with all phases of the software development life cycle, best practices, and Agile Software Development. **About Walmart Global… more
    Walmart (08/22/25)
  • Senior Network Engineer

    Cadence Design Systems, Inc. (San Jose, CA)
    …issues. + Experience in security policy implementation and enhancing network monitoring. + Proven ability to manage mission-critical infrastructure projects. Minimum ... Requirements + Degree in Computer Science. + Expertise in at least two functions: Cisco Remote Access & Site-to-site VPN, Aruba ClearPass NAC, Cloud Networking and F5 Load-balancer is a must. + CCNP/CCIE or equivalent expert level certifications required. +… more
    Cadence Design Systems, Inc. (08/22/25)