• Sr. Staff Software Engineer, Reliability (Edge…

    LinkedIn (Mountain View, CA)
    …Engineer, you will fill the mission-critical role of ensuring that our complex, web- scale systems are healthy, automated, redundant and designed to scale . You ... that software automation is a key component to operating large- scale systems. Responsibilities: + You will function as the...by driving innovation while building and shipping software at scale . + You will design products, services, tools and… more
    LinkedIn (10/16/25)
    - Related Jobs
  • Senior Software Engineer, AI Resiliency

    NVIDIA (Santa Clara, CA)
    …in defining and implementing critical resiliency features for AI supercomputers at a scale of 100,000+ GPUs. Your expertise will be crucial in driving down cluster ... software features that improve AI system reliability at a massive scale , such as fast checkpoint-recovery, error detection, error isolation, and straggler/hang… more
    NVIDIA (10/15/25)
    - Related Jobs
  • Staff Technical Program Manager, Security

    DoorDash (San Francisco, CA)
    …Role We're looking for a Staff Technical Program Manager to drive large- scale programs across Security Engineering and Enablement. You'll focus on building ... programs that deliver scalable, reliable, and efficient security solutions at scale , aligned to DoorDash's business growth. + Drive Execution & Optimization:… more
    DoorDash (10/08/25)
    - Related Jobs
  • Principal Staff Software Engineer, Service…

    LinkedIn (Mountain View, CA)
    …provides the foundation for all online services to run reliably, efficiently, and at scale . We power core technologies that enable every product team at LinkedIn to ... balancing scalability, reliability, developer experience, and efficiency. + Lead large- scale gRPC infrastructure initiatives, including evolution from bridged to… more
    LinkedIn (10/08/25)
    - Related Jobs
  • Sr. Staff Software Engineer, AI Infra

    LinkedIn (Mountain View, CA)
    …engineering and serving with hundreds of billions of parameters models and large scale feature engineering infra for all AI use cases from recommendation models, ... high performance, enable on-device and online training. Challenges include scale (10s of thousands of QPS, multiple terabytes of...using thousands of features), and enabling GPU inference at scale . As a Sr. Staff Software Engineer, you will… more
    LinkedIn (09/27/25)
    - Related Jobs
  • Manager, Software Engineering - Enterprise…

    LinkedIn (Mountain View, CA)
    …of Work." You will shape the future of employee productivity through large scale AI, automation, and SaaS (vendor and homegrown) systems. You will collaborate with ... You will partner with senior management in owning responsibility for large scale vendor SaaS and on-prem solutions, including contract negotiations, solution design,… more
    LinkedIn (09/26/25)
    - Related Jobs
  • Sr. Applied Scientist, FAR (Frontier AI…

    Amazon (Seattle, WA)
    …that bridge the gap between research and real-world deployment at Amazon scale . In this role, you'll combine hands-on technical work with scientific leadership, ... robotic foundation models and efficient, promptable model architectures that can scale across diverse robotic applications. Key job responsibilities - Lead technical… more
    Amazon (09/24/25)
    - Related Jobs
  • Sr. Technical Program Manager

    LinkedIn (Mountain View, CA)
    …projects, as well as initiating, planning, and executing intermediate-to-large scale cross-functional programs. The ideal candidate has balance of people, ... Designing, expanding, retrofitting, and decommissioning data center capabilities at scale . + Data Center Operations & Engineering: Driving forecasting, budget… more
    LinkedIn (09/22/25)
    - Related Jobs
  • Senior Site Reliability Engineer - Observability…

    NVIDIA (Santa Clara, CA)
    …at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of ... + Design, implement and support operational and reliability aspects of large scale Observability & Telemetry collection platform with a focus on performance at… more
    NVIDIA (12/19/25)
    - Related Jobs
  • Principal Customer Solutions Manager , Strategic…

    Amazon (Seattle, WA)
    …on the planet, tackling extraordinary technical and business challenges at a scale few organizations in the world face. Foundation Model Provider's compute footprint ... on the AWS platform, pushing the boundaries of what's possible with exabyte- scale data, millions of interconnected GPUs, complex networking topologies, and custom… more
    Amazon (12/18/25)
    - Related Jobs