• Software Development Engineer AI/ML, Inference…

    Amazon (Cupertino, CA)
    …silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a particular focus on ... large-scale generative AI applications. Key job responsibilities * Architect and lead the design of distributed ML serving systems optimized for generative AI… more
    Amazon (09/21/25)
    - Related Jobs
  • AVP, Technology Operations

    PennyMac (Westlake Village, CA)
    …field + 8+ years of progressive experience in technology operations, infrastructure management, or site reliability engineering , with at least 3-5 years in a ... of Technology Operations is a key leadership role responsible for overseeing the Site Reliability Operations (SRO) team that provides 24/7 monitoring and support… more
    PennyMac (10/23/25)
    - Related Jobs
  • Principal DevOps Engineer (Cortex Observability)

    Palo Alto Networks (Santa Clara, CA)
    … and resolve production incidents **Your Experience** + 5+ years of experience in DevOps, Site Reliability Engineering , or Cloud Infrastructure roles + BS or ... that powers our large-scale cloud platform. You will work closely with engineering teams to enable fast and reliable software delivery, optimize system performance,… more
    Palo Alto Networks (11/07/25)
    - Related Jobs
  • Senior Software Engineer, Server Control Firmware

    Amazon (Cupertino, CA)
    …test infrastructure for our ML acceleration hardware, ensuring quality and reliability across our manufacturing operations. You will work on developing at-scale ... at board and server level test. You will work together with other engineering teams to unify testing solutions between manufacturing and data center operations… more
    Amazon (11/09/25)
    - Related Jobs
  • Principal Analyst, Disaster Recovery

    Vail Resorts (CA)
    …Partnership:** + Collaborate with Enterprise Architecture, DevOps, Security, Compliance, and SiteReliability teams to embed DR checks in CI/CD pipelines. ... workloads (VMware, Nutanix) to modern cloud‑native and container platforms, and architect an automated, continuously‑updated DR playbook that is embedded in our… more
    Vail Resorts (10/10/25)
    - Related Jobs
  • Senior Technical Product Manager

    Capital One (San Francisco, CA)
    …collaboration and delivering great experiences for our customers. Cloud Operations Resilience Engineering (CORE) is at the heart of our approach. CORE delivers ... enablement, cloud infrastructure for emerging Business, data infrastructure, production reliability , and recovery platforms. These products are focused on the… more
    Capital One (11/04/25)
    - Related Jobs
  • Senior Machine Learning Engineer , AGI - Amazon…

    Amazon (Sunnyvale, CA)
    …and AWS technologies to power AKG's knowledge access infrastructure. Architect fault-tolerant solutions that efficiently manage petabytes of structured knowledge ... Establish monitoring frameworks and define alerting strategies that ensure 24/7 reliability . Engage directly with customers to understand their needs and … more
    Amazon (11/18/25)
    - Related Jobs
  • Project Management - Project Manager - P2S

    Legence (Long Beach, CA)
    **P2S** stands as a provider of professional engineering services to a broad range of markets, including higher education, healthcare, ports/harbors, industrial, ... fire protection, and technology integration. Our offered services range from engineering and commissioning to construction management. With over 300 dedicated… more
    Legence (11/14/25)
    - Related Jobs
  • Senior Staff Machine Learning Engineer - Risk…

    Coinbase (Sacramento, CA)
    …like fraud detection, recommender systems, feed ranking, and risk assessment. * * Architect Scalable Models & Systems: * Architect and build production-grade AI/ML ... models & pipelines, that enable low-latency, high- reliability predictions. You will develop and deploy robust, low-maintenance applied AI/ML solutions * *Drive… more
    Coinbase (11/06/25)
    - Related Jobs
  • Senior Software Engineer/Technical Lead

    Pet Food Express (Concord, CA)
    …core systems with cutting-edge Azure tech. + Grow with us: Opportunities toward Solution Architect or Engineering Manager pathways. + Make a real impact: your ... solution architecture experience who thrives at the intersection of hands-on engineering and solution design. You'll lead technical efforts across the enterprise,… more
    Pet Food Express (10/31/25)
    - Related Jobs