• Senior Machine Learning Engineer, Customer…

    Amazon (Santa Clara, CA)
    …knowledge sources and actuation capabilities. - Innovate and implement observability and logging mechanisms for proactive issue identification, troubleshooting, and ... language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a…
    Amazon (09/06/25)
  • Software Engineer, Blockchain Platform…

    Coinbase (Sacramento, CA)
    …* *We build infrastructure to provide the most secure and highest uptime*: Observability and monitoring is a cornerstone of the team's philosophy in order to ... [Reimagining Ethereum staking node architecture to improve performance and reliability](https://www.coinbase.com/developer-platform/discover/solutions/ethereum-staking-node) *Pay Transparency Notice:* Depending on your work location,…
    Coinbase (08/09/25)
  • AVP, Technology Operations

    PennyMac (Westlake Village, CA)
    …of Technology Operations is a key leadership role responsible for overseeing the Site Reliability Operations (SRO) team that provides 24/7 monitoring and support ... Management - Lead, mentor, and develop a team of Site Reliability Operations Engineers across all levels ... that meet or exceed business requirements. + Monitoring & Observability - Lead the development and implementation of comprehensive…
    PennyMac (08/07/25)
  • Principal Analyst (Architect), Enterprise…

    Vail Resorts (CA)
    …(e.g., call recording consent models, toll-free usage, CLI rules). + Observability: global telemetry (NetFlow/IPFIX), synthetic testing, MOS/RTT analytics, SIP ladder ... traces, DEM-correlated across regions/clouds. **Security, Reliability & Compliance:** + Partner with Security on zero-trust zone models, least-privilege access,…
    Vail Resorts (09/10/25)
  • Cloud & Platform Engineering Architect

    Rubrik (Palo Alto, CA)
    …experience. + **10+ years of hands-on experience** in a Cloud Engineering, DevOps, or Site Reliability Engineering role, with a strong focus on cloud operations. ... for multi-cloud resource governance & Management + **Cloud Operations & Reliability:** Lead the day-to-day operations of our cloud infrastructure, ensuring high…
    Rubrik (08/14/25)