- NVIDIA (Santa Clara, CA)
- …infrastructure, test automation (SDET), and Infrastructure as Code (IaC) + Architect and implement scalable test automation strategies for AI inference workloads, ... effectively. + Attain operational proficiency encompassing 24x7 on-call rotations, SRE methodologies, automated monitoring, and self-repairing systems to guarantee…
- Microsoft Corporation (Redmond, WA)
- …Engineer. This role is a dynamic blend of Platform Engineering, DevOps/SRE, and Big Data Infrastructure Engineering, focused on enabling large-scale data ... subject to local law and may vary by jurisdiction. **Responsibilities** + Architect and maintain scalable, reliable, and observable Big Data Infrastructure for…
- The Boeing Company (Seattle, WA)
- …loops and user research to build tools they want to use + Architect and maintain "paved roads", highly automated, opinionated workflows that simplify the journey ... resources + Lead engineering teams in implementing Site Reliability Engineering (SRE) practices, utilizing error budgets and blame-free post-mortems to balance…
- Travel + Leisure Co. (Orlando, FL)
- …high-performing, primarily remote team of cloud engineers. Foster a DevOps/SRE culture including blameless postmortems, automation-first mindsets, and continuous ... science, Engineering, or related field, or equivalent experience + AWS Certified Solutions Architect (preferred) + Multi-cloud certifications (nice to have) + ITIL or…
- Huntington National Bank (Columbus, OH)
- …data synchronization. + Reliability & Site Reliability Engineering (SRE): Establish and enforce SLOs/SLAs, observability standards, disaster recovery strategies, ... architectures, and implementing data quality controls. + Strong grounding in SRE/DevOps practices and a security-first mindset. Preferred Qualifications + GCP…
- Amazon (Denver, CO)
- …platforms to support hybrid infrastructure environments - System Design: Architect and implement secure, scalable solutions while considering system ... Basic Qualifications - 4+ years of site reliability engineering (SRE), systems engineering, systems administration, DevOps, security administration, or network…
- Oracle (Phoenix, AZ)
- …Overview Join OCI's Edge Security team as a Principal Engineer to architect and deliver cloud-scale DDoS protection. You'll lead design for high-performance ... lifecycle. - Mentor engineers, influence cross-org roadmaps, and collaborate with Product, SRE, and Network Engineering from concept to GA. Basic qualifications -…
- Walmart (Sunnyvale, CA)
- …improve overall renewal rate, first-year retention, and PLUS penetration._ + **_Drive engineering, SRE, and data excellence_** + _Establish and track DORA, SRE, ... and OE/EE metrics (SLAs, availability, deployment frequency, MTTR, CFR, code coverage, cloud efficiency) for lifecycle services and data products._ + _Implement robust observability and data quality governance so lifecycle decisions are made on trustworthy…
- Cisco (Research Triangle Park, NC)
- …engineering, and getting rid of tedious, manual tasks. **Your Impact** Splunk's FedRAMP SRE team is looking for a Site Reliability Engineer to help lead, design ... service owners across the platform to teach and implement modern interpretations of SRE, observability, Chaos Engineering and DevOps. This role is highly visible and…
- MongoDB (New York, NY)
- **Team and Role Overview** The SRE Observability team is part of the larger Platform Engineering organization, and is dedicated to building and maintaining the ... team, you'll also work closely with other SWE and SRE teams to promote and implement best practices in...by all parts of the engineering organization + Design, architect, build and deliver core pieces of our observability…