- Zscaler (Short Hills, NJ)
- …days a week. Exceptional remote candidates will also be considered. As a Principal Site Reliability Engineer - ML Platform , you will: + Architect, build, and ... secure. The pioneering, AI-powered Zscaler Zero Trust Exchange (TM) platform , which is found in our SASE and SSE...maintain large-scale distributed systems to support end-to-end AI pipelines, including data collection,… more
- NVIDIA (Durham, NC)
- …related field (or equivalent experience). + 5+ years of experience in a Site Reliability , DevOps, or Systems Engineering role. + Strong automation and scripting ... people. NVIDIA is looking for a highly motivated SRE Engineer to join the NVIDIA AIR team - the...on secure production infrastructure. + Manage deployment/upgrades for Operating Systems , Kubernetes (k8s) clusters, and other orchestration tools. +… more
- DoorDash (San Francisco, CA)
- …ship, observe, and remediate production systems . About the Role As a Software Engineer on Reliability Platforms, you'll help design and build the systems ... platform peers to create durable abstractions that make reliability the path of least resistance. This is a...execute production changes - moving DoorDash toward proactive, self-healing systems . We're excited about you because + Platform… more
- JPMorgan Chase (Jersey City, NJ)
- …your skillsets to drive innovation and modernize the world's most complex and mission-critical systems . As a Site Reliability Engineer III at JPMorgan Chase ... to your team by sharing your knowledge of end-to-end operations, availability, reliability , and scalability of your application or platform . **Job… more
- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- Hussmann Corporation (Bridgeton, MO)
- …collaboration with design teams to drive product improvements. The System Reliability Engineer will collaborate with platform development, manufacturing ... deliver Industry leading Quality and Reliability . The Systems Reliability Engineer develops the...plans/protocols, Failure modes and effects analysis (FMEA), to achieve system reliability and meet acceptance criteria. +… more
- Coinbase (Charlotte, NC)
- …part of Infrastructure ( Platform ) org responsible for paving the path for system 's reliability and scalability. We manage multiple company wide projects like ... Canary based safe release capability to ensure company wide system 's reliability and less customer impact ....and less customer impact . As a *Senior Software Engineer * you will help to promote reliability … more
- Wells Fargo (Charlotte, NC)
- **Overview** We are seeking a Senior Site Reliability Engineer (SRE) to help develop our platform operations across Windows, Linux, and cloud-native ... infrastructure that supports diverse applications across our enterprise. **Key Responsibilities** ** Platform Reliability & Cloud Engineering** + Ensure high… more
- Hyundai Autoever America (Fountain Valley, CA)
- Purpose: Hyundai AutoEver America is seeking a highly experienced Senior or Lead Platform Engineer /Site Reliability Engineer (SRE)/Hadoop Admin to manage ... data infrastructure. This role requires a hands-on technical leader who can drive platform innovation, ensure high availability and reliability , and mentor team… more
- SAIC (San Diego, CA)
- … engineer , IT Specialist, platform engineer , site reliability engineer , release engineer , systems administrator, systems engineer , or ... **Description** SAIC is seeking a cleared (TS/SCI) **Senior Platform Engineer ** in support of NAVWAR's...role in operationalizing AI for next-generation naval platforms and systems . You'll focus on building and managing secure, scalable… more