- Insight Global (Woonsocket, RI)
- Job Description Insight Global is seeking 2 Site Reliability Engineers to drive observability, resiliency, executive-ready reporting, and Level 3+ support for ... including metrics, logs, traces, alerting, dashboards, and SLO/SLIs. * Build reliability reporting and scorecards (uptime, latency, error budgets, MTTR) that provide… more
- Red Hat (Raleigh, NC)
- Red Hat is looking for a Platform Engineer to join its Platform Engineering team! In this role, you will help architect, implement, improve, and support the ... your expertise in SRE principles, you will help create an environment where reliability, scalability, and security come first, and are not treated as an… more
- ServiceNow, Inc. (San Diego, CA)
- It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... engineers who are tasked with maintaining and developing the reliability, scalability and performance of the ServiceNow infrastructure. The...as a company and the SRE role. **As an Engineer on the SRE team you will:** + Provide… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Observability & Telemetry collection… more
- IBM (Austin, TX)
- …seeking a motivated and detail-oriented IT Administrator Intern with an interest in Site Reliability Engineering (SRE) to join our team. This internship offers ... in enterprise IT systems management while introducing you to modern reliability engineering practices. You'll work alongside experienced professionals to support… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
- Oracle (Carson City, NV)
- …Center** You will be joining the OCSC (Oracle Cloud Service Centre) as an SRD (site reliability developer). Your job role will be helping Oracle ensure the ... experiencing both development and operations. As a Cloud Service Centre Site Reliability Developer Intern you will be involved with: **Operations** + Administer… more
- JPMorgan Chase (Jersey City, NJ)
- Responsibilities: + Design and implement solutions to enhance the reliability and scalability of AI/ML platforms and applications to accommodate fast growing ... leadership by defining and evaluating standards and architecture for reliability, observability and automation frameworks. + Build strong cross-functional… more
- Palo Alto Networks (Santa Clara, CA)
- …US Citizen or Green Card holder.** **Your Career** We are seeking development-heavy Site Reliability Engineers (SREs) who are passionate about bringing new ideas ... ensure applications align with infrastructure requirements, focusing on scalability and reliability + Collaborate with PMs to deliver compliances (SOC2, Fedramp,… more
- MongoDB (New York, NY)
- …Jaeger, Fluentbit, and Vector. In addition to owning our observability infrastructure, as an Engineer on the team, you'll also work closely with other SWE and SRE ... spans the globe - including several cloud providers + Build for reliability, making services and infrastructure available, resilient, fault tolerant and self-healing… more