- G2 Ops, Inc. (El Segundo, CA)
- …make an impact: For this opportunity, we are seeking a highly motivated, team-oriented Systems Engineer . This exciting position will have the chance to work on ... to Reliability , Maintainability, and Availability (RMA) engineering in defense systems , including DoD supply and logistics engineering for space systems … more
- PennyMac (Westlake Village, CA)
- …through the complete mortgage journey. A Typical Day As a member of the Site Reliability Operations (SRO) team, you will help provide 24/7 monitoring and ... support of Pennymac's database infrastructure and related systems . This role focuses specifically on database operations, performance optimization, and ensuring the … more
- Insight Global (New York, NY)
- …Skills and Requirements . 4+ years of experience in a DevOps or Site Reliability Engineering role . Demonstrated ability to maintain a large ... and taking a holistic view of system health. . Build software and systems to manage platform infrastructure and applications. . Improve reliability , quality,… more
- Mondelez International (Chicago, IL)
- …to minimize downtime and maximize productivity. + **Root Cause Analysis (RCA)** : Reliability engineer should be adept at conducting root cause analyses to ... activities before failures occur. + **Data Analysis and Decision-Making** : Reliability engineer rely on data-driven decision-making to prioritize maintenance… more
- RELX INC (Boca Raton, FL)
- …with modern multi cloud platforms and cutting-edge tools to enhance system reliability , visibility, and security across the entire development lifecycle. If you are ... passionate about scalable systems and accelerating engineering teams, you will make a...Role: This position i ndividuals are responsible for challenging reliability and toil reduction projects. Key Responsibilities: + Monitoring… more
- Insight Global (Plano, TX)
- …monitoring: measure, analyze, regularly assess and improve the reliability of core infrastructure components (networking equipment, compute, databases, ... single points of failure or component failures. -Maintain and improve the reliability , availability, and performance of production services, with a focus on reducing… more
- MongoDB (New York, NY)
- …Jaeger, Fluentbit, and Vector. In addition to owning our observability infrastructure, as an Engineer on the team, you'll also work closely with other SWE and SRE ... spans the globe - including several cloud providers + Build for reliability , making services and infrastructure available, resilient, fault tolerant and self-healing… more
- NVIDIA (Santa Clara, CA)
- …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... analyze capacity data and develop plans for appropriate level enterprise-wide systems , and coordinate with management personnel in implementing changes. + Develop… more
- MongoDB (New York, NY)
- …Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications. The Site Reliability Engineering team designs and builds the global ... health of the system. We are strong believers in infrastructure-as-code and self-healing systems . The SRE Team is fully integrated with all the other engineering… more
- Nutanix (Albany, NY)
- …team plays a crucial role in ensuring the smooth operation of critical systems , leveraging cutting-edge technologies and automation to achieve our goals. You will ... Our work setup is hybrid, requiring you to be on- site three days a week while giving you the...events. **Your Role** + Ensure the 24/7 availability and reliability of Nutanix's cloud services and infrastructure. + Respond… more