- Meta (Menlo Park, CA)
- …we are seeking engineers to work in the space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer, SystemML - Scaling / ... SW stacks around NCCL and PyTorch to improve full-stack distributed ML reliability and performance (e.g., large-scale GenAI/LLM training) from the trainer down to…
- Meta (Menlo Park, CA)
- …are seeking engineers to work in the space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer, SystemML - AI Networking ... SW stacks around NCCL and PyTorch to improve full-stack distributed ML reliability and performance (e.g., large-scale GenAI/LLM training) from the trainer down to…
- PagerDuty (San Francisco, CA)
- …frequent testing and learning **Basic Qualifications:** + Deep understanding of the developer and reliability engineer as an end user and how they relate to the ... rest of their organization + Understanding of the relationship between the core end user and the buyer and how that correlates with growth and retention + Understanding of how modern organizations experience, and want to experience, incident management…
- NVIDIA (Santa Clara, CA)
- …experience in Solutions Architecture, Technical Program Management, Product Management, System Reliability Engineer or other complex multi-functional roles. + ... Proven track record to lead and influence without direct authority across technical and business functions. + Proven analytical skills with experience in establishing benchmarks, collecting/analyzing intricate data, and redefining data into strategic themes,…
- Evolent (Sacramento, CA)
- …infrastructure. We are transforming the way we manage cloud infrastructure and application reliability, your role as a Platform Engineer with a **sharp focus ... in becoming part of our high-performance team. **Responsibilities**: + Engineer and manage observability solutions using **OpenTelemetry** to monitor and trace…
- Google (Mountain View, CA)
- …or read a career profile (https://careers.google.com/stories/site-reliability-engineering-profile-google/) about why a Software ... Site Reliability Manager, Site Reliability Engineering, Google, Mountain View, CA, USA **Advanced** Experience owning outcomes and…
- Graphic Packaging International, LLC (Irvine, CA)
- Maintenance Manager - Reliability Centered Requisition ID: 12262 Location: Irvine, CA, US, 92606 Department: Manufacturing & Operations Travel: Up to 25% **At Graphic ... hear from you.** **A World of Difference. Made Possible.** **Job Description: Reliability Centered Maintenance Manager** The GPI Irvine Reliability Centered…
- ServiceNow, Inc. (San Diego, CA)
- It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... and data-aware automation experiences for our customers. The Opportunity: Principal Software Engineer We are looking for a Principal Software Engineer to…
- Oracle (Sacramento, CA)
- …infrastructure services that drive OCI's hardware lifecycle activities Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a ... characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.…
- Belcan (Berkeley, CA)
- Test Engineer Job Number: 361538 Category: Test Engineering Description: Job Title: Test Engineer Pay $48-$55 DOE Location: Berkeley, CA Zip Code: 94710 ... Contract: 6 months A Test Engineer job opportunity is open with our client in...design, develop, and implement test strategies to ensure the reliability, safety, and performance of energy storage products. You…