- NVIDIA (CA)
- …Work with NVIDIA's DGX Cloud team as a Senior Site Reliability Engineer to maintain high-performance DGX Cloud clusters for AI researchers and enterprise ... AWS, GCP, Azure, OCI, and private clouds + Scale systems sustainably through mechanisms like automation and evolve ...using tools like OpenTelemetry, Prometheus, Grafana, ELK Stack, Lightstep, Splunk , Datadog, etc. Ways to stand out from the… more
- ServiceNow, Inc. (San Diego, CA)
- It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... of the Fortune 500(R). Our intelligent cloud-based platform seamlessly connects people, systems , and processes to empower organizations to find smarter, faster, and… more
- Walmart (Sunnyvale, CA)
- … like Kafka. + Experience utilizing monitoring and alert tools like Prometheus, Splunk , and other related systems and excellent in debugging and troubleshooting ... ** We are seeking a talented and passionate Software Engineer III to join our team. The ideal candidate...customers every day. We build and maintain the critical systems that handle billions of transactions annually, directly impacting… more
- Walmart (Sunnyvale, CA)
- …adaptive security frameworks across the enterprise. **What you'll do:** As a **Principal Engineer ** at Walmart, you will serve as a key technical thought leader ... decisions, mentor teams, and lead by example in building high-scale, intelligent systems that integrate cutting-edge AI/ML and agentic technologies. You will operate… more
- Roche (Santa Clara, CA)
- …come. Join Roche, where every voice matters. **The Position** **Principal DevOps Engineer - ML/AI Algorithms** Developing software is great, but developing software ... a purpose is even better! As a Principal DevOps Engineer - ML/AI Algorithms, you will work on products...with container technology, including Kubernetes, AWS EKS, Helm Charts, Splunk , and Docker, along with provisioning infrastructure through IAC… more
- Evolent (Sacramento, CA)
- …the mission. Stay for the culture. **What You'll Be Doing:** The Security Engineer III is responsible for designing and implementing robust security measures to ... posture through comprehensive security design, implementation, and management. The Security Engineer III will work closely with cross-functional teams to develop and… more
- Insight Global (Hawthorne, CA)
- …Monitoring: Grafana, Splunk - Cloud & DevOps: Terraform, Helm, UNIX-like systems We are a company committed to creating diverse and inclusive environments where ... Description Insight Global is looking for a Full Stack Engineer in Hawthorne, CA to join a cutting-edge team...platforms. This team is responsible for designing and scaling systems that support millions of users across diverse geographies,… more
- US Bank (Los Angeles, CA)
- …what you excel at-all from Day One. **Job Description** As a Reliability Engineer , your role will be a combination of supporting production applications and ... latency, performance, efficiency, and effective proactive monitoring. The reliability engineer interfaces with business users, development teams and system… more
- Walmart (Sunnyvale, CA)
- …like Splunk , Dynatrace. + Ecommerce domain experience preferred in systems like Cart, Account, Orders management, Catalog, Product Recommendation, Checkout, Tax ... do ** Join our team as a Principal Software Engineer for a key position within Directed Spend Team...group is responsible for enabling the core services and systems that steers customers to buy pre-defined merchandise on… more
- The Walt Disney Company (Anaheim, CA)
- …including the MyDisneyExperience app and Hey, Disney! This Principal Software Engineer will sit in the Disneyland Ticketing Technology organization within Technology ... testing, and implementation of software components, fixes, improvements, and/or new systems and applications + Exercise full autonomy to interact with users… more