- Walmart (Sunnyvale, CA)
- …to the roadmap of Walmart's core machine learning capabilities. + Create monitoring dashboards; perform latency tuning of deep learning models, scaling solutions to ... compare models, features, and hyperparameters; utilize A/B testing and continuous monitoring to validate and adjust models. + Possess excellent communication skills… more
- LiveRamp (San Francisco, CA)
- …with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and Terraform ... Dynamodb** + **Optimize the performance and cost of the systems and rightsize Kubernetes containers.** + **Work in close...clouds (GCP or AWS)** + **Experience with deployment and monitoring of highly scalable products.** + **Hands on experience… more
- Walmart (Sunnyvale, CA)
- …order management system. You'll independently handle high impact, critical software/ systems monitoring issues, troubleshoot business and production issues. ... of changes. * Provides support to the business for new and existing systems by responding to user questions, concerns, and issues (for example, technical… more
- Highmark Health (Sacramento, CA)
- …be responsible for contributing to our technical ecosystem including infrastructure, systems , networks, applications, integrations, etc. They will work closely with ... coding, testing andimplementingtechnical solutions, as well as providing general production monitoring and support, meeting defined scope, target dates and budgets… more
- Genentech (Oceanside, CA)
- …drug substance facility. The site employs highly integrated computer control systems to manage plant operations and manufacturing data. The candidate will ... OT compliance and Validation ensure compliance for all IT Systems . As part of the Site Team and larger...and enforce timely completion of assigned training. + Tracking, monitoring and reporting issues via ServiceNow ticketing system. +… more
- Rubrik (Sacramento, CA)
- …and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and enable proactive identification ... enable teams at Rubrik to develop secure software and protect data and systems with appropriate security controls. Information Security also develops systems to… more
- NVIDIA (Santa Clara, CA)
- …be doing: + You will be part of an DGX Cloud team responsible for production systems that enable large scalable GPU clusters to be used for a variety of AI ... software related to managing fleets of GPU nodes. + Implementing monitoring and health management capabilities that enable industry leading reliability,… more
- LiveRamp (San Francisco, CA)
- …**You will:** + **Be a key contributor to critical, highly-available, low-latency systems which power authentication and authorization for all of LiveRamp** + **Lead ... engineers on the team** + **Architect solutions for integrating our authN/authZ systems throughout LiveRamp's stack** + **Figure out ways to replace legacy… more
- Walmart (Sunnyvale, CA)
- …needs; determining and carrying out necessary processes and practices; monitoring progress and results; recognizing and capitalizing on improvement opportunities; ... like Cosmos, MongoDB, etc. + Strong knowledge of messaging systems like Kafka + Hands-on knowledge and experience with...like Kafka + Hands-on knowledge and experience with cloud systems like Azure, GCP + Clear understanding of design… more
- Walmart (Sunnyvale, CA)
- …support millions of Sam's Club customers-while laying the groundwork for **Agentic AI systems ** that consume and act on this data. + Partner with engineering, AI/ML, ... are discoverable, trustworthy, and consumable** by next-gen AI agents and LLM-based systems . + Collaborate across Sam's Club engineering teams to contribute to a… more