- LiveRamp (San Francisco, CA)
- …with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability monitoring and alerting** + **Maintain and enhance CI/CD Tooling and Terraform ... such as SingleStore DB, ScyllaDB, Cassandra or Dynamodb** + **Optimize the performance and cost of the systems and rightsize Kubernetes containers.** + **Work… more
- NVIDIA (Santa Clara, CA)
- …with kubernetes including cluster operations, operator development, node health monitoring and working with GPU resource scheduling. We welcome out-of-the-box ... software related to scheduling GPU resources on kubernetes. + Implementing monitoring and health management capabilities that enable industry leading reliability,… more
- TP-Link North America, Inc. (Irvine, CA)
- …better! At TP-Link Systems Inc, we are committed to crafting dependable, high- performance products to connect users worldwide with the wonders of technology. ... simplicity, we aim to assist our clients in achieving remarkable global performance and enable consumers to enjoy a seamless, effortless lifestyle. KEY… more
- Amazon (Cupertino, CA)
- …we are able to develop creative and new designs that set the standards on performance , quality, cost, and operational excellence. What you will do: As a member of ... you will oversee the fleet of servers you develop, monitoring their quality and how they are meeting the...be constantly looking for ways to improve your products' performance , quality, and cost. We're changing an industry, and… more
- Walmart (Sunnyvale, CA)
- …Deploy and monitor products on Cloud platforms + Develop and implement best-in-class monitoring processes to enable data applications meet SLAs + Guide the team ... compensation package, you can receive incentive awards for your performance . Other great perks include 401(k) match, stock purchase...Sam's Club, we offer competitive pay as well as performance -based bonus awards and other great benefits for a… more
- NVIDIA (Santa Clara, CA)
- …and excellent communication and planning abilities. Experience working with High Performance Computing (HPC), GPUs, and high- performance networking (RDMA, ... lifecycle management for large-scale Machine Learning systems. + Implement monitoring and health management capabilities that enable industry-leading reliability,… more
- Coinbase (Sacramento, CA)
- …and operations work. * Collaborate with our core infrastructure team to performance tune and optimize our cloud deployments. (Think Docker, Terraform, Kubernetes, ... transparent cultural practices. * Strong skills around observability, debugging and performance tuning * Strong communication skills and ability to explain technical… more
- Walmart (Sunnyvale, CA)
- …needs determining and carrying out necessary processes and practices monitoring progress and results recognizing and capitalizing on improvement opportunities ... At Walmart, we offer competitive pay as well as performance -based bonus awards and other great benefits for a... Additional compensation includes annual or quarterly performance bonuses. Additional compensation for certain positions may… more
- Google (Sunnyvale, CA)
- …You also ensure that network operations are safe and efficient by monitoring network performance , coordinating planned maintenance, adjusting hardware components ... and processes to improve them. + Develop software that improves the performance , safety, transparency, and manageability of network systems. + Participate in… more
- Cardinal Health (Sacramento, CA)
- …process improvements and back-end solutions for commercial technologies to maximize performance and suitability for business needs. This job family manages ... to production outages. + Analyze production system operations using tools such as monitoring , capacity analysis and outage root cause analysis to identify and drive… more