- Intuit (San Diego, CA)
- **Overview** Come join the Identity Team as Site Reliability / DevOps Engineer (System Engineering). Identity is at the heart of all offerings across Intuit and is ... platforms to enable faster and automatic recovery. + Design and develop observability components for massive scale platforms, to detect issues quickly and isolate… more
- NVIDIA (Santa Clara, CA)
- …spectrum of challenges. Practices such as proactive storage performance monitoring , automated fault detection and remediation, scalable data replication strategies, ... high availability, and data integrity. + Develop and maintain storage monitoring , logging, and alerting systems to ensure proactive detection and resolution… more
- Google (Sunnyvale, CA)
- … observability , and governance. This includes the interfaces, performance monitoring for the application-centric view, policy and governance management, and ... Staff Software Engineer , App Hub _corporate_fare_ Google _place_ Sunnyvale, CA,...to digitally transform its business and industry. We deliver enterprise -grade solutions that leverage Google's cutting-edge technology, and tools… more
- The Walt Disney Company (San Francisco, CA)
- …Public Cloud Provider (eg, AWS, Microsoft Azure, Google Cloud) + Experience with observability tools for metrics, logging, and monitoring (eg, Datadog, Splunk, ... customers. We are looking for an experienced backend services focused Software Engineer to join the Partner Experience Engineering team. Your success will depend… more
- NVIDIA (Santa Clara, CA)
- …within AI, ML, and HPC. Joining our team as a Storage & Networking Product Engineer involves being part of a group that fosters the development of highly available, ... end-to-end performance across the full stack. + Develop automated systems for monitoring , recording, and notifying in storage and networking. + Build and maintain… more
- NVIDIA (Santa Clara, CA)
- …building for performance and reliability at global scale, covering automation, monitoring , high availability, capacity planning, and lifecycle management. + Define ... optimizations (SR-IOV/ DPU) + Experience with Technologies like eBPF and XDP for Observability & DDoS mitigation + Collect and review system data for capacity and… more
- General Motors (Mountain View, CA)
- …lifecycle software development, from design and implementation to deployment and monitoring . + Provide technical leadership across multiple teams, ensuring alignment ... architectural gaps, and security vulnerabilities. + Lead integration efforts with enterprise systems, ensuring seamless communication and data flow across services.… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …platform and processes to improve operations. Key Responsibilities: + Implement monitoring framework to improve infrastructure reliability, observability , and ... and EDA. + Demonstrated ability to operate and manage large-scale, enterprise -grade environments + Excellent communication skills, both written and verbal. +… more
- Walmart (Sunnyvale, CA)
- …and implementing self-service ML deployment platforms and API gateways for enterprise environments **Advanced Observability & Monitoring Excellence** ... across e-commerce, supply chain, and in-store systems. + **Build intelligent observability and monitoring systems** using ML-driven anomaly detection, predictive… more
- PennyMac (Westlake Village, CA)
- …partners. + Background in implementing and optimizing monitoring , alerting, and observability solutions at an enterprise scale. Why You Should Join As ... maintaining service level agreements (SLAs) that meet or exceed business requirements. + Monitoring & Observability - Lead the development and implementation of… more
Recent Jobs
-
Engineer, Senior Control (Midland, TX or Carlsbad,NM)
- Epco, Inc. (Midland, TX)