- NVIDIA (Santa Clara, CA)
- …Kubernetes, Docker, Kubeflow), along with agentic architectures (eg, MCP, LangGraph) and observability tools for monitoring autonomous systems. + Track record of ... NVIDIA is seeking a highly technical Senior Director to lead Developer Relations Managers to...assurance, and post-release support. + Experience (as a software engineer or technical product manager) in one or more… more
- TEKsystems (Oakland, CA)
- …where users can buy, sell, and store cryptocurrencies, is seeking a high-level, Senior SRE to join their AI Infrastructure team. The following experience is ... as code - GCP or AWS Cloud Infra (logging, observability , pub/sub, cloud syncs) - Vector.dev and Datadog for...supervision Description We are looking for a Site Reliability Engineer (SRE) to join the IT AI Infrastructure team… more
- PennyMac (Westlake Village, CA)
- …maintaining service level agreements (SLAs) that meet or exceed business requirements. + Monitoring & Observability - Lead the development and implementation of ... Site Reliability Operations Engineers across all levels (1,2,3, & Senior ). Foster a culture of excellence, collaboration, and continuous...comprehensive monitoring and observability practices using New Relic… more
- Oracle (Sacramento, CA)
- …that make Oracle Cloud self- monitoring , self-diagnosing, and self-healing. As a Senior Data Scientist / Data Engineer , you will design and operationalize ... tools (Terraform, Ansible) and DevOps automation. + Familiarity with data observability and model-drift monitoring . + Strong collaboration and communication… more
- Oracle (Sacramento, CA)
- …systems (PaaS/SaaS)** . + Strong technical foundation in **cloud infrastructure** , ** monitoring ** , ** observability ** , and **automation frameworks** . + ... **Job Description** ** Senior Manager, Technical Program Management - Root Cause... technical leadership role for an experienced **site reliability engineer (SRE), DevOps professional, or technical program manager** who… more
- Deloitte (Sacramento, CA)
- …in enterprise contexts. + Design and build for enterprise-grade operations, embedding observability , monitoring , cost management, and lifecycle tooling to ensure ... applications - delivering production-grade reliability, scalability, and performance. + Engineer core solution components directly, including data integration layers,… more
Recent Jobs
-
Senior Manager, Brand Capabilities
- Edgewell Personal Care (New York, NY)
-
Director of Compliance-Testing
- Bank OZK (Little Rock, AR)
-
Senior Buyer EDH Purchasing
- Powell Industries, Inc. (Houston, TX)