- NVIDIA (Santa Clara, CA)
- …maintaining vital systems efficiently and reliably.. As a Senior Storage Product Engineer , you will take ownership of NVIDIA's Product Team's internal and ... Chef, Puppet, and Terraform for automating storage deployments. Experience with observability and tracing tools like InfluxDB, Prometheus, Grafana, and the Elastic… more
- NVIDIA (Santa Clara, CA)
- …within AI, ML, and HPC. Joining our team as a Storage & Networking Product Engineer involves being part of a group that fosters the development of highly available, ... tools (Ansible, Terraform, Puppet, Chef, Kubernetes). + Familiarity with observability stacks (Prometheus, Grafana, Elastic, InfluxDB) to monitor and optimize… more
- Zscaler (San Jose, CA)
- …agility with a cloud-first strategy. We're looking for an experienced Senior Staff Development Engineer to join our team. This role is hybrid and based in our San ... and overall architecture + Experience with graph databases such as Neo4j, alongside observability tools like Prometheus, Grafana, and logging systems such as the ELK… more
- Cisco (Santa Clara, CA)
- …web APIs and microservices (REST/gRPC), including testing, deployment, and basic observability (logs/metrics). + Demonstrated ability to work end-to-end on features: ... collaborate on design, implement, write tests, help deploy, and iterate based on metrics or feedback. **Preferred Qualifications:** + Experience or strong interest in RAG systems and vector databases (Weaviate, Qdrant, Milvus, FAISS, etc.). + Exposure to… more
- Cisco (San Jose, CA)
- …anomaly detection. + Familiarity with AI-driven DevOps automation and model observability . + Exposure to edge computing environments. + Experience on various ... AI cloud platforms such as AWS SageMaker, Google Cloud AI Platform, Azure ML or similar. + Strong written and verbal communication skills, with the ability to contribute to design discussions and documentation. + Excellent problem-solving skills, thinking… more
- CVS Health (Harrisburg, PA)
- …**Collaborate with DevOps to support CI/CD pipelines, infrastructure-as-code, and observability ** Collaborates with other members of the development team and ... stakeholders to make high-level architectural decisions, proposes design patterns, and ensures scalability, performance, and maintainability of digital solutions. Leverages advanced programming skills to design and implement complex features, optimize… more
- Microsoft Corporation (Redmond, WA)
- …solutions, and patterns that will improve the availability, reliability, efficiency, observability , and performance of products while also driving consistency in ... monitoring and operations at scale and share knowledge with other engineers. **Qualifications** **Required Qualifications:** + Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in… more
- Oracle (Des Moines, IA)
- …pipelines + ML model lifecycles + Security + identity management + Observability , CI/CD, deployment processes + Work with engineering leaders (eg, Priyank's teams) ... to ensure designs are accurately captured. **Cross-Functional Partnering** + Partner with engineering, applied science, product, and regulatory leaders to gather technical inputs. + Work with third-party regulatory consultants to incorporate their… more
- Oracle (Oklahoma City, OK)
- …ultra fast container runtime, cloud connectivity re-imagined, action-driven machine-friendly observability & supervision infra, and hardcore distributed systems. You ... will also: + Collaborate with cross-functional teams to design and build scalable, high-performance foundational platform services. + Define and improve engineering best practices, development processes, and design standards. + Design, implement, and maintain… more
- Walmart (Bentonville, AR)
- …lifecycle across multiple teams-including coding, testing, CI/CD deployment, observability , monitoring, incident response, and maintenance. Implements distributed ... architectures optimized for real-time data processing, AI/ML integration, and cross-service reliability. Maintains architectural decision records (ADRs) to ensure traceability, alignment, and transparency in technical planning and tradeoffs. AI/ML Integration… more