- Amazon (Sunnyvale, CA)
- …for distributed AI training workloads * Develop comprehensive performance monitoring , metrics collection, and benchmarking tools for high-bandwidth cluster ... experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Strong programming… more
- Amazon (Sunnyvale, CA)
- …with the satellite ground control services, the customer engagement systems and monitoring services to constantly fine-tune our network and deliver a high quality ... language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a… more
- Insight Global (San Jose, CA)
- …and deploy resources using Infrastructure-as-Code Develop solutions using Azure DevOps Operations task, using cloud native tools, like Log Analytics, Azure Monitor ... and Azure Security Center or other monitoring tooling Deliver/update documentation (eg product descriptions, operational tasks)...to your environment Implement tasks related to network platform operations Local to San Jose or open to working… more
- Walmart (Sunnyvale, CA)
- …sellers can leverage Walmart's advanced fulfillment infrastructure to streamline operations and improve delivery to customers. In this fast-paced, innovation-driven ... and WFS on a global scale. + **Collaborating with product, business, and operations leaders** to define, prioritize, and execute the product and technology roadmap,… more
- Amazon (Cupertino, CA)
- …of new code changes to a variety of org-owned and customer-owned systems. Build monitoring tools and metrics to ensure hardware is running properly in both test and ... software development experience - 2+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience - 4+ years… more
- Coinbase (Sacramento, CA)
- …build infrastructure to provide the most secure and highest uptime*: Observability and monitoring is a cornerstone of the team's philosophy in order to ensure ... our customers*: We are always on the lookout for creative ways to optimize our operations as we continue to scale. * *We empower best in class staking experience for… more
- Zoom (San Jose, CA)
- …providing robust and reliable support for massive-scale data and service operations . What You'll Do: + Design, develop, and optimize Java-based microservices ... and resolve performance bottlenecks. + Set up and maintain comprehensive monitoring , alerting, and observability using APM tools and Linux-based diagnostics. +… more
- Coinbase (Sacramento, CA)
- …new financing products and features * Create and build new reporting, monitoring , tools, frameworks and APIs * Collaborate with engineers, product managers and ... developer efficiency, engineering excellence, and operational excellence * Provide support to operations and other engineering teams * Mentor team members and help… more
- Phillips 66 (Rodeo, CA)
- …process, improving energy efficiency, handling key logistics, improving equipment reliability , monitoring emissions or completing permit applications, optimizing ... in your internship and provide insight on your experience. Our **Refining** operations are comprised of global refining, marketing and transportation of petroleum… more
- Google (Sunnyvale, CA)
- …enhancements to keep our systems running smoothly. You also ensure that network operations are safe and efficient by monitoring network performance, coordinating ... who use Google services around the world. We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running… more