- Amazon (Sunnyvale, CA)
- …training platform for large language models up to 400B parameters Design high- performance training systems that produce models optimized for edge deployment ... a mentor, tech lead or leading an engineering team - Experience with distributed systems or high- performance computing - Proficiency in Python and at least one… more
- Walmart (Sunnyvale, CA)
- …identifying and addressing customer issues,** ensuring continuous improvement in platform performance , reliability, and user experience. + **Cultivating a culture of ... of Data Structures + 8+ years of experience in systems design, algorithms, and distributed systems . +...knowledge would be an added advantage + Expertise in cloud infrastructure, such as Open Stack, Azure, GCP, or… more
- Walmart (Sunnyvale, CA)
- …requirements; translating requirements into mobile solutions for multiple operating systems (for example, iPhone, Android); gathering requested information (for ... integrating solutions to ensure they are applicable to multiple operating systems ; developing user interface solutions; conducting testing to ensure solution is… more
- Broadcom (Palo Alto, CA)
- …existing VMware services to correct errors, adapt to new hardware, and improve performance ; * Develop and direct VMware systems testing and validation ... Develop scalable build and testing infrastructure for large scale distributed software systems ; * Participate in the architecture, design, and implementation of next… more
- Aeris Communications (San Jose, CA)
- …and architectures to identify scalable and efficient solutions that address system -level challenges and support secure, high- performance product development in ... platform capabilities. + Help drive the design and definition of next generation high performance hybrid cloud software platforms that enable the next phase of… more
- GE Aerospace (San Francisco, CA)
- …services + Build capacity models to baseline transactional load compared to resource performance and leverage data to predict overall system capacity while ... **Job Description Summary** GE is building operations teams focused on performance and availability of Compute and Network infrastructure consumed by all business… more
- eightfold.ai (Santa Clara, CA)
- …+ Diagnose and troubleshoot issues in complex distributed environments and optimize system performance . + Contribute to the team's technical growth and ... team at Eightfold.ai is at the forefront of developing intelligent, autonomous systems that will redefine talent management. We are building cutting-edge agentic AI… more
- Walmart (Sunnyvale, CA)
- …Design scalable, low-latency services to host models; productionize prototypes on the cloud , including data pipelines, training & inference pipelines, and pre & ... learning models, scaling solutions to enterprise level; investigate and resolve performance issues. + Run experiments to compare models, features, and… more
- Palo Alto Networks (Santa Clara, CA)
- …for impactful storytelling + Nice to haves: + Able to troubleshoot system -level integration and performance issues + GitHub portfolio or equivalent ... with precision. **Your Career** As a Principal Software UI/Frontend Engineer of the AIOps engineering team, you will collaborate...applications with Docker and Kubernetes + Experience with Google Cloud Platform (GCP) is a plus + Experience working… more
- Palo Alto Networks (Santa Clara, CA)
- …Visualizations for impactful storytelling **Nice to haves:** + Able to troubleshoot system -level integration and performance issues + GitHub portfolio or ... with precision. **Your Career** As a Senior Staff Software Engineer of the AIOps engineering team, you will collaborate...applications with Docker and Kubernetes + Experience with Google Cloud Platform (GCP) is a plus + Experience working… more