- Amazon (Cupertino, CA)
- …to run the largest cloud computing infrastructures in the world, and managing systems at scale ? If yes, come join us. Key job responsibilities You will be part of a ... drives changes back into development and builds mechanisms to scale and efficiently operate our infrastructure at the Edge...to build rock solid never-fail, highly-secure hardware at world-class scale and who obsess over customers. A day in… more
- Amazon (Cupertino, CA)
- Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... operations that enable AI to scale across multiple accelerators & servers. Most of our...ML customers, iterate fast and deliver meaningful solutions at scale , then come join us! This truly is a… more
- Amazon (Cupertino, CA)
- …AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud- scale machine learning accelerators and servers that use them. This role is for a ... software engineer in the Machine Learning Inference Model Enablement and...a wide variety of LLM model families, including massive scale large language models like the Llama family, DeepSeek… more
- Palo Alto Networks (Santa Clara, CA)
- …Cortex DevOps team, your role involves operating and maintaining a large- scale GCP environment, including the design, implementation, and continuous enhancement of ... managed high cardinality metrics, implemented tracing, and operationalized large scale logging solutions. As part of this role, you...Expertise - 4+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong… more
- IBM (San Jose, CA)
- …any image for any cloud. * Waypoint makes infrastructure easily accessible at scale , enabling platform teams to deliver golden patterns and workflows with an ... an emphasis on Golang development * Lead and execute large- scale projects, ensuring the reliable delivery of key features...have at least 8+ years of experience as an engineer * You have professional experience developing with modern… more
- Meta (Los Angeles, CA)
- …impact, we encourage you to apply. **Required Skills:** Software Engineer , Infrastructure Responsibilities: 1. Collaborate with cross-functional teams (product, ... Qualifications:** Preferred Qualifications: 14. 6+ years relevant experience building large- scale infrastructure applications or similar experience 15. Experience with… more
- Meta (Menlo Park, CA)
- …comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. Active member of a multi-disciplinary team to ... develop solutions for large scale training systems. 2. Responsible for the overall performance of the communication system, including performance benchmarking,… more
- Microsoft Corporation (Mountain View, CA)
- …both client and service APIs and micro-services that operate at high scale . It provides exciting opportunities in building resilient, highly available, and highly ... can thrive at work and beyond. **Responsibilities** + Software Development Engineer working within an agile development environment with other developers and… more
- Google (San Jose, CA)
- …with information and one another. Our products need to handle information at massive scale , and extend well beyond web search. We're looking for engineers who bring ... ideas from all areas, including information retrieval, distributed computing, large- scale system design, networking and data storage, security, artificial… more
- Meta (Menlo Park, CA)
- …"Apply to Job" online on this web page. **Required Skills:** Software Engineer (Systems) Responsibilities: 1. Research, design, develop, build and test operating ... diverse scope and design core, backend software components. 4. Handle large scale data storage, synchronization and coordination of large server cluster, and provide… more