- Red Hat (Boston, MA)
- …build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed vLLM (https://github.com/vllm-project/) infrastructure in the ... in scalable inference systems and Kubernetes-native deployments. Your work with distributed systems and cloud infrastructure will directly impact enterprise AI…
- MongoDB (New York, NY)
- Our team is building the cloud-based distributed systems software responsible for the lifecycle of search indexes including: data ingestion, index building, ... industrial-strength backend software in a complex codebase + Experience developing distributed systems and cloud services + Experience with at least one modern…
- NVIDIA (Santa Clara, CA)
- …entire software stack. + Innovate and improve model architectures, distributed training algorithms, and model parallel paradigms. + Accelerate foundation model ... NVIDIA is now looking for AI Software Engineers for our GenAI Frameworks (Megatron Core...optimize models by designing and implementing the latest in distributed training algorithms, model parallel paradigms, model optimizations, defining…
- Microsoft Corporation (Redmond, WA)
- …work and beyond. **Responsibilities** + Design, develop and operate features for large-scale distributed software services and solutions. + Adhere to modern ... products, and develop one of the largest scale, business-critical distributed systems in Microsoft. Our services run in 25+...experience highly desirable. + Systematic and structured approach to software design. + Proficiency in Agile software…
- Oracle (Jackson, MS)
- …design and implement robust, high-performance solutions that scale across large, distributed systems. Responsibilities + Develop infrastructure software and ... and attitude of a start-up, with the scale and customer-focus of the leading enterprise software company in the world. You are the builder here. We are at the…
- Oracle (Nashville, TN)
- **Job Description** Oracle Cloud Infrastructure (OCI) is seeking a talented and motivated Senior Member of Technical Staff (IC3) to join our dynamic team that builds ... cutting-edge technologies and want to apply your skills in building robust distributed platforms, this could be the perfect opportunity for you. **Responsibilities**…
- NVIDIA (Santa Clara, CA)
- …looking for forward-thinking, hard-working, and creative people to join a multifaceted software team with high standards! This software engineering role involves ... workload in GPU cluster. As a member of the software development team, we will work with users from...understanding of Deep Learning frameworks like PyTorch and TensorFlow, distributed training and inference. + Knowledge of GPU cluster…
- Walmart (Dallas, TX)
- …component/application/complex. For agile methodology. Solution Design: Requires knowledge of software architecture, distributed systems, scalability, design patterns ... etc. development standards and tools (e.g., Monday.com, Linx, Embold, etc.) for software coding/configuration. Take initiative to learn the fundamentals of different coding…
- NVIDIA (Santa Clara, CA)
- The NVIDIA DGXC Data Services team is developing a cloud-native stack of software services and tools for managing data across hybrid and multi-cloud infrastructures. ... exabyte-scale, high-performance GPU-based training and inference jobs. You will craft software services to deliver functionality to NVIDIA's internal platforms and…
- Oracle (Des Moines, IA)
- …You will lead architecture and hands-on development across key layers: distributed processing, transactions, consensus, and storage engines. If you thrive at ... the intersection of large-scale distributed systems, database internals, and cloud platforms, this role offers the opportunity to advance the state of the art.…