- Amazon (East Palo Alto, CA)
- …that are used by millions of companies worldwide to manage day-to-day operations. We will accomplish this by accelerating our customers' businesses through delivery ... massive datasets. - Develop robust monitoring and debugging tools to ensure the reliability and performance of training workflows on large GPU clusters. Design and…
- Amazon (Northridge, CA)
- …- Utilize network topology understanding and Software Defined Networking / network operations experience to interface with the Kuiper network team to develop APIs ... meet Kuiper and US Government performance needs including availability, reliability, upgradeability, interoperability, and security requirements - Effectively communicate…
- NVIDIA (Santa Clara, CA)
- …to have a strong programming background, knowledge of datacenter hardware, operations, and networking, familiarity with software testing and deployment, familiarity ... contribute to this platform to build end-to-end automation of datacenter operations, break/fix, and lifecycle management for large-scale Machine Learning systems. +…
- LinkedIn (Mountain View, CA)
- …Platform capabilities that power our product innovation. As a Senior Staff Software Engineer within the DPX Quality team, you will help envision the next generation ... deployment and craft of your team's infrastructure and systems, with high reliability and scalability. *Leverage your deep and broad technical expertise to mentor…
- Cognizant (Sacramento, CA)
- **Sr. Java Full Stack Engineer** Cognizant Digital Practice helps clients reinvent products, experiences, and business models to create new value, differentiation, ... + Collaborate with cross-functional teams, including Product, UX, QA, and Operations, to understand requirements and deliver solutions that meet business objectives.…
- Amazon (Northridge, CA)
- …the United States and our allied government customers. The Senior Power Electronics Engineer will play a pivotal role in the team delivering innovative and ... - Experience with complex board designs, such as high-speed interconnects, high-reliability designs, avionics, vehicle control systems, and/or motor & actuator…
- Robert Half Technology (Santa Ana, CA)
- Description We are looking for an experienced Sr. Network Engineer to design, implement, and maintain secure and efficient network solutions that support our ... internal departments to procure hardware, software, and services required for network operations. * Analyze and resolve complex network issues to ensure high…
- Hyundai Autoever America (Fountain Valley, CA)
- …is looking for a highly experienced and technically proficient Senior AI/ML Engineer to lead innovations and deliver impactful AI solutions for the automotive ... scalable AI/ML systems with a focus on performance and reliability. + Fine-tune and deploy LLMs for tasks like...support knowledge sharing and audits. Data Engineering & Model Operations: + Build data pipelines for training and deploying…
- Microsoft Corporation (San Francisco, CA)
- …up to date from a security and compliance perspective. As a Principal Software Engineer - Azure Kubernetes Service team, you will be responsible for working with key ... trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving…
- Microsoft Corporation (Mountain View, CA)
- …in revenue annually. We are seeking a highly skilled and experienced **Principal Software Engineer** to join our team in Mountain View, CA or Redmond, WA. In this ... Design and maintain comprehensive monitoring and alerting systems to ensure the reliability and performance of data pipelines and bidding applications. + Construct…