- NVIDIA (Santa Clara, CA)
- …and reliability aspects of large-scale AI clusters, focusing on performance at scale, training stability, real-time monitoring, logging, and alerting + Engage ... operation, and optimization + Co-design telemetry of AI workloads to help engineering build solutions for more robust workloads at scale + Communicate across… more
- TAD PGS, Inc. (Canoga Park, CA)
- …to lead the analysis, design, and optimization of turbomachinery for liquid rocket engine systems . The ideal candidate will possess deep expertise in the design and ... performance analysis of inducers, impellers, volutes, pumps, turbines, and...combustion, expander, etc.), cryogenic propellants, and high-speed rotating equipment reliability . May coordinate risk and opportunity management for development… more
- Butler America (Canoga Park, CA)
- …to lead the analysis, design, and optimization of turbomachinery for liquid rocket engine systems . The ideal candidate will possess deep expertise in the design and ... performance analysis of inducers, impellers, volutes, pumps, turbines, and...staged combustion, expander, etc.), cryogenic propellants, high-speed rotating equipment reliability May coordinate risk and opportunity management for development… more
- ServiceNow, Inc. (Santa Clara, CA)
- …of the Fortune 500(R). Our intelligent cloud-based platform seamlessly connects people, systems , and processes to empower organizations to find smarter, faster, and ... ** The AI Agents team is a key part of our broader Platform Engineering division, focused on creating innovative solutions that empower our customers to design and… more
- SpaceX (Hawthorne, CA)
- …and drive design changes related to department priorities such as simplification, reliability , and performance . + Develop and continually improve the ... assembly, etc. + Understanding of test methods/setups and data acquisition systems . + Experience generating, reading, and interpreting engineering drawings.… more
- LiveRamp (San Francisco, CA)
- …evaluation methodologies (eg, using LLM-as-a-judge, adversarial testing) to measure agent performance , reliability , safety, bias, drift, and effectiveness. + ... the end-to-end development of advanced LLM-based agents, including prompt engineering , fine-tuning, building specialized tools/functions for agents, and designing… more
- Oracle (Redwood City, CA)
- …across AI and database projects. + Promote and ensure best practices for performance , reliability , security, and compliance. + Provide technical mentorship and ... secure, and maintainable architectures using cloud-native, microservices, and event-driven systems . + Lead architecture reviews, develop proofs-of-concept, and solve… more
- Oracle (Santa Clara, CA)
- …Ecosystem Management team. This team is reimagining how we ensure the security, reliability , stability, and performance of software components used across OCI. ... operating independently, and have experience building, growing, and managing high-performing engineering teams that deliver results. You will guide engineers in… more
- Amazon (San Francisco, CA)
- …language experience - 4+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience ... maintainable features Drive improvements in code quality, maintainability, and app performance Mentor peers and elevate our mobile development standards Work with… more
- Amazon (San Francisco, CA)
- …company specific interfaces for accessing Oracle, RedShift, and Spark storage systems . Build relationships with stakeholders and counterparts. Analyze data for ... approaches, expected and observed outcome, and other business defined key performance indicators. Implement models that comply with evaluations of the computational… more