- NVIDIA (Santa Clara, CA)
- …and data systems that provide real-time understandings of our sophisticated, distributed infrastructure. As an engineer on our team, you will play a key ... supervises workload health, performance, and usage in critical engineering systems . This allows our global teams to work at...workflows. + Maintain and update the observability tools and systems to meet the needs of new/evolving chip design… more
- Microsoft Corporation (Mountain View, CA)
- …We bridge the gap between the latest state-of-the-art AI models and hardware eco- systems . We build software to enable running AI models everywhere, from the world's ... complexity of key components/pipelines to improve performance and/or efficiency of our systems + Interacting and collaborating with our partners both internal and… more
- Google (Mountain View, CA)
- …are seeking an engineer who excels at building robust, scalable software systems for machine learning research and applications. We particularly value strong ... software engineering skills with a proven ability to build robust and scalable systems . + Proficiency in deep learning frameworks like JAX, TensorFlow, or PyTorch is… more
- Google (Sunnyvale, CA)
- …Large Language Models (LLMs), Natural Language Processing (NLP), or agent-based systems . + Deep understanding of AI/ML concepts, algorithms, and software ... who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security,… more
- NVIDIA (Santa Clara, CA)
- …including sophisticated AI agents and fine tuning & integrating with enterprise production systems . + Play a key role in design, development, and deployment of AI ... deploying LLM-powered solutions for engineering assistants and multi-turn, multi-modal dialogue systems . + Make a difference by leveraging AI technologies to solve… more
- Snap Inc. (Los Angeles, CA)
- …roadmap of the Content Relevance team and optimize our personalized video recommendation systems + Advance the core ML capabilities and design, implement, and scale ... the overall architecture of the content recommendation systems , ensuring scalability, performance, and reliability + Collaborate with cross-functional teams to align… more
- ServiceNow, Inc. (Santa Clara, CA)
- …of the Fortune 500(R). Our intelligent cloud-based platform seamlessly connects people, systems , and processes to empower organizations to find smarter, faster, and ... this role, you'll develop and test disaster recovery plans for critical systems , analyze business risks and recovery requirements, and maintain IT infrastructure… more
- DataRobot (San Francisco, CA)
- …work and advance their careers. You'll work across our control plane systems , influence cross-team roadmaps, and bring pragmatic engineering practices into how we ... believe in shared ownership of our platform and aim to build systems that are resilient, observable, and require minimal intervention. **Key Responsibilities:** +… more
- Leidos (Marysville, CA)
- …program, supporting the United States Air Force in geographically distributed intelligence operations. **Combat Coders** directly support mission objectives by ... all aspects of full-stack applications. Your contributions will move directly to production systems and get immediate feedback. You will be working with a small… more
- iCIMS (Sacramento, CA)
- …and legacy applications + Implement monitoring, alerting, and dashboards for assigned systems + **Incident Management & Response:** + Respond to alerts and incidents ... SaaS experience in a global environment + Authentication and identity management systems knowledge + Cloud certifications (AWS, Azure, or Google Cloud) +… more