• Production Engineer, Network

    Meta (Menlo Park, CA)
    …ensure optimal system performance. 17. Proven experience designing, developing, and operating distributed systems at scale, with an in-depth understanding of the ... seeking a Production Engineer with in-depth understanding of networking, systems , automation, and tooling to join the PE Network...network is a foundational component in achieving the company's AI goals and this role would play a key… more
    Meta (08/21/25)
    - Related Jobs
  • Senior Software Engineer

    Microsoft Corporation (Redmond, WA)
    …* 5+ years of experience writing production code in building internet scale services and distributed systems . * Ability to debug, read code and work on a large ... The AI Platform organization at Microsoft builds the end-to-end...include the following: . Design, implement, and support scalable, reliable , high-performance services . Write clean and concise code… more
    Microsoft Corporation (09/25/25)
    - Related Jobs
  • DevSecOps Engineer

    TekSynap (Reston, VA)
    …pipelines for NGA data lakes and object storage (eg, AWS S3), ensuring secure, reliable data ingestion into the agentic AI platform. + Implement robust CI/CD ... for system accreditation. + Design microservice architectures to support scalable, distributed agentic AI services, leveraging RESTful APIs, event-driven… more
    TekSynap (09/18/25)
    - Related Jobs
  • Software Engineer II

    Microsoft Corporation (Redmond, WA)
    …technologies: Proficient in backend technologies and cloud-based architectures to build large-scale distributed systems and AI /ML experience + Experience in ... in usability, accessibility, and performance. As part of Microsoft's AI -first transformation, we are integrating Copilot capabilities into end-user… more
    Microsoft Corporation (10/07/25)
    - Related Jobs
  • Senior Embedded Software Engineer

    Gecko Robotics (Pittsburgh, PA)
    …Linux kernel development, Loadable kernel modules, State machines, Event-driven architectures, Distributed systems , Multi-process parallel systems , IPCs and ... sacrificing real-time performance. + Architect and implement state machines and event-driven systems to ensure reliable robot operation. + Debug complex embedded… more
    Gecko Robotics (09/20/25)
    - Related Jobs
  • Manager, Software Engineering - Enterprise…

    LinkedIn (Mountain View, CA)
    …building and testing of highly reliable , available and scalable large-scale distributed systems and client-server architectures. + Shipped big projects with ... and lead the teams that design, implement, and optimize the performance of large-scale distributed systems with security and compliance in mind. + You will… more
    LinkedIn (10/02/25)
    - Related Jobs
  • Staff Software Engineer, Frontend, Gemini…

    Google (Sunnyvale, CA)
    …technical direction, architecture, and best practices to ensure high-quality, performant, and reliable user experiences. + Leverage AI tools and frameworks to ... (eg, Python, C, C++, Java, JavaScript). + 1 year of experience leveraging AI tools or frameworks to accelerate development and engineering productivity. + Experience… more
    Google (10/02/25)
    - Related Jobs
  • Engineering Manager, Research

    Red Hat (Raleigh, NC)
    …Manager who will lead and develop a world-class team of engineers delivering reliable and high-performing AI , container, linux, and hybrid cloud technologies to ... will enable the Research group to solve complex problems and drive innovative systems development with our academic and research partners and internal Red Hat… more
    Red Hat (10/11/25)
    - Related Jobs
  • Senior Software Engineer, Bare Metal Automation…

    NVIDIA (Santa Clara, CA)
    …from the crowd: + Technical competency in managing and automating large-scale distributed systems independent of cloud providers. Advanced hands-on experience ... part of an DGX Cloud team responsible for production systems that enable large scalable GPU clusters to be...bare metal hardware. + Proven operational excellence in maintaining reliable and performant AI infrastructure. NVIDIA is… more
    NVIDIA (09/29/25)
    - Related Jobs
  • Engineering Manager, Search Platform

    DoorDash (San Francisco, CA)
    …years of leadership experience + Prior experience building highly available, scalable, and reliable distributed systems + Strong communication skills and ... per second. This requires the platform to be scalable, reliable , fast, and provide high app quality. AI...business environment + Experience with Search/Information Retrieval/ML or stateful distributed systems like storage is a big… more
    DoorDash (08/02/25)
    - Related Jobs