- Microsoft Corporation (Sunnyvale, CA)
- …services with scalable and sustainable architecture and implementation and with high performance, low latency , and high availability. In this role, you will work ... with a unique group of talented engineers, scientists, and product managers to build the industry's best Responsible AI services. You will own the design of new AI services and integration with existing services such as Azure AI Content Safety, Azure OpenAI… more
- SpaceX (Hawthorne, CA)
- …STARLINK CUSTOMER SUPPORT Starlink is revolutionizing internet connectivity by providing high-speed, low - latency satellite internet to even the most remote and ... rural locations worldwide. Whether you're streaming, gaming, or working remotely, Starlink ensures a seamless online experience where traditional internet services fall short. Perfect for areas with unreliable or unavailable connectivity, Starlink is your… more
- Google (Sunnyvale, CA)
- …+ 1 year of experience in a technical leadership role. + Experience in Low Latency networks. + Experience in Kernel or user mode system software ... development. + Experience with Networking Protocols. Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information… more
- LinkedIn (Mountain View, CA)
- …serving feature data with high performance. Model Serving Infrastructure: this team builds low latency high performance applications serving very large & complex ... models across LLM and Personalization models. As an engineer, you will build compute efficient infra on top of native cloud, enable GPU based inference for a large variety of use cases, cuda level optimizations for high performance, enable on-device and online… more
- Actalent (San Diego, CA)
- …hospital/partner integrations, and CGM data exchange. * Ensure APIs are scalable, low latency , and fault-tolerant to meet business and customer needs. ... * Build responsive frontend and backend applications using modern frameworks and technologies such as Spring Boot, Node.js, and Express. * Work with NoSQL databases like Cassandra, MongoDB, DynamoDB, or RDBMS like MySQL, Postgres, Oracle. * Write and maintain… more
- The Hertz Corporation (San Francisco, CA)
- …will do:** + Design, implement and maintain applications that can be high-volume and low - latency + Contribute to all stages of software development lifecycle + ... Analyze user requirements to define business objectives + Envisioning system features and functionality + Develop and test software + Identify and resolve any technical issues arising + Create detailed design documentation + Propose changes to current… more
- Google (Mountain View, CA)
- …for streaming bi-directional dialog, so the user experience is always fluid and low - latency . + Rapidly prototype and evaluate new technologies. About you In ... order to set you up for success as a Research Scientist at Google DeepMind, we look for the following skills and experience: + PhD in Computer Science, or Machine Learning related field. + Experience working with LLMs. + Demonstrated experience in data… more
- Actalent (San Diego, CA)
- …CGM (Continuous Glucose Monitoring) data exchange + Ensure APIs are scalable, low - latency , and fault-tolerant. + Build responsive front-end and back-end ... applications using modern frameworks and technologies such as Spring Boot, Node.js, and Express. + Work with databases including: + NoSQL: Cassandra, MongoDB, DynamoDB + RDBMS: MySQL, PostgreSQL, Oracle + Write and maintain unit, integration, and end-to-end… more
- NVIDIA (Santa Clara, CA)
- …production. What you will be doing: + Optimize deep learning models for low - latency , high-throughput inference. + Convert and deploy models using frameworks such ... as TensorRT and TensorRT-LLM + Understand, analyze, profile, and optimize performance of deep learning workloads on state-of-the-art hardware and software platforms. + Collaborate with internal and external researchers to ensure seamless integration of models… more
- Palo Alto Networks (Santa Clara, CA)
- …large-scale AI/ML systems for performance, reliability, and developer-friendliness, focusing on low latency and high throughput in real-time AI applications ... + Technology Adoption: Evaluate and integrate new AI tools, frameworks, and cloud solutions, aligning with architectural guidelines. Lead POCs for emerging AI innovations + Architectural Best Practices: Champion design standards and best practices for AI… more