- Amazon (Mountain View, CA)
- …maximize your customer's business value of AWS is critical to a solutions architect . You should also have a demonstrated ability to think long-term about business, ... bottom line perforamance. At your disposal will be the full breadth of AWS cloud services including Compute, Networking, GenAI, ML, Edge, Analytics and Storage … more
- Palo Alto Networks (Santa Clara, CA)
- …robust data collection pipelines using Java multithreading and Apache Beam frameworks + Architect and implement scalable ETL and ELT processes to handle data ranging ... to petabytes in size + Design and optimize data storage and indexing solutions for both real-time streaming and...+ Familiar with at least one of the major cloud platforms (eg., GCP, AWS or Azure) **These will… more
- TE Connectivity (CA)
- …designing with Hyperscale and ODM customers is strongly preferred. + Knowledge of Cloud architecture. (server, switch, storage , AI, related ICs and protocols) + ... find appropriate solutions in conjunction with your assisting FAE and Sales Architect teams. + Market Insights: Stay up-to-date with industry trends, product… more
- Cisco (San Jose, CA)
- …the performance and reliability of bare metal, but with the flexibility of cloud -native systems. Your contributions will empower internal and external users to run ... and frameworks with **multi-tenant isolation** and **QoS guarantees** . + Architect systems for **secure GPU sharing** , including time-slicing, memory partitioning,… more
- NVIDIA (Santa Clara, CA)
- …in machine learning domain + Familiarity with CI/CD pipeline development, cloud -based testing (AWS/GCP), data storage and database solutions, implementing ... like TensorRT, JAX, PyTorch, NeMo, TensorFlow, or others. + Able to architect , design, implement and debug complex solutions using Python programming language… more
- NVIDIA (Santa Clara, CA)
- …variety of LLM frameworks (eg, TensorRT-LLM, vLLM, SGLang). + Disaggregated Serving: Architect and optimize the separation of prefill (context ingestion) and decode ... management and transfer of large KV caches across heterogeneous memory and storage hierarchies, using the NVIDIA Optimized Transfer Library (NIXL) for low-latency,… more