- NVIDIA (Santa Clara, CA)
- …with version control system, like Perforce + Familiarity with LSF or similar job scheduling systems for distributed computing. NVIDIA is home to some of the most ... innovative and dedicated engineers in the world. As our teams continue to grow, we are looking for creative and autonomous engineers who are passionate about technology and excellence. If you thrive in a fast-paced, collaborative environment, we want to hear… more
- Cisco (San Jose, CA)
- …(SDKs) and their application in networking. + Prior work with large-scale software systems and distributed computing. + Certifications such as PMP (Project ... Management Professional), CSM (Certified ScrumMaster), or Cisco Certifications (CCNA, CCNP, CCIE). + Experience leading projects that involve complex system integration and customization. + Experience in working with hardware teams, quality assurance, and… more
- Google (Irvine, CA)
- …evaluation processes, and in building, architecting, designing and implementing highly distributed global cloud-based systems , with an understanding of computing ... solutions. + Knowledge of technology solutions and ability to learn and work with new emerging technologies, methodologies, and solutions in the Cloud/IT technology space. + Ability to deliver results and work cross-functionally to position and orchestrate a… more
- Capital One (San Francisco, CA)
- …years of experience designing, implementing, and operating observability solutions for large-scale, distributed , and highly available systems . + 5+ years of ... experience with Open Source Observability tools such as OpenTelemetry, Prometheus, Grafana etc. Experience with emerging observability trends and technologies, such as eBPF, zero-code instrumentation, or AI/ML-driven anomaly detection is a plus. + 3+ years of… more
- Google (Sunnyvale, CA)
- …of computing solutions and building, architecting, designing and implementing highly distributed global cloud-based systems . + Knowledge of technology solutions. ... + Ability to learn and work with new emerging technologies, methodologies, and solutions in the Cloud/IT technology space. + Ability to deliver results and work cross-functionally to position and orchestrate a solution consisting of multiple products. +… more
- Oracle (Santa Clara, CA)
- …of experience in application development, including 3+ years working with large-scale distributed applications, web services, or systems design. + Experience ... with cloud computing platforms, technologies, and concepts. + Experience in programming languages such as C/C++, Java, and Python. Disclaimer: **Certain US customer or client-facing roles may be required to comply with applicable requirements, such as… more
- Meta (Menlo Park, CA)
- …experience in one or more of the following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high ... PyTorch and is on the critical path of multi-GPU distributed training. In other words, nearly every distributed... distributed training. In other words, nearly every distributed GPU-based ML workload in Meta Production goes through… more
- Meta (Menlo Park, CA)
- …of the following machine learning/deep learning domains: High speed networking (RDMA), Distributed ML Training, GPU architecture, ML systems , AI infrastructure, ... PyTorch and is on the critical path of multi-GPU distributed training. In other words, nearly every distributed... distributed training. In other words, nearly every distributed GPU-based ML workload in Meta Production goes through… more
- NVIDIA (Santa Clara, CA)
- …profiling and optimizing CUDA kernels. + Background with compression, storage systems , networking, and distributed computer architectures. Data Analytics is ... database operators or query planner, especially for parallel or distributed frameworks (eg production database or Spark). + Experience...search, join, aggregation, groupby, scaling up to multi GPU systems , and scaling out to many nodes. Take a… more
- NVIDIA (Santa Clara, CA)
- …Programming language. OO design preferred. + Experience with Make based build systems in large, distributed computing environments + Continuous Integration ... new compute farm technologies such as containers, volume cloning, distributed storage and distributed compute at scale + Deploy tracking metrics to resolve… more
Recent Jobs
-
Medical Laboratory Technician
- Avera (Platte, SD)
-
Advanced User Exp Designer
- Honeywell (Fort Mill, SC)
-
Non-Invasive Cardiologist, Newton
- Atlantic Health System (Hackettstown, NJ)
-
(USA) Principal, Software Engineer - AI Evangelist
- Walmart (Sunnyvale, CA)