- NVIDIA (Santa Clara, CA)
- …Distributed Software design and development. + Understanding of performance, security and reliability in complex distributed systems. Ways to stand out from the ... crowd: + Strong experience with Go and Rust programming languages. + Real world strong hands on experience with Containers and Kubernetes. + Proficient with Kata Containers and Container sandboxing technologies + Knowledgeable with virtualization technologies,… more
- Google (Mountain View, CA)
- …the stack, from managing our cloud infrastructure to ensuring the security and reliability of our production systems. You will work closely with domain experts to ... refine and iterate on our technology stack, and we will rely on you to apply your knowledge and expertise to come up with and implement novel and impactful ideas. **How you will make 10X Impact:** + Act like an owner; be fearless in diving deep, asking… more
- NVIDIA (Santa Clara, CA)
- …to deploy AI models in production environments, ensuring performance, safety, and reliability standards are met. + Integrate machine learning models directly with ... vehicle firmware to deliver production-quality, safety-critical software. What We Want to See: + Hands-on experience building LLMs, VLMs, or VLAs from scratch or a proven track record as a top-tier coder passionate about autonomous systems. + BS/MS in Computer… more
- TEKsystems (San Francisco, CA)
- …for the well-being of our SaaS production system - quality, performance, scalability, reliability and efficiency * Invent and reinvent how we build, deploy and ... operate * Keep abreast of technologies and tools. Embrace and contribute to open source communities Skills Python, Go, Java, Architecture, Aws, Development Top Skills Details Python,Go,Java Additional Skills & Qualifications * Really know about scalable… more
- Newegg Inc. (Diamond Bar, CA)
- …architectural changes and design enhancements to the infrastructure to improve reliability , redundancy, and performance, reduces costs and anticipates Company growth ... and acquisitions. * Ensure that all network-related procedures and policies are documented including diagrams, disaster recovery and all network configurations. * Monitor system performance and provide security measures, troubleshooting and maintenance as… more
- City and County of San Francisco (San Francisco, CA)
- …Newly hired 7484 employees are required to become North American Electric Reliability Corporation (NERC) certified by the completion of their probationary periods in ... order to fulfill the federal mandate established by the WECC Compliance Office. Physical Demands: strength and mobility to work in a typical plant operations setting, including operating hand and power tools; driving to various work sites; stamina to perform… more
- NVIDIA (Santa Clara, CA)
- …pragmatic tooling and processes to support deployment, monitoring, and operational reliability + Collaborate with engineers and cross-functional partners to ensure ... our integrated systems are supportable, debuggable, and production-ready + Document and present complex systems and tradeoffs clearly to both technical and non-technical audiences + Mentor and support teammates in developing scalable thinking and strong… more
- Walmart (Sunnyvale, CA)
- …+ Unit-testing code for robustness, including edge cases, usability, and general reliability + Collaborate with cross-functional teams to define, design, and ship ... new features + Experience with bug fixing and improving application performance + Work with outside data sources and APIs + Experience with the agile methodology Scrum + Continuously discover, evaluate, and implement new technologies to maximize development… more
- NVIDIA (Santa Clara, CA)
- …at NVIDIA, you will be a key technical leader driving quality and reliability across our next-generation Data Center Systems. Your work will directly impact NVIDIA's ... ability to deliver robust, secure, and high-performing solutions for AI, HPC, and cloud-scale systems. + Define End-to-End Security Test Strategy: Own and drive the pre-QA system-level test architecture and validation strategy for security across multiple… more
- NVIDIA (Santa Clara, CA)
- …of AI chips and memory devices is a plus. + Knowledge of quality and reliability concepts as well as manufacturing operations is a plus. + Good command of English ... in both written and spoken. And good communication and documentation skills. + High sense of responsibility, self-motivated, good team player. With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the… more