Senior Systems Software Engineer - NV Cloud Functions
NVIDIA Cloud Functions (NVCF) is an open-source platform that links workloads to GPUs. It lets teams deploy, manage, and serve GPU-accelerated, containerized applications across regions and clusters worldwide. The platform routes inference, streaming, and batch jobs across decentralized GPU clusters. This allows endpoints to scale repeatably, whether hosted on-prem or in the cloud.

We are seeking a Senior Systems Software Engineer to join our team. You will focus on improving the performance, reliability, and scaling behavior of a system that routes AI workloads onto distributed GPU fleets. You will work on a polyglot platform that is now fully open source, with both control plane and edge deployments. The work suits someone with deep experience in systems performance, distributed systems, and Kubernetes-based runtimes. We are looking for engineers who want to learn and grow. Expect to be challenged in an environment with rapidly shifting priorities, where insight, focus, and execution are key.

What you'll be doing:
You'll be working in a distributed team that explores innovative ways to make GPU- and DPU-accelerated applications easier to develop, deploy, and monitor on the latest and greatest NVIDIA hardware.
Design and ship services in Java, Go, and Rust, building in the open on a public repository where your commits, design proposals, and reviews are transparent to the community.
Work on automating and optimizing build, test, integration, and release processes for cloud-native software.
Partner with engineering teams across NVIDIA so the platform integrates with adjacent NVIDIA technologies, including the KAI Scheduler, NVIDIA NIM, Grove, and Dynamo.
Help steward an open-source project. You will triage community issues and pull requests, and write docs that contributors can build on.

What we need to see:
Bachelor’s or Master’s degree in Computer Science or an equivalent program from an accredited university or college, and 8+ years of hands-on software engineering.
Expert-level knowledge of a systems programming language (Go, C, Rust) and a proven understanding of data structures, algorithms, and distributed software architecture.
Strong understanding of container orchestration systems (Kubernetes) and container technologies, with hands-on automation experience in continuous integration frameworks like GitLab and ArgoCD.
Expertise in a scripting language (Bash, Python), and knowledge of and experience with the system internals of Unix and Unix-like kernels such as Linux.
Understanding of performance, security, and reliability in complex distributed systems.

Ways to stand out from the crowd:
Background with pub-sub models and message queues.
Experience optimizing for high-throughput network paths, with a working understanding of unary versus streaming and bidirectional protocols across HTTP/2 and gRPC.
Experience developing Kubernetes Custom Resources and Operators deployed on cloud service providers.

We have some of the most hard-working and skilled people in the world working for us, and our world-class engineering teams are growing fast. If you're a creative and self-motivated engineer with a real passion for technology, we want to hear from you!

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family at www.nvidiabenefits.com.
Original posting: hitmarker.net