Principal Software Engineer - AI Inference

Nvidia · JR2013753
NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly on NVIDIA GPUs and systems. You will also strengthen the underlying stack for high-throughput, low-latency inference at scale. This is a hands-on, deeply technical role for someone who excels at the intersection of inference ru…
Apply on original site
← Browse all jobs on Jobich.ch