Improve the Windows LLM and GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of OSS software, including but not limited to projects like GGML, Llama.cpp, Ollama, and ONNX Runtime.

With our invention of the GPU, the engine of modern visual computing, the field has expanded to PC games, movie production, product design, medical diagnosis, research, and AI.

* 5+ years of professional experience in local GPU deployment, profiling, and optimization.
* Strong proficiency in C/C++, Python, software design, and programming techniques.
* Familiarity with and development experience on the Windows operating system.
* Experience working with open-source LLM and GenAI software.
* Experience with CUDA and NVIDIA's Nsight GPU profiling and debugging suite.
* Experience with GPU-accelerated AI inference driven by NVIDIA APIs, specifically cuDNN, CUTLASS, and TensorRT.