With our invention of the GPU - the engine of modern visual computing - the field has expanded to PC games, movie production, product design, medical diagnosis, research and AI.

* Improve the Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of OSS software, including but not limited to projects like GGML, Llama.cpp, Ollama, ONNX Runtime.
* 5+ years of professional experience in local GPU deployment, profiling and optimization.
* Strong proficiency in C/C++, Python, software design, and programming techniques.
* Familiarity with and development experience on the Windows operating system.
* Experience working with open-source LLM and GenAI software.
* Experience with CUDA and NVIDIA's Nsight GPU profiling and debugging suite.
* Experience with GPU-accelerated AI inference driven by NVIDIA APIs, specifically cuDNN, CUTLASS, TensorRT.