Strong proficiency in C/C++, Python, software design, programming techniques. * Improve Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of OSS software, including but not limited to projects like GGML, Llama.cpp, Ollama, ONNX Runtime. * Work closely with internal engineering teams and external on solving local end-to-end LLM & Generative AI GPU deployment challenges, using techniques like quantization or distillation. * BS or MS degree in Computer Science, Engineering, or related degree. * Experience working with open-source LLM and GenAI software.
mehr