Influence the design of next-generation hardware architectures, software, and programming models in collaboration with research, hardware, system software, libraries, and tools teams at NVIDIA. Do you find it rewarding to identify and eliminate system bottlenecks to achieve the best possible performance on pioneering computer hardware? * Programming fluency in C/C++ with a deep understanding of algorithms and software development. * A background that includes parallel programming, e.g., CUDA, OpenACC, OpenMP, MPI, pthreads, etc. * Expertise in parallelization and performance optimization of Deep Learning models arising from Natural Language Processing, Computer Vision, Recommender Systems, etc.
mehr