If you're excited to build systems, kernels, and tools that make large-scale AI faster, more efficient, and easier to deploy, we'd love to hear from you.

* Develop, optimize, and benchmark GPU kernels (hand-tuned and compiler-generated) using techniques such as fusion, autotuning, and memory/layout optimization; build and extend high-level DSLs and compiler infrastructure to boost kernel developer productivity while approaching peak hardware utilization.