Design and optimize GPU algorithms and APIs, from high-level interfaces down to low-level performance tuning involving memory, parallelism, and synchronization. * Collaborate with experienced ; participate in design reviews, code reviews, and open-source-style workflows. * Experience with software libraries or open-source projects, including testing, performance profiling, and code reviews. * Comfort navigating and debugging large, multi-language codebases (C++, Python, CMake, GitHub Actions CI systems) with demonstrated interest in tools, library design, and making other faster and more productive.
mehr