* Networking Ecosystem Management: Ability to interact effectively with leading networking vendors (e.g. Nvidia/Mellanox, HPE/Juniper, Cisco, Arista).
* HPC Software: Understand the overall HPC and AI environments, and the possible impact of MPI, job schedulers (e.g. Slurm), containerization, AI frameworks (TensorFlow, PyTorch), and other systems management tools (such as OpenNebula or similar) when designing the networking infrastructure.
* Storage: Understand the impact of parallel file systems (Lustre, GPFS), NVMe, tiered storage, and data lifecycle management when designing the networking infrastructure.
* Cloud & Hybrid HPC: Familiarity with integrating on-prem HPC networking with cloud-based HPC/AI systems.