Our models power the tools used by millions of creators, developers, and businesses worldwide, and FLUX is among the most advanced generative systems in the world. You'll shape training objectives, architectures, data strategies, and systems behind our joint image, video, and audio foundation models, with a direct line from your research to products used by millions. * Lead large-scale pretraining experiments for our multimodal (image, video, audio) foundation models (architecture, objective functions, scaling strategies) * Contribute across the full stack: low-level GPU and systems optimizations, research code, and high-level model design * You've led or co-owned pretraining for a foundation model (image, video, LLM, or multimodal) that shipped to production or a major release * Strong intuition for architecture and objective design — you've made calls on attention patterns, modulation schemes, or loss ...
mehr