Our models power the tools used by millions of creators, developers, and businesses worldwide, and FLUX is among the most advanced generative systems in the world. We're building the foundation models that power the next wave of visual intelligence — and pretraining is where that work begins. You'll shape training objectives, architectures, data strategies, and systems behind our joint image, video, and audio foundation models, with a direct line from your research to products used by millions. * Lead large-scale pretraining experiments for our multimodal (image, video, audio) foundation models (architecture, objective functions, scaling strategies) * Contribute across the full stack: low-level GPU and systems optimizations, research code, and high-level model design * You've led or co-owned pretraining for a foundation model (image, video, LLM, or multimodal) that shipped to production or a major release
mehr