+ Proficiency in designing and training vision-language models (VLMs) and large multimodal models (LMMs) (e.g., Flamingo, GPT-4V, PaLM-E, RT-2)
+ Capability in designing agents for long-horizon planning, semantic task decomposition, and hierarchical control

Whether in the areas of mobility solutions, consumer goods, industrial technology or energy and building technology: with us, you will have the chance to improve quality of life all across the globe.

* As a Research Engineer, you will develop cutting-edge Vision-Language-Action (VLA) architectures that empower AI agents to interpret human instructions and act autonomously in complex environments.
* You will make VLA methods usable for concrete Bosch applications in practice and demonstrate their superior flexibility and generalization capabilities.

o Excellent MSc in Computer Science, Machine Learning, Robotics or related technical fields