Designing and building single- and multi-agent production systems with planning, memory, and tool-use capabilities that meet regulated-industry compliance and auditability standards. If you have a passion for building production-grade AI agents, MCP-based tool ecosystems, and data-rich AI workflows, then this is the perfect role for you! * Implementing testing, evaluation, monitoring, and observability best practices for production agent systems (LangSmith / LangFuse, structured tracing, offline and online evaluation). * Developing, integrating, and maintaining LLM-powered microservices and APIs (Python, FastAPI, gRPC, Postgres) as part of broader production applications deployed on AWS. * Identifying, testing, and adopting state-of-the-art advancements in LLMs and autonomous agent architectures (e.g. reflection, planning, multi-agent coordination, AWS Bedrock and Bedrock AgentCore).
mehr