BharathStaff AI Engineer · Production Multi-Agent PlatformsLinkedIn

About

Production multi-agent systems, and how agents should coordinate

Agent engineer in Bangalore. I ship multi-agent products in production and study multi-agent RL part-time at the University of Groningen.

Career arcARCIBuildproducts at speedIIOperatemulti-tenant scaleIIIResearchmulti-agent RLIVGuideteams past demo

I design and ship multi-agent systems in production: collaborative workflows, tool-using agents, fine-tuning, retrieval when it matters, and evals so teams can iterate without guessing.

I co-founded a voice-AI startup around multi-agent customer automation and hybrid data routing for agents. Before that I built a connected-vehicle data platform (10,000 vehicles in three months, later acquired). Earlier work spans industrial 6D pose estimation and LLM fine-tuning with QLoRA.

I am pursuing a PhD in multi-agent reinforcement learning at the University of Groningen, focused on strategic world models. I proposed SeqPPO, which showed roughly 3× better sampling efficiency than MAPPO, HATRPO, and HAPPO in our benchmarks.

Principles

  • Use multiple agents when a single prompt chain stops scaling.
  • If you cannot trace a multi-step run, you should not ship it.
  • Pick the right layer: orchestration, fine-tuning, retrieval, or tools.
  • When something breaks in prod, leave a runbook the team can reuse.
  • Research on coordination informs how I design agent handoffs.

Education

  • PhD, Multi-agent reinforcement learning, University of Groningen (2026–2030)
  • MSc, Artificial Intelligence, University of Groningen (2020–2022). GPA 7.4/10
  • B.E., Information Science and Engineering, Global Academy of Technology (2015–2019)

Recognition

  • Winner, Maastricht WiDS Datathon, "Data Science Pioneers" (2022)
  • Finalist, YALE-CBIT Hackathon (2022)
  • Team lead, Ai4Good × European Space Agency (2021)
  • Audience choice, AiMED:AiHack Covid (2021)
  • University research collaboration; grant proposal secured $100K funding (2022)