By verbosity, I mean why are there so many traces, most of them with some IDs appended, the content feels hard to read.
So you mean the verbosity isn't expected?
Can you check if your termination conditions are working as expected?
Yeah, it does seem working as expected. Here is a diff run of the same script, and the last assistant does say APPROVE at the end, e.g.