The paper says that it enhances existing methods such as prompt engineering (e.g. chain of thought) and LLM debate; the agent approach is orthogonal to these, so it can be combined with them.
In optimization problems, injected randomness can often get you out of local minima/maxima, so aggregating a bunch of random search paths (e.g. keeping the best of several restarts) tends to give better worst-case results. Something similar might be happening here: the training set will be biased in ways that can create spurious local optima, and this process could smooth over those weird kinks.
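To make the analogy concrete, here is a toy sketch (my own illustration, not anything from the paper) of random-restart hill climbing: a single greedy run can stall at whichever local maximum is nearest its start, while keeping the best endpoint across many random starts washes out that bad luck.

```python
import math
import random

def f(x):
    # Multimodal toy objective with many local maxima.
    return math.sin(5 * x) - 0.1 * x * x

def hill_climb(x0, steps=200, step_size=0.05):
    # Greedy local search: accept a random perturbation only if it improves f,
    # so a single run can stall at whichever local maximum is nearest x0.
    x = x0
    for _ in range(steps):
        cand = x + random.uniform(-step_size, step_size)
        if f(cand) > f(x):
            x = cand
    return x

random.seed(0)

# Restart from many random points and keep the best endpoint; this
# averages out the bad luck of any single starting position.
endpoints = [hill_climb(random.uniform(-3.0, 3.0)) for _ in range(20)]
best = max(endpoints, key=f)

# Best-of-N is, by construction, at least as good as any one run in the batch.
print(f(best) >= f(endpoints[0]))
```

The rough parallel: each sampled reasoning path is one "run", the training set's biases play the role of the bumpy landscape, and aggregating over many paths is the best-of-N step.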