Critique-out-Loud Reward Models

Paper: https://arxiv.org/abs/2408.11791
Code: https://github.com/zankner/CLoud

Models:
ankner/Llama3-8B-CLoud-RM
ankner/Llama3-8B-Classic-RM
ankner/Llama3-70B-CLoud-RM
ankner/Llama3-70B-Classic-RM