Official code for the paper "RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation".
Authors: Kaiqu Liang, Haimin Hu, Ryan Liu, Tom Griffiths, Jaime Fernández Fisac.
Code will be coming soon!
Official code for the paper "RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation".
Authors: Kaiqu Liang, Haimin Hu, Ryan Liu, Tom Griffiths, Jaime Fernández Fisac.
Code will be coming soon!