RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation Official code for the paper "RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation". Authors: Kaiqu Liang, Haimin Hu, Ryan Liu, Tom Griffiths, Jaime Fernández Fisac. Code will be coming soon!