RLBF. reinforcement learning from Bing feedback
1,15K