GitHub - agi-templar/Stable-Alignment: Multi-agent Social Simulation + Efficient, Effective, and...
Multi-agent Social Simulation + Efficient, Effective, and Stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society". - ...