Announcing our Open Source Dataset initiative!
✅ Join our Discord!
📝 Propose datasets that you’d like us to create!
🗣️ Discuss and vote on dataset ideas with the community
🛠️ Each week, we’ll create and publish the top 5 voted datasets
The datasets will be created using the synthetic data generation pipeline powering the Glaive platform, which has already powered models like glaive-coder-7B and OpenHermes-2.5