@alignment_lab
Alignment Lab AI
3 months
how long before we acknowledge that releasing a model under a specific name, then replacing it with a worse one a week later without telling anyone, is literally false advertising?
7
2
80

Replies

@BramVanroy
Bram
3 months
@alignment_lab Perhaps leaderboards should report the hash of the model at the time of testing, so that we have a fingerprint to identify it by.
1
0
4
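A minimal sketch of what such a fingerprint could look like, assuming the evaluator has the model files on local disk (which is not the case for API-only models). The function name `model_fingerprint` and the directory layout are hypothetical, chosen only for illustration; no leaderboard is known to compute its hashes this way.

```python
import hashlib
from pathlib import Path


def model_fingerprint(model_dir: str, chunk_size: int = 1 << 20) -> str:
    """Return a SHA-256 hex digest over every file under model_dir.

    Hypothetical helper: hashes files in sorted order so the result is
    stable across runs and machines.
    """
    root = Path(model_dir)
    digest = hashlib.sha256()
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        # Mix in the relative path so renamed or moved files change the result.
        digest.update(path.relative_to(root).as_posix().encode("utf-8"))
        digest.update(b"\0")
        # Read in chunks so multi-gigabyte weight files do not fill memory.
        with path.open("rb") as fh:
            for chunk in iter(lambda: fh.read(chunk_size), b""):
                digest.update(chunk)
    return digest.hexdigest()
```

A leaderboard entry could then store the returned digest next to the score, and anyone holding the same files could recompute it to check they are looking at the same model.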
@alignment_lab
Alignment Lab AI
3 months
@BramVanroy any benchmark for an API model is inherently untrustworthy if the benchmarkers don't have access to the model
1
0
6
@vikhyatk
vik
3 months
@alignment_lab > get on top of leaderboard > raise $2.75B > swap out model and laugh all the way to the bank
2
0
19
@andersonbcdefg
Ben (e/sqlite)
3 months
@alignment_lab who did this
0
0
6
@tinycrops
ATH
3 months
@alignment_lab "did we break any laws?"
0
0
3
@nisten
nisten
3 months
@alignment_lab it technically is a form of fraud, people paid for something else?
0
0
1
@jobi1kan0b
jimbo
3 months
@alignment_lab Are you talking about Claude Opus?
0
0
1