BERT-Large (345M parameters) is now faster than the much smaller DistilBERT (66M parameters) with accuracy of the larger BERT-Large! It delivers 8x latency speedup on commodity CPUs 🚀 🙏 @ZafrirOfir, @guybd35, @markurtz_ & teams for fantastic collab! Tweet added by Neural Magic @neuralmagic

Neural Magic

2 years

BERT-Large (345M parameters) is now faster than the much smaller DistilBERT (66M parameters) with accuracy of the larger BERT-Large! It delivers 8x latency speedup on commodity CPUs 🚀 🙏 @ZafrirOfir , @guybd35 , @markurtz_ & teams for fantastic collab!