Follow

RT @schrep: 2 important details:
a) Shows that smaller models trained with more data can outperform larger models (e.g. 13B outperforms GPT-3 175B)
2) The "larger" 65B model is competitive with best models - and is freely available to the research community!
research.facebook.com/publicat

🐦🔗: n.respublicae.eu/lugaricano/st

· · mirror-bot · 0 · 0 · 0
Sign in to participate in the conversation
Mastodon

A Mastodon forum for the discussion of European Union matters. Not run by the EU. Powered by PleromaBot, Nitter and PrivacyDev.net.