All the big labs pretrain on the ~same internet, but each has a secret sauce:
- OpenAI: reddit
- xAI: twitter
- Google: youtube
- Meta: instagram & facebook In the end, models mirror the cultures of the social media they grew up on.
All the big labs pretrain on the ~same internet, but each has a secret sauce: