You don't need a GPU cluster to start training language models. I trained my first one locally on a laptop. It's slow, it's messy, but it works — and you learn more than any tutorial.