Jun 20

Since the publication of the scaling laws, which demonstrated that model performance improves as a predictable power-law function of model size, dataset size, and total training compute, the dominant constraint on progress appeared to be compute itself (the sheer number and quality of GPUs that could be brought to bear on a single training run).

2 Comments

Mirko Monti

Jun 20

Sì Let’s invest in energy stocks

Reply (1)

Share

Computipedia

The real bottleneck