On the Difficulty of Extrapolation with NN Scaling

by ericjangon 1/25/2022, 5:56 PMwith 2 comments

by nerdponxon 1/26/2022, 8:47 PM

It seems like fancy hyperparameter optimization techniques (e.g. Bayesian black-box optimization) probably don't help here either, because they don't solve the problem of extrapolating outside the range of hyperparameter values have have already been tried. Is that a valid conclusion?