Paper reading in 1-hour! π SallyAnn D. and Jason will discuss new research from DeepMind on grokking.
The paper authors demonstrate two new and surprising behaviors: ungrokking, in which a network regresses from perfect to low test accuracy, and semi-grokking, in which a network shows delayed generalization to partial rather than perfect test accuracy.
Hereβs the paper: https://arxiv.org/abs/2309.02390
Join us! https://arize.com/resource/community-papers-reading/