Abstract
We consider the variational problem of cross-entropy loss with n feature vectors on a unit hypersphere in R^d. We prove that when d ≥ n − 1, the global minimum is given by the simplex equiangular tight frame, which justifies the neural collapse behavior. We also prove that, as n → ∞ with fixed d, the minimizing points will distribute uniformly on the hypersphere and show a connection with the frame potential of Benedetto & Fickus.
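
For readers unfamiliar with the simplex equiangular tight frame, the following minimal sketch builds one and checks its defining properties: every vector has unit norm and every pair has inner product −1/(n − 1). The construction (a rescaled centering matrix) is a standard one and is assumed here for illustration; the function names `simplex_etf` and `dot` are hypothetical, and the sketch does not evaluate the cross-entropy loss itself, whose exact form the abstract does not state.

```python
import math


def simplex_etf(n):
    """Return n unit vectors in R^n forming a simplex equiangular tight frame.

    Assumed construction: rows of sqrt(n / (n - 1)) * (I - (1/n) * ones),
    where `ones` is the n-by-n all-ones matrix.
    """
    if n < 2:
        raise ValueError("need at least two vectors")
    scale = math.sqrt(n / (n - 1))
    return [
        [scale * ((1.0 if i == j else 0.0) - 1.0 / n) for j in range(n)]
        for i in range(n)
    ]


def dot(u, v):
    """Euclidean inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


n = 4
vectors = simplex_etf(n)

# Gram matrix: 1 on the diagonal, -1/(n - 1) everywhere else.
gram = [[dot(u, v) for v in vectors] for u in vectors]

# The vectors sum to zero, so they span only an (n - 1)-dimensional subspace.
centroid = [sum(v[j] for v in vectors) for j in range(n)]
```

The vectors are written in R^n for convenience, but because they sum to zero they lie in an (n − 1)-dimensional subspace, which is consistent with the condition d ≥ n − 1 in the abstract.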