Property of gradient of cross-entropy loss with kernel density estimation
Fragment of a discussion from Talk:BeepBoop/Understanding BeepBoop
The thoughts on surfing & targeting are quite inspiring. Even when no data points fall within the kernel size K (the hard case), that case is still valuable, since there may be a data point just outside the kernel size. Repeating the training process iteratively with the new weights may eventually turn that case into an easy one ;) Are you doing something similar as well?