> And around 2012, a bunch of researchers reported that you don't even need 2nd-derivative information. You just have to initialize the neural net properly.
This sounds very interesting. How do you properly initialize the weights? Do you have a link to a paper about this?