include comment on Saxe and sqrt(2) scaling factor

author Evan Shelhamer <shelhamer@imaginarynumber.net>

Tue, 26 May 2015 19:39:14 +0000 (12:39 -0700)

committer Evan Shelhamer <shelhamer@imaginarynumber.net>

Tue, 26 May 2015 20:12:25 +0000 (13:12 -0700)
author Evan Shelhamer <shelhamer@imaginarynumber.net>
Tue, 26 May 2015 19:39:14 +0000 (12:39 -0700)
committer Evan Shelhamer <shelhamer@imaginarynumber.net>
Tue, 26 May 2015 20:12:25 +0000 (13:12 -0700)
diff --git a/include/caffe/filler.hpp b/include/caffe/filler.hpp

index 0125b30..446f5b5 100644 (file)
--- a/include/caffe/filler.hpp
+++ b/include/caffe/filler.hpp
@@ -175,6 +175,9 @@ class XavierFiller : public Filler<Dtype> {
   * A Filler based on the paper [He, Zhang, Ren and Sun 2015]: Specifically
   * accounts for ReLU nonlinearities.
   *
+ * Aside: for another perspective on the scaling factor, see the derivation of
+ * [Saxe, McClelland, and Ganguli 2013 (v3)].
+ *
   * It fills the incoming matrix by randomly sampling Gaussian data with std =
   * sqrt(2 / n) where n is the fan_in, fan_out, or their average, depending on
   * the variance_norm option. You should make sure the input blob has shape (num,
author	Evan Shelhamer <shelhamer@imaginarynumber.net>
	Tue, 26 May 2015 19:39:14 +0000 (12:39 -0700)
committer	Evan Shelhamer <shelhamer@imaginarynumber.net>
	Tue, 26 May 2015 20:12:25 +0000 (13:12 -0700)