- Paper in NIPS 2016
- Authors' code
- Some TensorFlow utilities from OpenAI Baselines.
- Define a network, and get its gradients and variables, e.g.,

```python
def network():
    '''
    Define the target density and return gradients and variables.
    '''
    return gradients, variables
```
- Define a gradient descent optimizer, e.g.,

```python
def make_gradient_optimizer():
    return tf.train.GradientDescentOptimizer(learning_rate=0.01)
```
- Build multiple networks (particles) by calling `network()` repeatedly, and collect all of their gradients and variables in `grads_list` and `vars_list` (see the end-to-end sketch after this list).
- Make the SVGD optimizer, e.g.,

```python
optimizer = SVGD(grads_list, vars_list, make_gradient_optimizer)
```
- In the training phase, `optimizer.update_op` performs a single SVGD update, e.g.,

```python
sess = tf.Session()
sess.run(optimizer.update_op, feed_dict={X: x, Y: y})
```
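Putting the steps above together, here is a minimal end-to-end sketch. It assumes `network()` and `make_gradient_optimizer()` are defined as above and that the `SVGD` class from this repository is in scope; the particle count, the per-particle variable scopes, and the number of training steps are illustrative choices, not part of the repository's API.

```python
import tensorflow as tf

n_particles = 50  # illustrative choice

grads_list, vars_list = [], []
for i in range(n_particles):
    # Give each particle its own variable scope so that repeated calls to
    # network() create independent copies of the variables.
    with tf.variable_scope('particle_{}'.format(i)):
        gradients, variables = network()
    grads_list.append(gradients)
    vars_list.append(variables)

# SVGD is the optimizer class provided by this repository.
optimizer = SVGD(grads_list, vars_list, make_gradient_optimizer)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for _ in range(1000):
        # One run of update_op moves every particle by a single SVGD step.
        # If network() uses placeholders, feed them here,
        # e.g., feed_dict={X: x, Y: y}.
        sess.run(optimizer.update_op)
```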
- The goal of this problem is to match the target density p(x) (a mixture of two Gaussians) by moving particles that are initially sampled from another distribution q(x). For details, see the experiment section of the authors' paper. A concrete example of `network()` for this target is sketched below.
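As a concrete, illustrative instance of `network()` for this toy problem, one can write the unnormalized log density of a two-component Gaussian mixture directly and differentiate it with `tf.gradients`. The particular weights, means, and the initializer standing in for sampling from q(x) below are my own assumptions, chosen to resemble the paper's toy example.

```python
import tensorflow as tf

def network():
    '''
    Illustrative target for the 1D toy problem: the (unnormalized) log density
    of a two-component Gaussian mixture. Returns the gradient of log p(x) with
    respect to the particle variable, and the variable itself.
    '''
    # The initializer plays the role of sampling the particle from q(x).
    x = tf.get_variable('x', initializer=tf.random_normal([], mean=-10.0))
    # p(x) is proportional to 1/3 * N(x; -2, 1) + 2/3 * N(x; 2, 1); the common
    # normalizing constant is dropped because it does not affect grad log p.
    p = (1.0 / 3.0) * tf.exp(-0.5 * tf.square(x + 2.0)) \
        + (2.0 / 3.0) * tf.exp(-0.5 * tf.square(x - 2.0))
    gradients = tf.gradients(tf.log(p), [x])
    return gradients, [x]
```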
- I got the following result:
- Note that I compared my implementation with the authors' implementation and checked that the results are the same.
- In this example, we want to classify binary data using multiple neural network classifiers. I checked how SVGD differs from the ensemble method in this example, and I made a PDF file with the detailed mathematical derivations.
- I got the following results:
- Thus, ensemble methods push the particles to classify samples strongly, whereas SVGD draws particles that characterize the posterior distribution.
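This contrast can be made explicit by comparing the two update rules (a sketch in my own notation: w_i are the parameters of the i-th classifier, D is the training data, k is a kernel such as the RBF kernel, and epsilon is the step size):

```latex
% Independent ensemble: each particle ascends its own log posterior,
% so every particle drifts toward a mode and classifies confidently.
w_i \leftarrow w_i + \epsilon \, \nabla_{w_i} \log p(w_i \mid D)

% SVGD: particles share gradient information through the kernel and are
% pushed apart by the repulsive term \nabla_{w_j} k(w_j, w_i), so they
% spread out and characterize the posterior p(w \mid D).
w_i \leftarrow w_i + \frac{\epsilon}{n} \sum_{j=1}^{n}
    \left[ k(w_j, w_i) \, \nabla_{w_j} \log p(w_j \mid D)
           + \nabla_{w_j} k(w_j, w_i) \right]
```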