Skip to content

Commit

Permalink
update
Browse files Browse the repository at this point in the history
  • Loading branch information
trungtrinh44 committed May 4, 2024
1 parent 44d42f9 commit c07d1e1
Show file tree
Hide file tree
Showing 4 changed files with 13 additions and 17 deletions.
3 changes: 3 additions & 0 deletions _layouts/default.html
Original file line number Diff line number Diff line change
Expand Up @@ -34,6 +34,9 @@
.my_orange {
color: #ED7D31;
}
.my_deepred{
color: rgb(196, 78, 82)
}
blockquote p strong {
color: #ED7D31;
}
Expand Down
1 change: 1 addition & 0 deletions assets/input_gradient_kernel.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1 change: 1 addition & 0 deletions assets/rde_wgd.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
25 changes: 8 additions & 17 deletions index.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,22 +23,7 @@ Please cite our work if you find it useful:

> **Description:** Train an ensemble \\(\\{\boldsymbol{\theta}\_i\\}_{i=1}^M\\) using Wasserstein gradient descent [2], which employs a <span class="my_blue">kernelized repulsion term</span> to diversify the particles to cover the <span class="my_red"> Bayes posterior \\(p(\boldsymbol{\theta} \| \mathcal{D}) \\)</span>.
\begin{equation}
\boldsymbol{\theta}\_i^{(t+1)} = \boldsymbol{\theta}\_i^{(t)} + \eta\_t\bigg(
{\color{red}
\underbrace{
\nabla\_{\boldsymbol{\theta}\_i^{(t)}} \log p(\boldsymbol{\theta}\_i^{(t)} \| \mathcal{D})
}\_{\text{Driving force}}}
-
{\color[RGB]{68,114,196}
\underbrace{\frac{
\sum\_{j=1}^N \nabla\_{\boldsymbol{\theta}\_i^{(t)}} k(\boldsymbol{\theta}\_i^{(t)}, \boldsymbol{\theta}\_j^{(t)})
}{
\sum\_{j=1}^N k(\boldsymbol{\theta}\_i^{(t)}, \boldsymbol{\theta}\_j^{(t)})
}}\_{\text{Repulsion force}}
}
\bigg)
\end{equation}
<img src="./assets/rde_wgd.svg" alt="drawing" width="100%" max-width="1000px">

- The <span class="my_red">driving force</span> directs the particles towards high density regions of the posterior
- The <span class="my_blue">repulsion force</span> pushes the particles away from each other to enforce diversity.
Expand All @@ -54,4 +39,10 @@ Please cite our work if you find it useful:
<strong class="my_orange">Possible advantages:</strong>
- Each member is guaranteed to represent a different function;
- The issues of weight- and function-space repulsion are avoided;
- Each member is encouraged to learn different features, which can improve robustness.
- Each member is encouraged to learn different features, which can improve robustness.

# Defining the input-gradient kernel <span class="my_blue">\\(k\\)</span>

Given a base kernel <span class="my_deepred">\\(\kappa\\)</span>, we define the kernel in the input-gradient space for a minibatch of training samples \\(\mathcal{B}=\\{(\mathbf{x}\_b, y\_b\\}\_{b=1}^B\\) as follows:

<img src="./assets/input_gradient_kernel.svg" alt="drawing" width="100%" max-width="1000px">

0 comments on commit c07d1e1

Please sign in to comment.