
Releases: Ipsedo/MARLClassification

v1.2 Actor Critic

21 Nov 21:12
  • use actor-critic as the RL framework
  • fix the reward being always negative, which caused wrong convergence when combined with log probabilities
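
To illustrate why an always-negative reward can be a problem when it multiplies log probabilities, here is a minimal sketch in plain Python. It is not the repository's code: the reward values, the constant mean-reward baseline standing in for a critic, and the helper names `raw_reward_weights` and `advantage_weights` are all assumptions made for the example. In a policy-gradient update each log probability is scaled by a weight. If that weight is the raw reward and the reward is always negative, every sampled action is pushed down. Subtracting a value estimate, as an actor-critic setup does, gives weights of both signs.

```python
from statistics import mean


def raw_reward_weights(rewards):
    """Weights applied to log probabilities when the raw reward is used."""
    return list(rewards)


def advantage_weights(rewards, values):
    """Weights applied to log probabilities when a value estimate is subtracted."""
    return [r - v for r, v in zip(rewards, values)]


# Hypothetical always-negative rewards (for example, a negated error).
rewards = [-0.2, -1.5, -0.7, -3.0]

# Stand-in for a learned critic: a constant baseline equal to the mean reward.
baseline = mean(rewards)
values = [baseline] * len(rewards)

raw = raw_reward_weights(rewards)
adv = advantage_weights(rewards, values)

# Every raw weight is negative, so every sampled action would be discouraged.
print(all(w < 0 for w in raw))
# Advantages have mixed signs: better-than-baseline actions are encouraged.
print(any(a > 0 for a in adv) and any(a < 0 for a in adv))
```

With a real critic, the baseline would be a learned, state-dependent value rather than a constant, but the sign argument is the same.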

Benchmarks will come soon. Do not use the trained models from this release.

v1.1

17 Nov 16:57
  • benchmarks on MNIST & RESISC-45