Improve performance of IQN #172

erinn-lee · 2022-04-15T09:40:03Z

Please describe the feature you want to add.
A clear and concise description of the feature. Ex. I'm going to implement ...

Improve the performance of IQN.

Additional requirement
A clear and concise description of any additional requirements for the new feature

Reference
Please append any references for the feature

erinn-lee · 2022-05-31T04:16:47Z

Benchmarks of JORLDY agents
https://www.notion.so/Benchmark-09684f1adf764c84a5a331cb5690544f

Models with IQN networks perform poorly.
[ I ] Agents in the IQN family perform worse than other distributional RL agents.
[ II ] The n-step option tends to destabilize the performance of Rainbow IQN.

"Agents in the IQN family perform worse than other distributional RL agents"

The IQN agent performs the same as or worse than C51 and QR-DQN. Please refer to the benchmark link above.
M-IQN, which applies the Munchausen RL technique, likewise performs the same or worse.
The performance of both agents should be improved.

"The n-step option tends to destabilize the performance of Rainbow IQN"

The performance of Rainbow IQN is unstable. It is especially fragile on the Breakout task when the agent is updated using the n-step TD error. Its performance should be improved.
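
For reference, here is a minimal sketch of the scalar n-step TD target that the n-step option is based on. It is not JORLDY's implementation: the function name `n_step_target` and its parameters are hypothetical, and it assumes the bootstrap is dropped once an episode terminates inside the n-step window. In a distributional agent such as Rainbow IQN, the same reward sum and discount would presumably be applied to each target quantile sample instead of a single value.

```python
def n_step_target(rewards, dones, bootstrap_value, gamma=0.99):
    """Compute an n-step TD target (hypothetical helper, not from JORLDY).

    rewards:         the n rewards r_t, ..., r_{t+n-1}
    dones:           the n terminal flags for the same steps
    bootstrap_value: value estimate for the state reached after the n-th step
    gamma:           discount factor
    """
    target = 0.0
    discount = 1.0
    for reward, done in zip(rewards, dones):
        target += discount * reward
        if done:
            # The episode ended inside the window, so nothing is bootstrapped.
            return target
        discount *= gamma
    # After n steps without termination, discount equals gamma ** n.
    return target + discount * bootstrap_value


def n_step_td_error(q_value, rewards, dones, bootstrap_value, gamma=0.99):
    """TD error is the n-step target minus the current estimate."""
    return n_step_target(rewards, dones, bootstrap_value, gamma) - q_value
```

A larger n makes the target depend on more sampled rewards and on a bootstrap value scaled by `gamma ** n`, so the variance of the target generally grows with n. That may be one contributor to the instability reported here.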

erinn-lee added the bug and enhancement labels Apr 15, 2022

erinn-lee assigned atech-rl-kakaoenterprise Apr 15, 2022

erinn-lee added a commit that referenced this issue Apr 15, 2022

add clip at network output for stable learning(#172)

4982979
