Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improvement performance of IQN #172

Open
erinn-lee opened this issue Apr 15, 2022 · 1 comment
Open

Improvement performance of IQN #172

erinn-lee opened this issue Apr 15, 2022 · 1 comment
Assignees
Labels
bug Something isn't working enhancement New feature or request

Comments

@erinn-lee
Copy link
Collaborator

Please describe the feature you want to add.
A clear and concise description of what the feature. Ex. I'm going to implement ...

Improvement performance of IQN

Additional requirement
A clear and concise description of additional requirement for the new feature

Reference
Please append the reference about the feature

@erinn-lee erinn-lee added bug Something isn't working enhancement New feature or request labels Apr 15, 2022
@erinn-lee
Copy link
Collaborator Author

erinn-lee commented May 31, 2022

Benchmarks of JORLDY agents
https://www.notion.so/Benchmark-09684f1adf764c84a5a331cb5690544f

Models with IQN networks have poor performance.
[ I ] Agents which series of IQN have lower performances than other Distributional RL agents.
[ II ] The n-step option tends to destabilize the performance of the Rainbow IQN

"Agents which series of IQN have lower performances than other Distributional RL agents"
스크린샷 2022-05-31 오후 2 57 06
스크린샷 2022-05-31 오후 3 01 04

IQN agent has lower or same performance comparing with C51 and QR-DQN. Please, refer the link on top.
M-IQN which applied 'Muchausen' RL technique also has lower or same performance too.
Their performances should be enhanced.

"The n-step option tends to destabilize the performance of the Rainbow IQN"
스크린샷 2022-05-31 오후 3 02 04

The performance of Rainbow IQN is unstable. Especially, it is vulnerable about Breakout task when the agent update by using n-step TD error. Their performances should be enhanced.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants