Compute discriminator output with policy and expert states #22

GiulioRomualdi · 2025-07-02T14:17:04Z

Concatenate policy and expert observations for the discriminator input to enhance its performance. This change introduces a new loss function for the discriminator.

…tate

GiulioRomualdi · 2025-07-02T19:22:14Z

Tested in a training that it is working as expected

Compute the discriminator output considering both policy and expert s…

49dc9d9

…tate

GiulioRomualdi self-assigned this Jul 2, 2025

GiulioRomualdi requested a review from Giulero July 2, 2025 14:17

GiulioRomualdi added 2 commits July 2, 2025 16:26

Update amp_ppo.py

4031122

Update amp_ppo.py

d9ff977

Giulero approved these changes Jul 2, 2025

View reviewed changes

GiulioRomualdi merged commit 4086172 into main Jul 3, 2025

GiulioRomualdi deleted the discriminator_at_once branch July 3, 2025 14:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Compute discriminator output with policy and expert states #22

Compute discriminator output with policy and expert states #22

Uh oh!

GiulioRomualdi commented Jul 2, 2025

Uh oh!

GiulioRomualdi commented Jul 2, 2025

Uh oh!

Uh oh!

Compute discriminator output with policy and expert states #22

Compute discriminator output with policy and expert states #22

Uh oh!

Conversation

GiulioRomualdi commented Jul 2, 2025

Uh oh!

GiulioRomualdi commented Jul 2, 2025

Uh oh!

Uh oh!