Skip to content

Discriminator is unable to adapt to high loss. #2921

Discussion options

You must be logged in to vote

I think i might have resolved at least parts of my issue:
When training two model graphs in parallel one has to be quite particular when to detach a tensor from the model graph.

In my naive approach i detached the tensor while calculating the loss. If I understand it correctly this cause the tensor to be disconnected from the operations before and therefore the optimizer is not able to "reach" this operations in the next optimization step to change the corresponding weights.
Here is what i did it to fix the issue.

Replies: 2 comments 3 replies

Comment options

You must be logged in to vote
2 replies
@laggui
Comment options

@VirtualNonsense
Comment options

Comment options

You must be logged in to vote
1 reply
@laggui
Comment options

Answer selected by VirtualNonsense
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants