Convert kernel_mask into a constant tensor by Larst0 · Pull Request #74 · MathiasGruber/PConv-Keras

Larst0 · 2021-08-17T13:58:21Z

When I train with the given implementation, the output always ends up containing NaN values. Sometimes this happens after a few training steps and sometimes only after a few epochs, depending on the training data used.

While debugging, I noticed that kernel_mask was being updated during training. I think this is because K.ones(shape=...) returns a trainable variable when every entry of the passed shape is >0. In the original PyTorch implementation, the mask kernel is initialized with weight_maskUpdater = torch.ones(...), which creates a non-trainable tensor by default (requires_grad=False).

After replacing K.ones(...) with K.constant(...), the NaN values no longer occur.
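
To illustrate why the mask kernel has to stay fixed, here is a small framework-free sketch in plain Python. It is not the repository's code. It assumes that the role of kernel_mask is to count the valid pixels under each sliding window, which is what an all-ones kernel does. The helper `window_sums`, the 1D layout, and the "drifted" kernel values are all hypothetical and chosen only for illustration.

```python
def window_sums(mask, kernel):
    """Valid (no padding) 1D sliding-window correlation of mask with kernel."""
    k = len(kernel)
    return [
        sum(m * w for m, w in zip(mask[i:i + k], kernel))
        for i in range(len(mask) - k + 1)
    ]


# Binary mask: 1 marks a valid pixel, 0 marks a hole.
mask = [1, 1, 0, 0, 1]

# Constant all-ones kernel: each output is the number of valid pixels
# inside the corresponding window.
const_kernel = [1.0, 1.0, 1.0]
counts = window_sums(mask, const_kernel)

# Hypothetical kernel after an optimizer has modified it (made-up values).
# The outputs are no longer pixel counts and can even become negative.
drifted_kernel = [0.5, 1.25, -0.5]
drifted = window_sums(mask, drifted_kernel)

print(counts)   # [2.0, 1.0, 1.0]
print(drifted)  # [1.75, 0.5, -0.5]
```

With the constant kernel the window sums are always non-negative integers, so any normalization derived from them behaves predictably. Once the kernel is allowed to change, that guarantee is gone, which would be consistent with the instability described above, although this sketch does not reproduce the NaN values themselves.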

Convert kernel_mask into a constant tensor

8dcd73d

Larst0 wants to merge 1 commit into MathiasGruber:master from Larst0:patch-1
