[D] DeepDanbooru v3 PyTorch Port: Constant 0.5 or 0 output after loading weights
I’m porting DeepDanbooru v3 (Janouch port) to PyTorch. After mapping 209 layers from Safetensors, the model outputs exactly 0.5 for all tags. I’ve tracked it back to the Batch Normalization layers. It seems like the ‘running_var’ values are causing a collapse. Is this a known issue when converting Keras/TensorFlow weights to PyTorch for ResNet architectures? Should I manually initialize the BN stats?
submitted by /u/RevolutionaryAge70
[link] [comments]
Like
0
Liked
Liked