Hello, i found the grad matrix(grad = weight.grad) has many non-zero elements while their corresponding values in weight matrix are zero, I want to ask why this happen(as a beginner of pytorch)? Thanks!
partial gradient matrix:

partial weight matrix:
