The mask I used is shown in the output video. Could someone help me solve this? Is it caused by the way the input is sent to the model? I would appreciate any help.