Unable to finetune BiRefNet_HR-general-epoch_130.pth using public training code.

Hello, 

We are trying to fine tune the released HR model checkpoint (BiRefNet_HR-general-epoch_130.pth) using the public repository. 

Repository : https;//github.com/ZhengPeng7/BiRefNet

Training command : bash train.sh modelHR 0,1

**Our objective is to fine-tune the released HR checkpoint on a custom dataset**

When using the BiRefNet_HR-general-epoch_130.pth That training process does not support normally and appears to stall during initialization/loading. We are unable to successfully start fine tuning from released HR checkpoint. Additionally during training we also encountered a issue which is originating from the lost computation. I have attached the screenshot of the issue for your reference. 

This suggests that some values passed to the loss function may be outside the expected range although we are unsure whether this is related to

- The HR checkpoint itself
- The current training configuration 
- A mismatch between the released HR checkpoint and the public training code. 

Any guidance would be greatly appreciated. We would be happy to provide additional logs or debugging information if required. 

Thank you. 

<img width="1403" height="655" alt="Image" src="https://github.com/user-attachments/assets/675a600c-d964-420d-baa5-3a8a52632fa9" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to finetune BiRefNet_HR-general-epoch_130.pth using public training code. #308

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Unable to finetune BiRefNet_HR-general-epoch_130.pth using public training code. #308

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions