Skip to content

Unable to finetune BiRefNet_HR-general-epoch_130.pth using public training code. #308

Description

@hardikjain8

Hello,

We are trying to fine tune the released HR model checkpoint (BiRefNet_HR-general-epoch_130.pth) using the public repository.

Repository : https;//github.com/ZhengPeng7/BiRefNet

Training command : bash train.sh modelHR 0,1

Our objective is to fine-tune the released HR checkpoint on a custom dataset

When using the BiRefNet_HR-general-epoch_130.pth That training process does not support normally and appears to stall during initialization/loading. We are unable to successfully start fine tuning from released HR checkpoint. Additionally during training we also encountered a issue which is originating from the lost computation. I have attached the screenshot of the issue for your reference.

This suggests that some values passed to the loss function may be outside the expected range although we are unsure whether this is related to

  • The HR checkpoint itself
  • The current training configuration
  • A mismatch between the released HR checkpoint and the public training code.

Any guidance would be greatly appreciated. We would be happy to provide additional logs or debugging information if required.

Thank you.

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions