Skip to content

Clarification on GPU count used in reward model training #21

@LAOS-Y

Description

@LAOS-Y

Are all the reward models trained with 32 GPUs in total, including models with single machine config, e.g., editscore_7B, editscore_qwen3_vl_4B_instruct, and editscore_qwen3_vl_8B_instruct?

Is all multi-machine reward training done with 2 nodes with 16 GPUs each?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions