Learning Goal-Conditioned Representations for Language Reward Models - Scale Labs | Scale Labs