On the Performance of Multimodal Language Models - Scale Labs | Scale Labs