GLaMM <img src="images/logos/face.png" height="40">: Pixel Grounding Large Multimodal Model [CVPR 2024] — collection by mbzuai-oryx | Shared Context