Multimodal Large Language Model (MLLM)
VLMs fit under this. This stuff is really important for robotics, since we are dealing with multi-modal data.
They have a huge reading list:
Some papers:
VLMs fit under this. This stuff is really important for robotics, since we are dealing with multi-modal data.
They have a huge reading list:
Some papers: