Document Details
Clip:
V ∗ : Guided Visual Search as a Core Mechanism in Multimodal LLMs Penghao Wu † UC San Diego pew011@ucsd.edu Saining Xie New York University saining.xie@nyu.edu[targets located] V*: LLM‐guided Search Visual Working Memory (VWM) VQA
Filename:
2312.14135.pdf
Filetype:
application/pdf
Size:
5324146 bytes
Uploaded On:
2024-01-22
Abstract:
Summary:
Tags:
Notes:
Visible:
1
Status:
Parsed
Author:
CreationDate:
2023-12-27T02:18:24+00:00
Creator:
LaTeX with hyperref
Keywords:
ModDate:
2023-12-27T02:18:24+00:00
PTEX.Fullbanner:
This is pdfTeX, Version 3.141592653-2.6-1.40.25 (TeX Live 2023) kpathsea version 6.3.5
Producer:
pdfTeX-1.40.25
Subject:
Title:
Trapped:
False
Pages:
18
Return to Document Library