Document Details


2312.14135.pdf
Download View Text Delete
Clip: V ∗ : Guided Visual Search as a Core Mechanism in Multimodal LLMs Penghao Wu † UC San Diego pew011@ucsd.edu Saining Xie New York University saining.xie@nyu.edu[targets located] V*: LLM‐guided  Search Visual Working Memory (VWM) VQA
Filename: 2312.14135.pdf
Filetype: application/pdf
Size: 5324146 bytes
Uploaded On: 2024-01-22
Abstract:
Summary:
Tags:
Notes:
Visible: 1
Status: Parsed
Author:
CreationDate: 2023-12-27T02:18:24+00:00
Creator: LaTeX with hyperref
Keywords:
ModDate: 2023-12-27T02:18:24+00:00
PTEX.Fullbanner: This is pdfTeX, Version 3.141592653-2.6-1.40.25 (TeX Live 2023) kpathsea version 6.3.5
Producer: pdfTeX-1.40.25
Subject:
Title:
Trapped: False
Pages: 18

Return to Document Library