Abstract: In this paper, we present GASPARD (Graphical Array Specification for Parallel and Distributed computing), our visual programming environment devoted to the development of parallel ...
Abstract: Knowledge-based Visual Question Answering (VQA) is a challenging task that requires models to access external knowledge for reasoning. Large Language Models (LLMs) have recently been ...
In vision-language models (VLMs), visual tokens usually consume a significant amount of computational overhead, despite their sparser information density compared to text tokens. To address this, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results