Atzberger, DanielDanielAtzbergerCech, TimTimCechScheibel, WillyWillyScheibelLimberger, DanielDanielLimbergerDöllner, JürgenJürgenDöllnerTrapp, MatthiasMatthiasTrapp2024-10-142024-10-142022https://doi.org/10.1145/3554944.3554961https://knowledge.hpi.de/handle/123456789/4477Based on the assumption that semantic relatedness between documents is reflected in the distribution of the vocabulary, topic models are a widely used technique for different analysis tasks. Their application results in concepts, the so-called topics, and a high-dimensional description of the documents. For visualization tasks, they can further be projected onto a lower-dimensional space using a dimension reduction technique. Though the quality of the resulting scatter plot mainly depends on the chosen layout technique and the choice of its hyperparameters, it is unclear which particular combinations of topic models and dimension reduction techniques are suitable for displaying the semantic relatedness between the documents. In this work, we propose a benchmark comprising various datasets, layout techniques, and quality metrics for conducting an empirical study on different such layout algorithms.A Benchmark for the Use of Topic Models for Text Visualization Tasksinproceedings