Sessions

Applications

  • Generating Search-Engine-Optimized Headlines for Sports News
    Frank Zalkow, Benedikt Schäfer, Thomas Moissl, Jonas Bücherl, Kerstin Markl, Sebastian Bothe, Francois Duchateau, Julia Dollase, Patric Kabus, Daniel Steinigen, Oliver Schmitt, Fabian Küch
  • Anon2025: A German Historical Newspaper Dataset for Named Entity Recognition and Entity Linking
    Sophie Schneider, Ulrike Förstel, Kai Labusch, Jörg Lehmann, Clemens Neudecker
  • Adaption and Evaluation of Generative Large Language Models for German Medical Information Extraction
    Sören Spiegel, Seid Muhie Yimam, Philipp Breitfeld, Frank Ückert

Discourse and Semantics

  • Function Words as Stable Features for German Opinion Articles Classification
    Amelie Schmidt-Colberg, Simon Burkard, Anne Grohnert, Michael John
  • Efficient and Effective Coreference Resolution for German
    Fynn Petersen-Frey, Hans Ole Hatzel, Chris Biemann
  • LLM-based Classification of Grounding Acts in German
    Milena Belosevic, Hendrik Buschmeier

Hate Speech

  • Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech
    Florian Ludwig, Frederike Zufall, Torsten Zesch
  • FASCIST-O-METER: Classifier for Neo-fascist Discourse Online
    Rudy Alexandro Garrido Veliz, Martin Semmann, Chris Biemann, Seid Muhie Yimam
  • HICC: A Dataset for German Hate Speech in Conversational Context
    Lars Schmid, Pius von Däniken, Patrick Giedemann, Don Tuggener, Judith Bühler, Maria Kamenowski, Katja Girschick, Dirk Baier, Mark Cieliebak

Methods

  • LRMs are not thinking straight: Unreliability of thinking trajectories
    Jhouben Cuesta-Ramirez, Samuel Beaussant, Mehdi Mounsif
  • Learn to pick the winner: Black-box ensembling for textual and visual question answering
    Yuxi Xia, Klim Zaporojets, Benjamin Roth
  • Surprisal in Action: A Comparative Study of LDA and LSA for Keyword Extraction
    J. Nathanael Philipp, Max Kölbl, Michael Richter

New Resources

  • SocCor: A Multimodal-based Multilingual Soccer Corpus for Text Data Analytics
    Paul Löhr, Jannik Strötgen
  • Predicting Functional Content Zones in German Source-Dependent Argumentative Essays: Experiments on a Novel Dataset
    Xiaoyu Bai, Manfred Stede
  • A Survey of Idiom Datasets for Psycholinguistic and Computational Research
    Michael Flor, Xinyi Liu, Anna Feldman

Translation and Multilinguality

  • Evaluating the Feasibility of Using ChatGPT for Cross-cultural Survey Translation
    Danielly Sorato, Diana Zavala-Rojas
  • Information Divergence in Translation and Interpreting: Findings from Same-Source Texts
    Maria Kunilovskaya, Sharid Loáiciga, Ekaterina Lapshinova-Koltunski
  • SEAS: Sentence Extraction and Alignment from Subtitles
    Josh Stephenson, Libby Barak

Poster 1

  • German Aspect-based Sentiment Analysis in the Wild: B2B Dataset Creation and Cross-Domain Evaluation
    Jakob Fehle, Niklas Donhauser, Udo Kruschwitz, Nils Constantin Hellwig, Christian Wolff
  • Vague, Incomplete, Subjective, and Uncertain Information in Art Provenance
    Fabio Mariani
  • Automatic Creation of Marginalia
    Aaron Lang, Robin Jegan, Andreas Henrich
  • Localization of English Affective Narrative Generation to German
    Johannes Schäfer, Sabine Weber, Roman Klinger
  • MultimodalPreProCessor: New Horizons for Distributed Microservice-Oriented Processing of Corpora using UIMA
    Daniel Bundan, Giuseppe Abrami, Alexander Mehler
  • Systematic Review of Linguistic Characteristics in Profiling and Automated Detection of Autistic Speech
    Charlotte Bellinghausen, Andreas Riedel
  • More than the Sum of Their Words: Generating and Contrasting Large Linguistic Networks
    Hanna Schmück
  • Towards a Cross-Dialectal Dictionary for Low German (Low Saxon)
    Christian Chiarcos, Janine Siewert, Tabea Gröger, Christian Fäth
  • Rapid Text Segmentation: Crowd-sourcing Lay Intuition about Text Structure in the Browser
    Florian Frenken
  • Applying an Information-theoretic Approach for Automatic Identification of German Multi-word Expressions
    Sergei Bagdasarov, Elke Teich

Poster 2

  • Hit or Be Hit: Tests of (Pre)Compositional Abilities in Vision and Language Models
    Mădălina Zgreabăn, Albert Gatt, Pablo Mosteiro
  • When AI Gets It Wrong: Exploring the Educational Value of Flawed Transcriptions in Language Pedagogy
    Anna Malin Gerke
  • Hybrid Feature-Embedding Models for Robust AI Text Detection
    Kasper Thomas Gartside Knudsen, Christian Hardmeier
  • Advancing German Language Modelling - Transparent Models and Comprehensive Benchmarks
    Jan Pfister, Julia Wunderle, Anton Ehrmanntraut, Fotis Jannidis, Andreas Hotho
  • Using LLMs for experimental stimulus pretests in linguistics. Evidence from semantic associations between words and social gender
    Christian Lang, Franziska Kretzschmar, Sandra Hansen
  • Developmentally plausible pretraining, now also auf Deutsch: a BabyLM Dataset for German
    Bastian Bunzeck, Daniel Duran, Sina Zarrieß
  • PETapter: Leveraging PET-style classification heads for modular few-shot parameter-efficient fine-tuning
    Jonas Rieger, Mattes Ruckdeschel, Gregor Wiedemann
  • Large Language Model Data Generation for Enhanced Intent Recognition in German Speech
    Theresa Pekarek Rosin, Burak Can Kaplan, Stefan Wermter
  • Mapping the Intertextual Networks of the Tang Liu Dian through Representation Learning and Large Language Models
    Yihan Sheng
  • Detecting Sexism and Its Severity in German Online Comments: Modeling Annotation Subjectivity with BERT and mBERT
    Melanie Woodrow, Margot Mieskes