Publications

Published research on deep learning, image captioning, and AI systems.

Moral Preferences of LLMs Under Directed Contextual Influence

P. Blandfort, T. Karayil, U. Pawar, R. Graham, A. McKenzie, D. Krasheninnikov

arXiv preprint · Feb 2026

Moral benchmarks for LLMs typically use context-free prompts, implicitly assuming stable preferences. We study how directed contextual influences reshape decisions in trolley-problem-style moral triage settings and find that contextual influences often significantly shift decisions, baseline preferences are a poor predictor of directional steerability, influences can backfire, and reasoning reduces average sensitivity but amplifies the effect of biased few-shot examples.

LLMsAI SafetyEthicsMoral Reasoning

Rethinking Software Design with Large Language Models Intelligent Interfaces

C. N. Coelho, H. Xiong, T. Karayil, S. Koratala, R. Shang, J. Bollinger, M. Shabar, S. Nair

International Conference on Multimodal Interaction (ICMI) · Apr 2025

The advancement of Large Language Models (LLMs) has led to a rapid expansion of their applications, including in software design. We propose a new approach to refine system specifications using natural language-based interfaces, enabling software engineers and architects to iteratively improve software development documentation, ensuring a clearer definition of system behavior, required resources, and dependencies.

LLMsSoftware EngineeringAI Agents

Effort and Size Estimation in Software Projects with Large Language Model-based Intelligent Interfaces

C. N. Coelho, H. Xiong, T. Karayil, S. Koratala, R. Shang, J. Bollinger, M. Shabar, S. Nair

arXiv preprint · Feb 2024

The advancement of Large Language Models (LLM) has also resulted in an equivalent proliferation in its applications. Through the example of UI-based user stories, we provide a comparison against traditional methods and propose a new way to enhance specifications of natural language-based questions that allows for the estimation of development effort by taking into account data sources, interfaces and algorithms.

LLMsSoftware EstimationAI

AttrLostGAN: Attribute Controlled Image Synthesis from Reconfigurable Layout and Style

S. Frolov, A. Sharma, J. Hees, T. Karayil, F. Raue, A. Dengel

German Conference on Pattern Recognition (GCPR) · Mar 2021

We propose a method for attribute controlled image synthesis from layout which allows to specify the appearance of individual objects without affecting the rest of the image. We extend a state-of-the-art approach for layout-to-image generation to additionally condition individual objects on attributes.

GANsImage SynthesisComputer Vision

IteROAR: Quantifying the Interpretation of Feature Importance Methods

S. M. Palacio, F. Raue, T. Karayil, J. Hees, A. Dengel

Technical Report · Jan 2021

We present IteROAR, a method for quantifying the interpretation quality of feature importance methods in deep learning, providing a systematic framework for evaluating how well these methods explain model predictions.

ExplainabilityDeep LearningFeature Importance

The Focus–Aspect–Value Model for Predicting Subjective Visual Attributes

P. Blandfort, T. Karayil, J. Hees, A. Dengel

International Journal of Multimedia Information Retrieval (IJMIR) · Jan 2020

We propose the Focus–Aspect–Value model for structuring the process of capturing subjectivity in image processing, addressing the challenge of predicting subjective visual attributes from images.

Visual AttributesSubjectivityDeep Learning

Conditional GANs for Image Captioning with Sentiments

T. Karayil, A. Irfan, F. Raue, J. Hees, A. Dengel

International Conference on Artificial Neural Networks (ICANN) · Sep 2019

We explore the use of conditional Generative Adversarial Networks for generating image captions that incorporate sentiment, combining visual understanding with affective language generation.

GANsImage CaptioningSentiment AnalysisDeep Learning

The Focus-Aspect-Value Model for Explainable Prediction of Subjective Visual Interpretation

T. Karayil, P. Blandfort, J. Hees, A. Dengel

International Conference on Multimedia Retrieval (ICMR) · Jun 2019

We propose the Focus-Aspect-Value (FAV) model to structure the process of capturing subjectivity in image processing, and introduce a novel dataset following this way of modeling. We find that incorporating context information based on tensor multiplication outperforms the default way of information fusion (concatenation).

ExplainabilityVisual AttributesDeep Learning

Fusion Strategies for Learning User Embeddings with Neural Networks

P. Blandfort, T. Karayil, F. Raue, J. Hees, A. Dengel

IEEE International Joint Conference on Neural Networks (IJCNN) · Jan 2019

We analyze the effect on embedding quality caused by several fusion strategies in neural networks for learning user embeddings from rating data. We propose Pair-Distance Correlation, a novel measure for evaluating embedding quality, and find that prediction performance not necessarily reflects embedding quality.

User EmbeddingsNeural NetworksFusion Strategies

The Focus-Aspect-Polarity Model for Predicting Subjective Noun Attributes in Images

T. Karayil, P. Blandfort, J. Hees, A. Dengel

arXiv preprint · Oct 2018

We propose the Focus-Aspect-Polarity model to structure the process of capturing subjectivity in image processing, and introduce a novel dataset. We find that incorporating context information based on tensor multiplication outperforms concatenation for information fusion.

Visual AttributesSubjectivityDeep Learning

Image Captioning in the Wild: How People Caption Images on Flickr

P. Blandfort, T. Karayil, D. Borth, A. Dengel

MUSA2 Workshop at ACM Multimedia · Oct 2017

We study how people naturally caption images on Flickr, analyzing real-world image captioning behavior to understand the gap between automated captioning systems and human descriptions in the wild.

Image CaptioningFlickrMultimedia

Generating Affective Captions using Concept And Syntax Transition Networks

T. Karayil, P. Blandfort, D. Borth, A. Dengel

ACM Multimedia · Oct 2016

We present a method for generating affective image captions using Concept and Syntax Transition Networks, combining visual concept detection with structured language generation to produce emotionally expressive descriptions.

Image CaptioningAffective ComputingNLP

Introducing Concept And Syntax Transition Networks for Image Captioning

P. Blandfort, T. Karayil, D. Borth, A. Dengel

International Conference on Multimedia Retrieval (ICMR) · Jun 2016

We introduce Concept and Syntax Transition Networks, a novel approach for image captioning that separates visual concept detection from syntactic arrangement, enabling more flexible and controllable caption generation.

Image CaptioningDeep LearningNLP

A Segmentation-Free Approach for Printed Devanagari Script Recognition

T. Karayil, A. Ul-Hasan, T. Breuel

IEEE International Conference on Document Analysis and Recognition (ICDAR) · Aug 2015

We propose a segmentation-free approach for recognizing printed Devanagari script, eliminating the need for character-level segmentation and enabling end-to-end recognition of Devanagari text.

OCRDevanagariDocument AnalysisDeep Learning