The University of Southampton
University of Southampton Institutional Repository

Conversational image search: a sketch-based approach

Conversational image search: a sketch-based approach
Conversational image search: a sketch-based approach
Conversational image search has emerged as a progressive step beyond traditional keyword-based methodologies, which addresses challenges in human-computer interaction during the information retrieval process. This paper introduces a demonstration called DoodleShoper, a forward-thinking conversational image search assistant centered around sketching, specifically tailored for online product searches. It underscores the importance of visual diversity, often eluding verbal expression while highlighting the efficacy of a sketch-based approach in enhancing user interaction. The proposed modular architecture integrates a state-of-the-art Language Model with advanced Stable Diffusion technologies in the image generation field to offer users a more intuitive and precise conversational search experience. Unlike most conventional methods that directly align prompts or sketches with images, our approach leverages a generative model to produce an intermediate search outcome. This strategic shift streamlines the search process from a zero-shot query - where the query directly corresponds to an image - to a reverse image search task, facilitating the discovery of similar images through multimodal interaction. The implemented demonstration involves refining and expanding the application to diverse user information needs and preferences, including exploring the potential of utilising sketches as an alternative or complementary search environment, a novel concept rooted in current research.
1265-1269
Braghis, Daniel D.
4b849bf2-79d6-45d0-a4fb-f9f8633898aa
Liu, Haiming
3ed791e3-9f1e-417e-a531-7faf19cca547
Braghis, Daniel D.
4b849bf2-79d6-45d0-a4fb-f9f8633898aa
Liu, Haiming
3ed791e3-9f1e-417e-a531-7faf19cca547

Braghis, Daniel D. and Liu, Haiming (2024) Conversational image search: a sketch-based approach. In Proceedings of the 2024 International Conference on Multimedia Retrieval (ICMR). pp. 1265-1269 . (doi:10.1145/3652583.3657594).

Record type: Conference or Workshop Item (Paper)

Abstract

Conversational image search has emerged as a progressive step beyond traditional keyword-based methodologies, which addresses challenges in human-computer interaction during the information retrieval process. This paper introduces a demonstration called DoodleShoper, a forward-thinking conversational image search assistant centered around sketching, specifically tailored for online product searches. It underscores the importance of visual diversity, often eluding verbal expression while highlighting the efficacy of a sketch-based approach in enhancing user interaction. The proposed modular architecture integrates a state-of-the-art Language Model with advanced Stable Diffusion technologies in the image generation field to offer users a more intuitive and precise conversational search experience. Unlike most conventional methods that directly align prompts or sketches with images, our approach leverages a generative model to produce an intermediate search outcome. This strategic shift streamlines the search process from a zero-shot query - where the query directly corresponds to an image - to a reverse image search task, facilitating the discovery of similar images through multimodal interaction. The implemented demonstration involves refining and expanding the application to diverse user information needs and preferences, including exploring the potential of utilising sketches as an alternative or complementary search environment, a novel concept rooted in current research.

This record has no associated files available for download.

More information

e-pub ahead of print date: 7 June 2024

Identifiers

Local EPrints ID: 503513
URI: http://eprints.soton.ac.uk/id/eprint/503513
PURE UUID: fc5b7a40-f34a-4509-89f1-29fdf257fc0e
ORCID for Haiming Liu: ORCID iD orcid.org/0000-0002-0390-3657

Catalogue record

Date deposited: 04 Aug 2025 16:50
Last modified: 05 Aug 2025 02:04

Export record

Altmetrics

Contributors

Author: Daniel D. Braghis
Author: Haiming Liu ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×