Conversational image search: a sketch-based approach
Conversational image search: a sketch-based approach
Conversational image search has emerged as a progressive step beyond traditional keyword-based methodologies, which addresses challenges in human-computer interaction during the information retrieval process. This paper introduces a demonstration called DoodleShoper, a forward-thinking conversational image search assistant centered around sketching, specifically tailored for online product searches. It underscores the importance of visual diversity, often eluding verbal expression while highlighting the efficacy of a sketch-based approach in enhancing user interaction. The proposed modular architecture integrates a state-of-the-art Language Model with advanced Stable Diffusion technologies in the image generation field to offer users a more intuitive and precise conversational search experience. Unlike most conventional methods that directly align prompts or sketches with images, our approach leverages a generative model to produce an intermediate search outcome. This strategic shift streamlines the search process from a zero-shot query - where the query directly corresponds to an image - to a reverse image search task, facilitating the discovery of similar images through multimodal interaction. The implemented demonstration involves refining and expanding the application to diverse user information needs and preferences, including exploring the potential of utilising sketches as an alternative or complementary search environment, a novel concept rooted in current research.
1265-1269
Braghis, Daniel D.
4b849bf2-79d6-45d0-a4fb-f9f8633898aa
Liu, Haiming
3ed791e3-9f1e-417e-a531-7faf19cca547
Braghis, Daniel D.
4b849bf2-79d6-45d0-a4fb-f9f8633898aa
Liu, Haiming
3ed791e3-9f1e-417e-a531-7faf19cca547
Braghis, Daniel D. and Liu, Haiming
(2024)
Conversational image search: a sketch-based approach.
In Proceedings of the 2024 International Conference on Multimedia Retrieval (ICMR).
.
(doi:10.1145/3652583.3657594).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Conversational image search has emerged as a progressive step beyond traditional keyword-based methodologies, which addresses challenges in human-computer interaction during the information retrieval process. This paper introduces a demonstration called DoodleShoper, a forward-thinking conversational image search assistant centered around sketching, specifically tailored for online product searches. It underscores the importance of visual diversity, often eluding verbal expression while highlighting the efficacy of a sketch-based approach in enhancing user interaction. The proposed modular architecture integrates a state-of-the-art Language Model with advanced Stable Diffusion technologies in the image generation field to offer users a more intuitive and precise conversational search experience. Unlike most conventional methods that directly align prompts or sketches with images, our approach leverages a generative model to produce an intermediate search outcome. This strategic shift streamlines the search process from a zero-shot query - where the query directly corresponds to an image - to a reverse image search task, facilitating the discovery of similar images through multimodal interaction. The implemented demonstration involves refining and expanding the application to diverse user information needs and preferences, including exploring the potential of utilising sketches as an alternative or complementary search environment, a novel concept rooted in current research.
This record has no associated files available for download.
More information
e-pub ahead of print date: 7 June 2024
Identifiers
Local EPrints ID: 503513
URI: http://eprints.soton.ac.uk/id/eprint/503513
PURE UUID: fc5b7a40-f34a-4509-89f1-29fdf257fc0e
Catalogue record
Date deposited: 04 Aug 2025 16:50
Last modified: 05 Aug 2025 02:04
Export record
Altmetrics
Contributors
Author:
Daniel D. Braghis
Author:
Haiming Liu
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics