Title | SiamCLIM: Text-Based Pedestrian Search Via Multi-Modal Siamese Contrastive Learning |
Creator | |
Date Issued | 2023 |
Conference Name | 30th IEEE International Conference on Image Processing, ICIP 2023 |
Source Publication | 2023 IEEE International Conference on Image Processing: Proceedings |
ISBN | 9781728198354 |
Pages | 1800-1804 |
Conference Date | OCT 08-11, 2023 |
Conference Place | Kuala Lumpur, MALAYSIA |
Publication Place | NEW YORK, USA |
Publisher | IEEE |
Abstract | Text-based pedestrian search (TBPS) aims to retrieve target persons from an image gallery using descriptive text queries. Despite remarkable progress in recent state-of-the-art approaches, previous works still struggle to efficiently extract discriminative features from multi-modal data. To address the problem of cross-modal fine-grained text-to-image matching, we propose a novel Siamese Contrastive Language-Image Model (SiamCLIM). The model implements interaction between the textual description and the target person through deep bilateral projection, and uses a siamese network structure to capture the relationship between text and image. Experiments show that our model significantly outperforms the state-of-the-art methods on cross-modal fine-grained matching tasks. We conduct downstream task experiments on the benchmark dataset CUHK-PEDES, and the experimental results demonstrate that our model outperforms current methods by 11.55%, 11.02%, and 7.76% in terms of top-1, top-5, and top-10 accuracy, respectively. © 2023 IEEE. |
Keyword | contrastive learning ; multi-modal ; Siamese Network ; Text-based person search ; text-image |
DOI | 10.1109/ICIP49359.2023.10222660 |
Indexed By | CPCI-S |
Language | English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Theory & Methods |
WOS ID | WOS:001106821001175 |
Scopus ID | 2-s2.0-85180805993 |
Document Type | Conference paper |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/11536 |
Collection | Faculty of Science and Technology |
Corresponding Author | Zhang, Hui |
Affiliation | 1.Department of Computer Science and Technology, BNU-HKBU United International College 2.University of Edinburgh, United Kingdom 3.Department of Computer Science, Hong Kong Baptist University 4.University of Alberta, Canada |
First Author Affiliation | Beijing Normal-Hong Kong Baptist University |
Corresponding Author Affiliation | Beijing Normal-Hong Kong Baptist University |
Recommended Citation GB/T 7714 | Huang, Runlin, Wu, Shuyang, Jie, Leiping, et al. SiamCLIM: Text-Based Pedestrian Search Via Multi-Modal Siamese Contrastive Learning[C]. NEW YORK, USA: IEEE, 2023: 1800-1804. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.