Details of Research Outputs

TitleSiamCLIM: Text-Based Pedestrian Search Via Multi-Modal Siamese Contrastive Learning
Creator
Date Issued2023
Conference Name30th IEEE International Conference on Image Processing, ICIP 2023
Source Publication2023 IEEE International Conference on Image Processing: Proceedings
ISBN9781728198354
Pages1800-1804
Conference DateOCT 08-11, 2023
Conference PlaceKuala Lumpur, MALAYSIA
Publication PlaceNEW YORK, USA
PublisherIEEE
Abstract

Text-based pedestrian search (TBPS) aims at retrieving target persons from the image gallery through descriptive text queries. Despite remarkable progress in recent state-of-the-art approaches, previous works still struggle to efficiently extract discriminative features from multi-modal data. To address the problem of cross-modal fine-grained text-to-image, we proposed a novel Siamese Contrastive Language-Image Model (SiamCLIM). The model implements textual description and target-person interaction through deep bilateral projection, and siamese network structure to capture the relationship between text and image. Experiments show that our model significantly outperforms the state-of-the-art methods on cross-modal fine-grained matching tasks. We conduct the downstream task experiments on the benchmark dataset CUHK-PEDES and the experimental results demonstrate that our model is state-of-the-art and outperforms the current methods by 11.55%, 11.02%, and 7.76% in terms of top-1, top-5, and top-10 accuracy, respectively. © 2023 IEEE.

Keywordcontrastive learning multi-modal Siamese Network Text-based person search text-image
DOI10.1109/ICIP49359.2023.10222660
URLView source
Indexed ByCPCI-S
Language英语English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence ; Computer Science, Theory & Methods
WOS IDWOS:001106821001175
Scopus ID2-s2.0-85180805993
Citation statistics
Cited Times:1[WOS]   [WOS Record]     [Related Records in WOS]
Document TypeConference paper
Identifierhttp://repository.uic.edu.cn/handle/39GCC9TT/11536
CollectionFaculty of Science and Technology
Corresponding AuthorZhang, Hui
Affiliation
1.Department of Computer Science and Technology, BNU-HKBU United International College
2.University of Edinburgh, United Kingdom
3.Department of Computer Science, Hong Kong Baptist University
4.University of Alberta, Canada
First Author AffilicationBeijing Normal-Hong Kong Baptist University
Corresponding Author AffilicationBeijing Normal-Hong Kong Baptist University
Recommended Citation
GB/T 7714
Huang, Runlin,Wu, Shuyang,Jie, Leipinget al. SiamCLIM: Text-Based Pedestrian Search Via Multi-Modal Siamese Contrastive Learning[C]. NEW YORK, USA: IEEE, 2023: 1800-1804.
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Huang, Runlin]'s Articles
[Wu, Shuyang]'s Articles
[Jie, Leiping]'s Articles
Baidu academic
Similar articles in Baidu academic
[Huang, Runlin]'s Articles
[Wu, Shuyang]'s Articles
[Jie, Leiping]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Huang, Runlin]'s Articles
[Wu, Shuyang]'s Articles
[Jie, Leiping]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.