KMS

浏览/检索结果: 共6条,第1-6条 帮助

已选(0)清除 条数/页:   排序方式:
NFT1000: A Cross-Modal Dataset for Non-Fungible Token Retrieval 预印本
2024
作者:  
收藏  |  浏览/下载:146/0  |  提交时间:2024/11/11
MIBench: Evaluating Multimodal Large Language Models over Multiple Images 预印本
2024
作者:  Liu, Haowei;  Zhang, Xi;  Xu, Haiyang;  Shi, Yaya;  Jiang, Chaoya
Adobe PDF(3530Kb)  |  收藏  |  浏览/下载:147/0  |  提交时间:2024/08/14
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval 会议论文
2024 JOINT INTERNATIONAL CONFERENCE ON COMPUTATIONAL LINGUISTICS, LANGUAGE RESOURCES AND EVALUATION, LREC-COLING 2024 - MAIN CONFERENCE PROCEEDINGS, Hybrid, Torino, Italy, May 20, 2024 - May 25, 2024
作者:  Liu, Haowei;  Shi, Yaya;  Xu, Haiyang;  Yuan, Chunfeng;  Ye, Qinghao
Adobe PDF(2118Kb)  |  收藏  |  浏览/下载:153/2  |  提交时间:2024/07/05
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training 会议论文
2024 JOINT INTERNATIONAL CONFERENCE ON COMPUTATIONAL LINGUISTICS, LANGUAGE RESOURCES AND EVALUATION, LREC-COLING 2024 - MAIN CONFERENCE PROCEEDINGS, Hybrid, Torino, Italy, May 20, 2024 - May 25, 2024
作者:  Liu, Haowei
收藏  |  浏览/下载:174/0  |  提交时间:2024/09/06
Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval 会议论文
MM 2023 - PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, Ottawa, ON, Canada, October 29, 2023 - November 3, 2023
作者:  Shi, Yaya;  Liu, Haowei;  Xu, Haiyang;  Ma, Zongyang;  Ye, Qinghao
Adobe PDF(10551Kb)  |  收藏  |  浏览/下载:252/1  |  提交时间:2024/01/19
TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 8, 页码: 4137-4148
作者:  Haowei Liu;  Yongcheng Liu;  Yuxin Chen;  Chunfeng Yuan;  Bing Li
Adobe PDF(17843Kb)  |  收藏  |  浏览/下载:231/0  |  提交时间:2023/09/08
  • 首页
  • 上一页
  • 1
  • 下一页
  • 末页