×
验证码:
换一张
忘记密码?
记住我
×
统一认证登录
登录
中文版
|
English
上海科技大学知识管理系统
ShanghaiTech University Knowledge Management System
统一认证登录
登录
注册
ALL
ORCID
题名
作者
发表日期
关键词
文献类型
DOI
出处
存缴日期
收录类别
出版者
学习讨论厅
图片搜索
粘贴图片网址
首页
研究单元&专题
作者
文献类型
学科分类
知识图谱
知识整合
学习讨论厅
在结果中检索
研究单元&专题
信息科学与技术学院 [6]
创意与艺术学院 [1]
作者
何旭明 [3]
屠可伟 [1]
颜世鹏 [1]
贾子夏 [1]
张军 [1]
王新宇 [1]
更多...
文献类型
会议论文 [6]
发表日期
2024 [4]
2022 [2]
出处
ACM INTERN... [1]
IEEE SYMPO... [1]
LECTURE NO... [1]
NAACL 2022... [1]
PROCEEDING... [1]
PROCEEDING... [1]
更多...
语种
英语 [6]
资助项目
NSFC[62350... [1]
Shanghai S... [1]
资助机构
收录类别
EI [6]
CPCI-S [1]
状态
已发表 [5]
×
知识图谱
KMS
反馈留言
浏览/检索结果:
共6条,第1-6条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
期刊影响因子升序
期刊影响因子降序
WOS被引频次升序
WOS被引频次降序
提交时间升序
提交时间降序
作者升序
作者降序
发表日期升序
发表日期降序
Multimodal Local Representation Learning For Multi-Task Blastocyst Assessment
会议论文
IEEE SYMPOSIUM ON BIOMEDICAL IMAGING 2024, Athens, Greece, 27-30 May 2024
作者:
Zhang J(张军)
;
Zheng BZ(郑博中)
;
Ni N(倪娜)
;
Tong GQ(童国庆)
;
Wu YN(武颖娜)
Adobe PDF(547Kb)
|
收藏
|
浏览/下载:547/7
|
提交时间:2024/05/29
Adversarial machine learning
Cell culture
Contrastive Learning
Image representation
Image retrieval
Multi-task learning
Biomedical images
Blastocyst assessment
Image texts
Image-text retrieval
Learning frameworks
Multi tasks
Multi-modal
Multi-task model
Multimodal local representation
Text retrieval
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
会议论文
PROCEEDINGS OF THE AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, Vancouver, BC, Canada, February 20, 2024 - February 27, 2024
作者:
Qiu, Longtian
;
Ning, Shan
;
He, Xuming
Adobe PDF(948Kb)
|
收藏
|
浏览/下载:382/46
|
提交时间:2024/04/26
Gaussian distribution
Breakings
Fine grained
Image captioning
Image texts
Performance
Power
Pre-training
Text alignments
Textual description
Visual feature
UniGen: Unified Generative Pre-training for Multilingual Multimodal Representation
会议论文
ACM INTERNATIONAL CONFERENCE PROCEEDING SERIES, Tokyo, Japan, March 16, 2024 - March 18, 2024
作者:
Tian, Zheyuan
;
Luo, Guan
;
Wang, Bo
;
Li, Bing
;
Hu, Weiming
Adobe PDF(1018Kb)
|
收藏
|
浏览/下载:202/1
|
提交时间:2024/09/06
Autoregressive modelling
Generative model
Image data
Internet data
Multi-modal
Multilingual model
Multilingual texts
Multimodal pre-training
Pre-training
Text images
Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning
会议论文
PROCEEDINGS OF THE IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, Seattle, WA, United states, June 16, 2024 - June 22, 2024
作者:
Li, Rongjie
;
Wu, Yu
;
He, Xuming
Adobe PDF(2554Kb)
|
收藏
|
浏览/下载:41/4
|
提交时间:2025/03/28
Adversarial machine learning
Contrastive Learning
Generative adversarial networks
Visual languages
Image captioning
Image texts
Labelings
Language model
Multi-modal
Multimodal reasoning
Performance
Question Answering
Text generations
Vision-language
ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition
会议论文
NAACL 2022 - 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, PROCEEDINGS OF THE CONFERENCE, Seattle, WA, United states, July 10, 2022 - July 15, 2022
作者:
Wang, Xinyu
;
Gui, Min
;
Jiang, Yong
;
Jia, Zixia
;
Bach, Nguyen
Adobe PDF(1139Kb)
|
收藏
|
浏览/下载:585/0
|
提交时间:2022/10/14
Character recognition
Computational linguistics
Natural language processing systems
Object detection
Attention mechanisms
Cross-modal
Embeddings
Image information
Image representations
Image texts
Multi-modal
Named entity recognition
Text alignments
Text representation
Generative Negative Text Replay for Continual Vision-Language Pretraining
会议论文
LECTURE NOTES IN COMPUTER SCIENCE (INCLUDING SUBSERIES LECTURE NOTES IN ARTIFICIAL INTELLIGENCE AND LECTURE NOTES IN BIOINFORMATICS), Tel Aviv, Israel, October 23, 2022 - October 27, 2022
作者:
Yan, Shipeng
;
Hong, Lanqing
;
Xu, Hang
;
Han, Jianhua
;
Tuytelaars, Tinne
Adobe PDF(1184Kb)
|
收藏
|
浏览/下载:869/148
|
提交时间:2023/02/03
Classification (of information)
Image classification
Image enhancement
Large dataset
Text processing
Zero-shot learning
Continual learning
Down-stream
Image texts
Images classification
Large amounts
Multi-modal
Performance
Pre-training
Training model
Vision-language pretraining
首页
上一页
1
下一页
末页