ShanghaiTech University Knowledge Management System
Research output
Representative works (1)
Preprints (6)
Conference papers (5)
Source
Arxiv(6)
2024 ANNUAL CONFERENCE OF ...(1)
2024 ASSOCIATION FOR THE A...(1)
2025 ASSOCIATION FOR THE A...(1)
ASSOCIATION FOR COMPUTING ...(1)
FINDINGS OF THE ASSOCIATIO...(1)
Indexed in
EI(5)
PPRN.PPRN(3)
CPCI-S(2)
Publication date
2025(3)
2024(7)
2023(1)
Co-authors
徐悦: 5 co-authored works
翁丰华: 3 co-authored works
齐修远: 2 co-authored works
Huang, Minlie: 2 co-authored works
Qin, Zhan: 2 co-authored works
徐越: 1 co-authored work
邵元明: 1 co-authored work
傅铖彦: 1 co-authored work
邱红叶: 1 co-authored work
Feng, Jun: 1 co-authored work
Jian Lou: 1 co-authored work
Lance Waller: 1 co-authored work
Li Xiong: 1 co-authored work
Lou, Jian: 1 co-authored work
Pengfei Tang: 1 co-authored work
Qiu, Meikang: 1 co-authored work
Wang, Yi: 1 co-authored work
Xu, Yue: 1 co-authored work
Yang, Sibei: 1 co-authored work
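Visit statistics
Total visits: 370
Source: internal 15, external 355; domestic 340, overseas 30
Annual visits: 94
Source: internal 7, external 87; domestic 79, overseas 15
Monthly visits: 20
Source: internal 6, external 14; domestic 10, overseas 10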
Views
1. LinkPrompt: Natural and Universal Adversarial Attacks on Prompt-ba.. [405]
2. IGAMT: Privacy-Preserving Electronic Health Record Synthesization .. [362]
3. Demo: Certified Robustness on Toolformer [359]
4. MMJ-Bench: A Comprehensive Study on Jailbreak Attacks and Defenses.. [343]
5. Defending Jailbreak Attack in VLMs via Cross-modality Information .. [318]
6. LinkPrompt: Natural and Universal Adversarial Attacks on Prompt-ba.. [295]
7. Don't Say No: Jailbreaking LLM by Suppressing Refusal [286]
8. Cross-modality Information Check for Detecting Jailbreaking in Mul.. [205]
9. DELMAN: Dynamic Defense Against Large Language Model Jailbreaking .. [13]
10. DR.GAP: Mitigating Bias in Large Language Models using Gender-Awar.. [12]
11. Adversary-Aware DPO: Enhancing Safety Alignment in Vision Language.. [9]
Downloads
1. MMJ-Bench: A Comprehensive Study on Jailbreak Attacks and Defenses.. [6]
2. LinkPrompt: Natural and Universal Adversarial Attacks on Prompt-ba.. [2]
3. IGAMT: Privacy-Preserving Electronic Health Record Synthesization .. [2]
4. Cross-modality Information Check for Detecting Jailbreaking in Mul.. [2]
5. Demo: Certified Robustness on Toolformer [1]
6. LinkPrompt: Natural and Universal Adversarial Attacks on Prompt-ba.. [1]
7. Defending Jailbreak Attack in VLMs via Cross-modality Information .. [1]
Research output
Items: 11
Views: 2607
Downloads: 15
TC[WOS]: 1
TC[CSCD]: 0
H-index: 1
[1] Weng, Fenghua; Lou, Jian; Feng, Jun; et al. Adversary-Aware DPO: Enhancing Safety Alignment in Vision Language Models via Adversarial Training. 2025.
Views/Downloads: 9/0; Times cited [WOS]: 0
[2] Wang, Yi; Weng, Fenghua; Yang, Sibei; et al. DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing. 2025.
Views/Downloads: 13/0; Times cited [WOS]: 0
[3] Qiu, Hongye; Xu, Yue; Qiu, Meikang; et al. DR.GAP: Mitigating Bias in Large Language Models using Gender-Aware Prompting with Demonstration and Reasoning. 2025.
Views/Downloads: 12/0; Times cited [WOS]: 0
[4] Weng, Fenghua; Xu, Yue; Fu, Chengyan; et al. MMJ-Bench: A Comprehensive Study on Jailbreak Attacks and Defenses for Vision Language Models[C]. 2025 Association for the Advancement of Artificial Intelligence. 2024-08-16.
Views/Downloads: 343/6; Times cited [WOS]: 0
[5] Xu, Yue; Qi, Xiuyuan; Qin, Zhan; et al. Defending Jailbreak Attack in VLMs via Cross-modality Information Detector. 2024.
Views/Downloads: 318/1; Times cited [WOS]: 0
[6] Xu Y, Qi XY, Qin Z, et al. Cross-modality Information Check for Detecting Jailbreaking in Multimodal Large Language Models[C]. Findings of the Association for Computational Linguistics: EMNLP 2024. United States. 2024-08-01.
Views/Downloads: 205/2; Times cited [WOS]: 0
[7] Zhou, Yukai; Wang, Wenjie. Don't Say No: Jailbreaking LLM by Suppressing Refusal. 2024.
Views/Downloads: 286/0; Times cited [WOS]: 0
[8] Xu, Yue; Wang, Wenjie. LinkPrompt: Natural and Universal Adversarial Attacks on Prompt-based Language Models. 2024.
Views/Downloads: 295/1; Times cited [WOS]: 0
[9] Xu Y, Wang WJ. LinkPrompt: Natural and Universal Adversarial Attacks on Prompt-based Language Models[C]. 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics (ACL). 2024-03-01, 6473-6486.
Views/Downloads: 405/2
[10] Wang WJ, Pengfei Tang, Jian Lou, et al. IGAMT: Privacy-Preserving Electronic Health Record Synthesization with Heterogeneity and Irregularity[C]. 2024 Association for the Advancement of Artificial Intelligence. 2275 E Bayshore Rd, Ste 160, Palo Alto, CA 94303 USA. Association for the Advancement of Artificial Intelligence. 2024-03-01, 15634-15643.
Views/Downloads: 362/2; Times cited [WOS]: 1