Stochastic Parrots Looking for Stochastic Parrots: LLMs Are Easy to Fine-Tune and Hard to Detect with Other LLMs (English version)

Posted by: wx****e6

2023-04-21

1 MB, 32 pages

Artificial Intelligence (AI)

File list:

随机鹦鹉寻找随机鹦鹉：LLMs 易调优且难以被其他 LLMs 检测出【英文版】.pdf

Resource overview

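English title: Stochastic Parrots Looking for Stochastic Parrots: LLMs are Easy to Fine-Tune and Hard to Detect with other LLMs

Summary (translated from Chinese): This paper studies the weaknesses of current detection tools for large language models. It finds that an attacker who combines a reinforcement-from-critic optimization method with the AdamW optimizer can easily evade detection and degrade the detector, a result with important implications for guarding against malicious use.

English abstract: The self-attention revolution allowed generative language models to scale and achieve increasingly impressive abilities. Such models - commonly referred to as Large Language Models (LLMs) - have recently gained prominence with the general public, thanks to conver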
