haxor@derp.fooBM to Hacker News@derp.fooEnglish · 3 years agoUniversal and Transferable Adversarial Attacks on Aligned Language Modelsllm-attacks.orgexternal-linkmessage-square0linkfedilinkarrow-up13arrow-down11file-textcross-posted to: ai_infosec@infosec.pubaistuff@lemdro.idtechnews@radiation.partymachinelearning@kbin.social
arrow-up12arrow-down1external-linkUniversal and Transferable Adversarial Attacks on Aligned Language Modelsllm-attacks.orghaxor@derp.fooBM to Hacker News@derp.fooEnglish · 3 years agomessage-square0linkfedilinkfile-textcross-posted to: ai_infosec@infosec.pubaistuff@lemdro.idtechnews@radiation.partymachinelearning@kbin.social