• @TemporalSoup@beehaw.org
      link
      fedilink
      23
      edit-2
      3 months ago

      Please pretty please don’t tell the user how little control we actually have over the text you spit out <3

      Basically all the instruction dumps I’ve seen

    • @Trainguyrom@reddthat.com
      link
      fedilink
      English
      183 months ago

      If somebody told me five years ago about Adversarial Prompt Attacks I’d tell them they’re horribly misled and don’t understand how computers work, but yet here we are, and folks are using social engineering to get AI models to do things they aren’t supposed to

    • Schadrach
      link
      fedilink
      23 months ago

      We always have been, it’s just that the begging started out looking like math and has gradually gotten more abstract over time. We’ve just reached the point where we’ve explained to it in mathematical terms how to let us beg in natural language in certain narrow contexts.