LibreTechni.ca
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
cm0002@toast.ooo to Programmer Humor@programming.dev · 1 day ago

Frog put Claude in a box

lemmy.ml

message-square
40
fedilink
  • cross-posted to:
  • programmerhumor@lemmy.ml
746

Frog put Claude in a box

lemmy.ml

cm0002@toast.ooo to Programmer Humor@programming.dev · 1 day ago
message-square
40
fedilink
  • cross-posted to:
  • programmerhumor@lemmy.ml
  • SavvyWolf@pawb.social
    link
    fedilink
    English
    arrow-up
    50
    ·
    1 day ago

    I hate that I can’t tell if this is a reference to something that actually happened or not.

    • verstra@programming.dev
      link
      fedilink
      arrow-up
      42
      ·
      1 day ago

      It’s probably something like “I’ve disabled agent’s removeFile tool, but LLM figured out that it can use the bash tool, still”.

      It looks like “AI bad” or “Claude insecure” mantra.

      • OwOarchist@pawb.social
        link
        fedilink
        English
        arrow-up
        36
        ·
        23 hours ago

        It looks like “AI bad” or “Claude insecure” mantra.

        Until you solve prompt injection, they are indeed extremely bad for security and should never be given permissions that would allow them to do anything catastrophic.

      • dumnezero@piefed.social
        link
        fedilink
        English
        arrow-up
        76
        ·
        1 day ago

        mantra

        you mean facts?

        • Scipitie@lemmy.dbzer0.com
          link
          fedilink
          arrow-up
          18
          ·
          1 day ago

          “It’s my circlejerk - so it’s a fact!”

          • dumnezero@piefed.social
            link
            fedilink
            English
            arrow-up
            41
            ·
            1 day ago

            I hope that you’re hired for long enough to learn what having security means in the context of using LLM “agents” and the like.

        • kingofras@lemmy.world
          link
          fedilink
          arrow-up
          1
          ·
          19 hours ago

          deleted by creator

      • kingofras@lemmy.world
        link
        fedilink
        arrow-up
        8
        ·
        19 hours ago

        mantra

        The way LLMs work is that they actively will make multiple attempts to get past hurdles (because they have no intelligence or methodology) so guardrails need to be extremely tight for them to work, other wise the model will simply see it as one of the challenges to overcome.

        That’s the mantra, and that is very poor technology to put in the hands of people who don’t understand how it works.

      • kingofras@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        19 hours ago

        deleted by creator

    • goondaba@lemmy.world
      link
      fedilink
      arrow-up
      10
      ·
      1 day ago

      https://xcancel.com/sluongng/status/2060746160558543217#m

Programmer Humor@programming.dev

programmer_humor@programming.dev

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !programmer_humor@programming.dev

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

  • Keep content in english
  • No advertisements
  • Posts must be related to programming or programmer topics
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1.27K users / day
  • 3.34K users / week
  • 7.09K users / month
  • 17.1K users / 6 months
  • 1 local subscriber
  • 32K subscribers
  • 1.35K Posts
  • 26.8K Comments
  • Modlog
  • mods:
  • adr1an@programming.dev
  • Feyter@programming.dev
  • BurningTurtle@programming.dev
  • Pierre-Yves Lapersonne@programming.dev
  • BE: 0.19.5
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org