• psud@aussie.zone
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    1 day ago

    I got a couple of good ones:

    A correct AI generated clock

    Another correct AI clock

    When I first opened it, deepseek also had a correct clock, but I accidentally refreshed the page when I scrolled up to double check its time, second time through only these two were right

    Ed. Names are below the clock, the top one was by the clock drawing champion Kimi K2

  • BlackEco@lemmy.blackeco.com
    link
    fedilink
    arrow-up
    90
    ·
    edit-2
    3 days ago

    Is it just me or the clocks frequently break or change appearance without the page being refreshed?

    Edit: nevermind, I skipped past the sentence explaining that every minute, the site prompts LLMs for a new solution. This is hilariously sad how LLMs aren’t able to be consistent from one prompt to another.

    • Carighan Maconar@piefed.world
      link
      fedilink
      English
      arrow-up
      42
      ·
      3 days ago

      It’s the expected result if your big ol’ artificial intelligence wannabe is ultimately just a stochastic word combinator.

    • vrighter@discuss.tchncs.de
      link
      fedilink
      arrow-up
      17
      ·
      3 days ago

      if every single token is, at the end, chosen by random dice roll (and they are) then this is exactly what you’d expect.

    • null@piefed.nullspace.lol
      link
      fedilink
      English
      arrow-up
      5
      ·
      3 days ago

      This is hilariously sad how LLMs aren’t able to be consistent from one prompt to another.

      Typically that’s configurable. Like for a chatbot, you’d want it to give the same/similar results for a given question, where with a character creator, you might want the results to vary so you can re-run until you get something you like.

      Of course that wouldn’t be as funny here.

    • vyzu@feddit.fr
      link
      fedilink
      Français
      arrow-up
      1
      ·
      3 days ago

      You can click on the button in the top right corner (with a question mark) to have explanations. The clocks are refreshed every minute

    • BatmanAoD@programming.dev
      link
      fedilink
      arrow-up
      10
      ·
      edit-2
      3 days ago

      Thanks for sharing this! I really think that when people see LLM failures and say that such failures demonstrate how fundamentally different LLMs are from human cognition, they tend to overlook how humans actually do exhibit remarkably similar failures modes. Obviously dementia isn’t really analogous to generating text while lacking the ability to “see” a rendering based on that text. But it’s still pretty interesting that whatever feedback loops did get corrupted in these patients led to such a variety of failure modes.

      As an example of what I’m talking about, I appreciated and generally agreed with this recent Octomind post, but I disagree with the list of problems that “wouldn’t trip up a human dev”; these are all things I’ve seen real humans do, or could imagine a human doing.

      • huppakee@piefed.social
        link
        fedilink
        English
        arrow-up
        4
        ·
        3 days ago

        such a variety of failure modes

        What i find interesting is that in both cases there is a certain consistency in the mistakes too - basically every dementia patient still understands the clock is something with a circle and numbers and not a square with letters for example. LLMs can tell you cokplete bullshit, but still understands it has to be done with perfect grammar in a consistant language. So much so it struggles to respond outside of this box - ask it to insert spelling errors to look human for example.

        the ability to “see”

        This might be the true problem in both cases, both the patient and the model can not comprehend the bigger picture (a circle is divided into 12 segments, because that is how we deconstructed the time it takes for the earth to spin around it’s axis). Things that seem logical to use, are logical because of these kind of connections with other things we know and comprehend.

    • ZC3rr0r@lemmy.ca
      link
      fedilink
      arrow-up
      1
      ·
      3 days ago

      Thanks for sharing that mindfuck. I honestly would’ve thought something was wrong with my cognition if you hadn’t mentioned it was a test beforehand.

  • NeatNit@discuss.tchncs.de
    link
    fedilink
    arrow-up
    58
    ·
    3 days ago

    The last one, Kimi K2, has been consistently good as long as I’ve been looking at it. That’s pretty impressive.

    The rest are hilarious!

      • SlurpingPus@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        edit-2
        2 days ago

        I dig the square clock, and am now sad that the numbers can’t be put into the corners on a real clock. Unless they’re shifted from the usual position.

        • huppakee@piefed.social
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          2 days ago

          Cartier found a work around quite some time ago and maybe they weren’t even the first to design a square ‘clock’:

          (The roman numerals are nice, but notice the ‘circle’ between the numerals and the hands, almost like the circle from the ai)

          • SlurpingPus@lemmy.world
            link
            fedilink
            arrow-up
            2
            ·
            edit-2
            2 days ago

            That one is pretty good, though the Roman numerals are rather busy and uneven.

            This one is closer, though now I have to wonder if all non-square rectangular clocks have an old-timey whiff for me, or it’s just the border here:

            This is also impressive:

            • huppakee@piefed.social
              link
              fedilink
              English
              arrow-up
              2
              ·
              1 day ago

              I like the second one a lot, especially how the upper and bottom numerals face the floor and the left and right ones face towards the center, and to allow for that there has to be a sudden flip from 3>4 and 8>9. But the indices are not playing by the square-clock rule and unlike the cartier one form a regular oval shape.

              I like how the upper one had to find a way to make clear which indice represents the numerals - it really shows the problem in projecting the circular movement of the hands into a rectangular (thanks, that’s the right word) shape.

              It think most analog clocks/watches will give you an old-timey whiff much more often than not, just because there is a more new-timey alternative. I went looking for some watch faces for smart watches, but couldn’t really find any interesting one. Most are either digital numbers or a round clock on a rectangular display.

              A clock face for an apple watch branded with Hermes A clock face for an Apple watch branded with Rolex

              Neither of those interest me like the Cartier tank, which I find really ugly watches to be honest. It’s just this double outlined rectangle(-ish shape) which is unevenly split into 60 boxes that I like (seen below on the first, third and fifth watch).

              Six different Cartier watches in one image

    • Hazzard@lemmy.zip
      link
      fedilink
      arrow-up
      12
      ·
      3 days ago

      Haha, I found myself thinking the same thing, and then caught myself, realizing all the other LLMs on this page had lowered the bar immensely for what I’m considering impressive.

    • Enkrod@feddit.org
      link
      fedilink
      arrow-up
      11
      ·
      3 days ago

      I thought the same and then Kimi K2 came up with a clock that has two 12 and no 11…

    • snooggums@piefed.world
      link
      fedilink
      English
      arrow-up
      18
      ·
      3 days ago

      qwen 2.5 is absolutely pants on head ridiculous compared to gpt5 when I’m looking at it right now.

  • sheepishly@fedia.io
    link
    fedilink
    arrow-up
    21
    ·
    3 days ago

    Given that the AI models are basically constructing these “blindly”- using the language model to string together html and javascript without really being able to check how it looks- some of these are actually pretty impressive. But also making the AI do things it’s bad at is funny. Reminds me of all the AI ASCII art fails…

  • rekabis@lemmy.ca
    link
    fedilink
    arrow-up
    2
    ·
    2 days ago

    Another reason why, while AI might be a fun toy, no one who is serious about getting work done will touch it with a dirty barge pole. The gratuitous hallucinations alone ought to be a sufficient deterrent.

    • zerofk@lemmy.zip
      link
      fedilink
      arrow-up
      6
      ·
      edit-2
      3 days ago

      So far, I’d give qwen the prize for most artistic impression of a clock.

      Kimi K2 appears to consistently get it right.

      • zerofk@lemmy.zip
        link
        fedilink
        arrow-up
        4
        ·
        3 days ago

        And just as I typed that, Kimi made one where 9 and 10, and 11 and 12 overlapped.

    • TechLich@lemmy.world
      link
      fedilink
      arrow-up
      10
      ·
      3 days ago

      No JavaScript I think, it’s just html and CSS. The initial time is provided in the prompt every minute according to the description. I wonder if they’d be any better if they could use js. Probably not.

  • Bazell@lemmy.zip
    link
    fedilink
    arrow-up
    5
    ·
    3 days ago

    Well, KIMI K2 seems to have created the working one. Others failed. I suppose that this model was optimized for this while others not.

    • SolarBoy@slrpnk.net
      link
      fedilink
      English
      arrow-up
      9
      ·
      3 days ago

      The clocks change every minute. I’ve seen some from deepseek and qwen that looked ok. But kimi seems to be the most consistent