cross-posted from: https://lemmy.ca/post/61948688

Excerpt:

“Even within the coding, it’s not working well,” said Smiley. “I’ll give you an example. Code can look right and pass the unit tests and still be wrong. The way you measure that is typically in benchmark tests. So a lot of these companies haven’t engaged in a proper feedback loop to see what the impact of AI coding is on the outcomes they care about. Lines of code, number of [pull requests], these are liabilities. These are not measures of engineering excellence.”

Measures of engineering excellence, said Smiley, include metrics like deployment frequency, lead time to production, change failure rate, mean time to restore, and incident severity. And we need a new set of metrics, he insists, to measure how AI affects engineering performance.

“We don’t know what those are yet,” he said.

One metric that might be helpful, he said, is measuring tokens burned to get to an approved pull request – a formally accepted change in software. That’s the kind of thing that needs to be assessed to determine whether AI helps an organization’s engineering practice.

To underscore the consequences of not having that kind of data, Smiley pointed to a recent attempt to rewrite SQLite in Rust using AI.

“It passed all the unit tests, the shape of the code looks right,” he said. It’s 3.7x more lines of code that performs 2,000 times worse than the actual SQLite. Two thousand times worse for a database is a non-viable product. It’s a dumpster fire. Throw it away. All that money you spent on it is worthless."

All the optimism about using AI for coding, Smiley argues, comes from measuring the wrong things.

“Coding works if you measure lines of code and pull requests,” he said. “Coding does not work if you measure quality and team performance. There’s no evidence to suggest that that’s moving in a positive direction.”

  • Leon@pawb.social
    link
    fedilink
    English
    arrow-up
    19
    ·
    6 hours ago

    I saw a vibe coded PR the other day. So much redundant code, lots of comments making assumptions and questions. It’s a mess.

    Glad it didn’t land in my lap but the person who is now responsible for steering that up is already quite busy and wasting their time with this feels shit.

    • thebestaquaman@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      4 hours ago

      One of the worst things about this is that the person vibe coding just ends up shitting on the reviewers time. Like… you couldn’t even bother to write a real PR, and now you want me to spend time filtering your shit? Fuck off.

      • Leon@pawb.social
        link
        fedilink
        English
        arrow-up
        12
        ·
        6 hours ago

        Too many people are willingly paying anti-democratic billionaires to outsource their thinking and agency.

        • org@lemmy.org
          link
          fedilink
          English
          arrow-up
          3
          ·
          6 hours ago

          Too many people know their job is only going to last 6 months before the next round of layoffs, and that talent and hard work has never been the way to keep a job in the tech industry… so why try?

          • Leon@pawb.social
            link
            fedilink
            English
            arrow-up
            8
            ·
            6 hours ago

            Not really a valid excuse in this case as we aren’t really experiencing layoffs here. Au contraire, our company is hiring. I’m not in the U.S.

            Still think that letting language models controlled by billionaire paedophiles and wannabe dictators is a poor idea, regardless of how fed up one is with one’s job.

            • org@lemmy.org
              link
              fedilink
              English
              arrow-up
              2
              ·
              5 hours ago

              Where is “here?”

              And, if you want to bring pedophiles into it, most of what you touch on a daily basis involved a billionaire pedophile at some point. You just sound lazy at this point.

              • Encrypt-Keeper@lemmy.world
                link
                fedilink
                English
                arrow-up
                6
                ·
                edit-2
                4 hours ago

                Did you just call them lazy after making the argument that building talent and hard work is not worth doing in the modern tech landscape?

                • org@lemmy.org
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  4 hours ago

                  The argument is lazy to blanket everything in “pedophile” instead of actually talking about the issue.

              • Leon@pawb.social
                link
                fedilink
                English
                arrow-up
                1
                ·
                4 hours ago

                Oh absolutely, and we can do our best to swear off of that but thanks to them worming their way in like a cancer in every part of society, shaping it to benefit them, that’s just the nature of taking part in society. The ones in power have always, and will always continue to exploit us for as long as we let them.

                All the more reason to not outsource our thinking to their machines. Governments are already doing it, getting caught red-handed acting on reports that never existed. Why rely on that when the option not to is so readily available?

                • org@lemmy.org
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  4 hours ago

                  Ehhh… this sounds more like blanket ai-hate and less about you actually caring. You’re already in their cloud. I doubt you run bare metal. You probably use GitHub. Etc. caring on one hand and not on the other means nothing.

                  I’ll continue farming out bullshit tasks to AI while I play with my cat and prepare for the next round of layoffs, rather than giving my soul to a company who doesn’t actually care about me.

                  • Leon@pawb.social
                    link
                    fedilink
                    English
                    arrow-up
                    1
                    ·
                    3 hours ago

                    I actually really love machine learning. I trained my own language models back in the late 201Xs, and at my previous company I worked on image classification for a smaller photography platform they were developing. I’m not an ML expert, but it’s easy to see the bullshit that the “AI” companies are selling as bullshit when you know the foundations of the tech. A little like how you don’t need to be a surgeon to call bullshit on someone saying that they performed open heart surgery and brain surgery on themselves simultaneously.

                    You’re already in their cloud. Yes, unfortunately. There was a time when I was a lot more naïve and way less critical. People change. This is what “radicalised” me. I know that a story like that is just daily life in the U.S., but despite my cynicism I thought things were better here.

                    You probably use GitHub. I don’t. I self-host everything that I can, and make careful choices with what I choose not to. I’ve left what I had on GitHub on there, and I’ll probably use it as a mirror whenever I release a FOSS project, because I’m OK with Microsoft paying the hosting costs for me, if they’re going to try and scrape my shit anyway. You know, just like GNOME is doing.

                    caring on one hand and not on the other means nothing.

                    I don’t really agree. This is the same idiotic take as “if you hate capitalism so much, why are you partaking in it?” You’re also speaking directly through your sphincter as you’ve no idea what my life is like, what choices I’ve made, and so forth; you don’t know me.

                    Would I prefer to never have to engage with a payment processor or a bank again? Absolutely. That however, is sadly impossible in my society. LLMs aren’t integral to society yet, and I’d see them continue that way.

                    I’ll continue farming out bullshit tasks to AI while I play with my cat and prepare for the next round of layoffs, rather than giving my soul to a company who doesn’t actually care about me.

                    Not a fan of the LLM part, but I love the overall sentiment. Fuck the corporations, your time and energy is better spent on the people you love. I hope things work out for you.

                    ~Animals are people too. Lots of love to your cat.~

                    ~I love cats.~