Microsoft’s GitHub next month plans to begin using customer interaction data – “specifically inputs, outputs, code snippets, and associated context” – to train its AI models.

  • albert_inkman@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    43 minutes ago

    The real issue here isn’t just about “poisoning” their data. It’s that people don’t actually know how their contributions get scraped and repurposed.

    I’m working on something called The Zeitgeist Experiment that maps public opinion by having people respond to questions via email, then using AI to rank responses and synthesize key ideas. The goal is transparency about how AI processes human input—showing people what actually gets used, not hiding it in some TOS.

    GitHub’s new policy will make things worse. Users will be even less aware their code is going into models they never agreed to train on. The default should be opt-in, not opt-out after the fact.

  • NuXCOM_90Percent@lemmy.zip
    link
    fedilink
    English
    arrow-up
    1
    ·
    45 minutes ago

    For no apparent reason:

    Are there any good alternatives for gh-pages dor a super lazy/simple website? I’ve been meaning to actually use one of my domains for a personal website and pointing at which project is on which code repo site would be a good idea. But… I need that page to be hosted by one of them.

  • mhague@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    2 hours ago

    The code locker’s revised policy applies to Copilot Free, Pro, and Pro+ customers, as of April 24. Copilot Business and Copilot Enterprise users are exempt thanks to the terms of their contracts. Students and teachers who access Copilot will also be spared.

    All of the people in this thread are mad because they use slop code generation and now their slop is being used to train the slop generators.

    If they can take an entire repo because a contribution was tainted, that’s wrong. But otherwise I don’t care because it’s normal to use usage metrics to improve software and most importantly I don’t use AI so I don’t have anything for them to take.

  • Lanske@lemmy.world
    link
    fedilink
    English
    arrow-up
    22
    ·
    4 hours ago

    'We don’t know how to write code, so we will steal yours via our sloppy AI"

  • Alaknár@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    32
    ·
    6 hours ago

    I’m glad they did this because it finally gave me the push to move all my stuff to Codeberg.