HNNewShowAskJobs
Built with Tanstack Start
Torrenting from a Meta-owned corporate laptop doesn't feel right(wired.com)
23 points by hdjjhhvvhga 4 months ago | 11 comments
  • catlikesshrimp4 months ago

    Clickbait. Real title:

    "Meta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal"

    Which is "old news"

    Disable javascript to read or: https://archive.ph/Kp29q

    • jazzyjackson4 months ago |parent

      Title of the article vs a salacious quote (newly revealed in court documents) doesn’t reach the threshold of clickbait IMO, Meta engineers really were torrenting libgen with the CEOs approval (this is the part that’s not old news)

      Previously it was just “books3 was part of the training data”, now it’s “MZ was made aware of pirated materials, gave the go ahead, and by way of torrenting Meta engineers redistributed the copyrighted materials, which is outlawed whether or not you’re training a super intelligence.

      Personally I’m not a fan of enforcing copyright law in general, but I’m especially not a fan of corporations getting to skirt laws that the little people are made to obey. If Meta wants to train on libgen, they should have partnered with internet archive and provided them better lawyers.

      • throw59594 months ago |parent

        You don't know whether they seeded anything. Or anything recognizably copyrightable.

        • jazzyjackson4 months ago |parent

          Fair, when someone says torrenting I assume bidirectionality, but they may have blocked outgoing packets in order to comply with some interpretation of the law.

          Given that llama was originally “leaked” via torrent, I have this assumption that meta folks are Pirates in spirit tho, and wouldn’t leech without being told explicitly, but then, being told not to upload would be legally perilous too since it would hint that they are aware of the illegality. Meta’s defense here seems to be “Officer I swear I didn’t know that wasn’t allowed”, testing the legal theory of transforming copyrighted work.

        • atkailash4 months ago |parent

          [dead]

    • insane_dreamer4 months ago |parent

      The real title is more damming than the "click-bait" title.

      > old news

      Yeah, but the unsealed docs in the court case proving this to be true, aren't old news.

  • ilrwbwrkhv4 months ago

    I thought it is common knowledge that every single AI company trained on pirated datasets.

    • magic_smoke_ee4 months ago |parent

      I suspect MAANG + OpenAI will simply pay lobbyists to make it legal. They have huge piles of money and money makes the law in America.

    • hdjjhhvvhga4 months ago |parent

      Common knowledge - yes, but not proved in court.

  • ErikBjare4 months ago

    I wonder if copyright holders would be okay if Meta had a legitimately acquired a copy of each copyrighted work they trained on. (not saying the issue ends there)

  • metalman4 months ago

    getting to watch someone steal a cake and eat it too