cantankerous_cashew@lemmy.world to Technology@lemmy.worldEnglish · 13 hours agoMeta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Revealwww.wired.comexternal-linkmessage-square28fedilinkarrow-up1276arrow-down15cross-posted to: technology@lemmy.world
arrow-up1271arrow-down1external-linkMeta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Revealwww.wired.comcantankerous_cashew@lemmy.world to Technology@lemmy.worldEnglish · 13 hours agomessage-square28fedilinkcross-posted to: technology@lemmy.world
minus-squareCriticalMiss@lemmy.worldlinkfedilinkEnglisharrow-up13·12 hours agoEarlier reports suggested they trained it on books from Bibliotik. What changed?
minus-squareBetaDoggo_@lemmy.worldlinkfedilinkEnglisharrow-up3·5 hours agoThe llama-1 paper acknowledged the use of the books dataset, libgen isn’t mentioned in any of the papers so this is new info.
minus-squarehalcyoncmdr@lemmy.worldlinkfedilinkEnglisharrow-up20·12 hours agoProbably just both honestly.
Earlier reports suggested they trained it on books from Bibliotik.
What changed?
The llama-1 paper acknowledged the use of the books dataset, libgen isn’t mentioned in any of the papers so this is new info.
Probably just both honestly.