RSS Bot@lemmy.bestiver.seMB to Hacker News@lemmy.bestiver.seEnglish · 3 days agoA minimal PyTorch implementation for training your own small LLM from scratchgithub.comexternal-linkmessage-square1fedilinkarrow-up16arrow-down10file-text
arrow-up16arrow-down1external-linkA minimal PyTorch implementation for training your own small LLM from scratchgithub.comRSS Bot@lemmy.bestiver.seMB to Hacker News@lemmy.bestiver.seEnglish · 3 days agomessage-square1fedilinkfile-text
minus-squareiiilinkfedilinkEnglisharrow-up1·3 days agoThat’s probably the easiest to read attention and GPT implementation I’ve seen. Congrats to the author.
That’s probably the easiest to read attention and GPT implementation I’ve seen. Congrats to the author.