Open-R1: a fully open reproduction of DeepSeek-R1 (huggingface.co)
Posted by Destide@feddit.uk to Programming@programming.dev · English · 1 year ago · 9 comments
TomasEkeli@programming.dev · 1 year ago
honestly both 7b and 8b are pretty dumb as well.
MadhuGururajan@programming.dev · English · 1 year ago
we could add so much deterministic code at 1.5GB that would start religions…
3B is probably also pretty dumb
True