Global Switch Day. February 1st 2025.
Spread the word.
X to Mastodon, Instagram to Pixelfed, WhatsApp to Signal, Facebook to Friendica, YouTube to PeerTube, TikTok to Loops
@hierarchon a measure of time?
@makeworld how did you come across this?
@futurebird @krozruch I think it's cause the trolls end up flocking to toxic instances that allow for more harmful behavior / content and get largely fediblocked without others having to see it.
@BestGirlGrace I hope it's cosmetic gene therapy just so things can get really exciting.
@lvk ooooo, that's pretty cool actually. Ty for the link
@nasser I missed a good one just a couple months ago too. Gotta get my own AV setup 🥲
Good news for anyone that wants to stalk me or make a deepfake of me: I've added all the YouTube videos I could find of my various talks to my website. If you know of others send them my way!
@indutny If you have 20 GB of RAM this model might be more representative of the capabilities: https://unsloth.ai/blog/deepseekr1-dynamic
@indutny I'm not sure, I think folks are excited by the prospect of an open source alternative to o1, in which case that'd just be the massive 600b model which IIRC is what powers the deepseek app. I found the distilled models to not be less useful than regular qwen2.5 for my use cases 😅 I think you could get it more useful with the right prompting and multi-shot approach. Maybe have it ask for more human guidance instead of looping.
@wffl Don't they get compiled down to just the raw data without the extra layers? Or am I trippin?
@indutny did you run the full R1 or one of the distilled models? R1 is like 600+ B params and the small models can't really compare to that since they're just qwen/llama but with some tuning to make them yap more.
Occult Enby that's making local-first software with peer to peer protocols, mesh networks, and the web.
Exploring what a local-first cyberspace might look like in my spare time.