Show newer

whether it’s true or not that “90% of everything is crap”, the tech giants are investing heavily in a hot new technology that promises to boost that figure up to an impressively reliable “five-nines” percentage

@skryking For me it was more that I can finally make this stuff work related and potentially find clients to pay me to mess with it. :P Sadly my hand pain makes computer touching less appealing off the clock.

@skryking Nice. I've been wanting to get into Rust for years but didn't have much of a use case. Now with the candle library from HuggingFace and my latest adventures with LLMs I've had an actual reason to write something in it. :) github.com/huggingface/candle/

@skryking This post by @simon is what exposed me to the idea for the first time: til.simonwillison.net/llms/pyt

I also have a slightly improved prompt here: gist.github.com/RangerMauve/19

I'll likely be publishing any new work as open source on Github. :) Probably with Rust.

feels kinda satisfying to prolapse this function to return an iterator instead of its heavy for loop.

@skryking it has less innate knowledge of facts but it is pretty good at "reasoning". I'm gonna teach it to make function calls and traverse datasets + summarize stuff. 😁

@skryking Nice. I only do CPU workloads. Try running phi 2 some time! It's super low in resurce usage. Particually the Q4 quantized models.

@laskov Oh yeah, I read their release but haven't used it yet. Was there anything specific they excelled at?

@skryking What have you been using to run the models? I find LM Studio really nice for tinkering. lmstudio.ai/

I find Q4 quantized models work pretty well on my steam deck.

Hmm, after testing the raw Phi 2 within LM Studio instead of the examples provided by HuggingFace candle, I think it's actually pretty decent after all.

Specifically I got phi 2 Q4_K_S gguf working from TheBloke.

Can't get that model running with candle since it can't seem to load the model weight.

Show thread

Having tested a bunch of projects, I gotta say that OpenHermes 2.5 is the most helpful out of the ones I can run locally.

I recently wasted a bunch of time getting Phi-2 to do some summarization work, and it just couldn't stay focused for more than a sentence or two.

Woot, I have finally written enough code to be unable to avoid the lifetime specifications / borrow checker stuff.

@beka_valentine RIP. What software are you using? I generally find OBS does everything I want.

I love this post-mortem from a former #p2p enthusiast... 🧵

"DHTs were not reliable or performant. We were way too optimistic about device discovery and NAT traversal."

He's absolutely right. If you're using a DHT, you're doing it wrong. It might have been the right primitive in 2003, but not today.

holy shit mozilla has a new issue tracker documenting all of the ways that apple, google, and microsoft purposefully put third party browsers at a disadvantage and is calling for action https://mozilla.github.io/platform-tilt/
Show older
Mauvestodon

Escape ship from centralized social media run by Mauve.