I think in the next couple years OS-shipped will replace the use of heavy cloud based . Microsoft, Google, and soon Apple will be shipping devices with local LLMs and it'll be cheaper for applications to target those APIs rather than pay OpenAI or the such. This will also mean that we'll get into a sort of "browser wars" of model functionality gated by hardware vendors.

I don't think cloud AI will fully go away but I think it'll make less and less sense for consumer facing use cases as the small models become more viable via better training and better hardware acceleration.

For example, Chrome is working on shipping web APIs for LLM access. I'm planning to release something similar in @agregore in the next week or two.

github.com/explainers-by-googl

@mauve Brave already supports custom ollama endpoints already.

Quite cool.

@agregore

Follow

@hermeticvm @agregore Oh snap. Is their api stable? Have you tried it out?

@mauve all you need is a local ollama instance which is pretty much compatible to ChatGPT.

@agregore

@hermeticvm @agregore Ohhh I see. This is for the built in LLM UI they have. I am working on JavaScript APIs for web apps to have access to.

@mauve I see. Let's hope for a good standard. I agree with your take that we'll see more local LLM stuff. Especially for latency and privacy reasons. @agregore

Sign in to participate in the conversation
Mauvestodon

Escape ship from centralized social media run by Mauve.