**Mauve 👁💜** @mauve@mastodon.mauve.moe · Feb 14, 2024, 20:07

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Feb 14, 2024, 20:07

Mauve 👁💜 @mauve@mastodon.mauve.moe

Feb 14, 2024, 20:07

Tried getting a fully local multi-modal model to tell me what it seems in my logo and it's honestly mind blowing that it can identify anything at all. i used `ollama run llava` on my steam deck. Might be a useful tool to integrate with caption generators or for #blind folks wanting to get a description without needing an online service

**techsinger** @techsinger@mastodon.social · Feb 14, 2024, 21:29

**techsinger** @techsinger@mastodon.social · Feb 14, 2024, 21:29

Feb 14, 2024, 21:29

techsinger @techsinger@mastodon.social

@mauve Thanks for the report. If I may ask, how responsive is the model on the steamdeck, and which LLAVA is it?

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Feb 14, 2024, 23:03

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Feb 14, 2024, 23:03

Feb 14, 2024, 23:03

Mauve 👁💜 @mauve@mastodon.mauve.moe

@techsinger It's not super fast but it's fast enough. I think the main slowdown is loading the model into memory, and from there generation was about one word per second for me. I'm not sure which llava it is specifically, but it's probably a 7b one.

Docs on it are here: https://ollama.com/library/llava

**Mauve 👁💜** @mauve@mastodon.mauve.moe · 2024-02-14T23:06:24Z

Mauve 👁💜 @mauve@mastodon.mauve.moe

@techsinger I will say that out of the box it fully ate my system resources to do so and locked my music player and browser. :P I may experiment with giving it fewer CPU cores so I can have space for processing UI stuff

Feb 14, 2024, 23:06 · · Tusky · · ·

**techsinger** @techsinger@mastodon.social · Feb 14, 2024, 23:21

**techsinger** @techsinger@mastodon.social · Feb 14, 2024, 23:21

Feb 14, 2024, 23:21

techsinger @techsinger@mastodon.social

@mauve thanks for the info. I appreciate knowing how well it's handled.

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Feb 15, 2024, 00:43

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Feb 15, 2024, 00:43

Feb 15, 2024, 00:43

Mauve 👁💜 @mauve@mastodon.mauve.moe

@techsinger TBH I wouldn't be surprised if there was a faster option out there. I found that for text generation it was way slower than LM Studio for example. But that doesn't support multi modal stuff.

Resources

Developers

What is Mastodon?

mastodon.mauve.moe

More…