@fredy_pferdi Oh that's great to know TY. I'll look into it. Is this going to use Vulkan for the GPU acceleration? I wasn't sure what my options would be since Ollama seems to only support Cuda and Metal

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 18:49

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 18:49

Mar 21, 2024, 18:49

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve It also supports AMD ROCm the equivalent to cuda

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 18:06

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 18:06

Mar 21, 2024, 18:06

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi Interesting I may be able to get it running without a container too. https://github.com/rocm-arch/rocm-arch

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 18:58

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 18:58

Mar 21, 2024, 18:58

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve Personally recommend strongly to not install the ROCm drivers on your device but using them in a container instead, they are not that stabile on those chips and it can lead to your device crashing. Also officially only like an LTS Ubunut and Cent OS and a couple of GPU's are supported.

Container on the other hand is one command (use the amd rcom version further info here:
https://hub.docker.com/r/ollama/ollama
https://ollama.com/blog/amd-preview
There is no substantial performance lose using a container

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:27

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:27

Mar 21, 2024, 20:27

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi Sweet just followed this guide to install it in my ubuntu distrobox container and it's working great :o

https://www.reddit.com/r/steamdeck_linux/comments/102hzav/guide_how_to_install_rocm_for_gpu_julia/

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:29

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:29

Mar 21, 2024, 20:29

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi Spoke too soon, ollama dies when I try to load the model. Will need to mess with it another day :) TY again for the tip.

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:33

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:33

Mar 21, 2024, 20:33

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve Distrobox is just an interface interface for Podman i think just running the already made images or building them yourself is way easier then to recreate the install manually with Distrobox.

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:36

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:36

Mar 21, 2024, 20:36

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve
first allow podman to use GPU
`sudo setsebool -P container_use_devices=true`

and then just run this to start the container
`podman create --name=ollama --security-opt seccomp=unconfined --device /dev/dri --device /dev/kfd -e HSA_OVERRIDE_GFX_VERSION=10.3.0 -e HCC_AMDGPU_TARGETS=gfx1035 -e OLLAMA_DEBUG=1 -v .ollama:/root/.ollama:U,rw -p 11434:11434 -i --tty --restart unless-stopped docker.io/ollama/ollama:rocm`

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:40

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:40

Mar 21, 2024, 20:40

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve with the Ryzen 7600u you may have to use -e HSA_OVERRIDE_GFX_VERSION=11.0.0 instead of -e HSA_OVERRIDE_GFX_VERSION=10.3.0 and make sure that you allocated enough RAM to the graphics card in the BIOS of your #GPD WIn 4 (in the advanced option)

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:53

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:53

Mar 21, 2024, 20:53

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi Cool TY, I found the HSA_OVERRIDE online and it ended up working great in my ubuntu container. 😁 Wish I had this for last night's demo! Also I don't have nearly enough RAM on this thing with 16 GB. TT_TT

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:53 *

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:53 *

Mar 21, 2024, 20:53 *

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve for testing you could allocate 8gb that should be enough to run a small model while still using the os.

Screenshot of the option to allocate more vram:
https://www.reddit.com/r/gpdwin/comments/yfivv5/anyone_know_what_do_these_options_mean_in_gpd_win/

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:55

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:55

Mar 21, 2024, 20:55

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi Yeah the issue is my Matrix client ends up eating way too much RAM and then I start eating swap. Might also have a memory leak somewhere in my OS wasting RAM after going out of sleep mode

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:56

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:56

Mar 21, 2024, 20:56

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve Yeah there are suspend issues with those GPD devices.

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:59

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:59

Mar 21, 2024, 20:59

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi Alas! It's still worth it for me to not have to use Windows or a regular laptop :P

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 16:11

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 16:11

Mar 21, 2024, 16:11

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve Is this a #GPD Win 4?

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 17:43

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 17:43

Mar 21, 2024, 17:43

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi Yeah exactly! I'm running #ChimeraOS on it in desktop mode. Lately been thinking of just installing Manjaro on it instead since the steam bits are a bit janky for me.

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 18:52

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 18:52

Mar 21, 2024, 18:52

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve I'm using the GPD Win Max 6800u with 32GB RAM 16GB of it allocated to VRAM and im running 13b models with reasonable speeds and 7b quit quickly. Recommend #Fedora and #Ollama in as #Podman container

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:46 *

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:46 *

Mar 21, 2024, 20:46 *

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve how do you do the speech to text ?

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:55

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Mar 21, 2024, 20:55

Mar 21, 2024, 20:55

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi It hooks into your entire OS. I use https://github.com/ideasman42/nerd-dictation with a custom script to make it easier to codehttps://github.com/RangerMauve/mauve-dictation

Since my OS is a steam OS derivative I needed to be fancy and install it in userspace: https://github.com/atcq/steam-dictation

Then I have a global shortcut to toggle it on/off

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:56

**Profile13115** @fredy_pferdi@social.linux.pizza · Mar 21, 2024, 20:56

Mar 21, 2024, 20:56

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve uiii that sounds interesting using immutable distro to and was not able to get it to work.

**Salvatore Zappalà** @salvozappa@fosstodon.org · Mar 22, 2024, 14:43

**Salvatore Zappalà** @salvozappa@fosstodon.org · Mar 22, 2024, 14:43

Mar 22, 2024, 14:43

Salvatore Zappalà @salvozappa@fosstodon.org

@mauve Well presented and thanks for sharing. I was looking exactly for something like this.

Resources

Developers

What is Mastodon?

mastodon.mauve.moe

More…