**Mauve 👁💜** @mauve@mastodon.mauve.moe · 2026-06-22T17:14:50Z

Mauve 👁💜 @mauve@mastodon.mauve.moe

You think we could teach human brains about embedding spaces by turning embedding vectors into audio and giving people tools to navigate the sound and the meanings at the same time?

Jun 22, 2026, 17:14 · · Web · · ·

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 17:23

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 17:23

Jun 22, 2026, 17:23

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve Was this written on a psychedelic or am I lacking some knowledge to understand this?

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 17:25

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 17:25

Jun 22, 2026, 17:25

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi Fully sober! What seems confusing? By embedding vectors/spaces I mean the ones in machine learning contexts. https://en.wikipedia.org/wiki/Embedding_(machine_learning)

E.g. inside an autoencoder.

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 17:33 *

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 17:33 *

Jun 22, 2026, 17:33 *

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve Ahh yeah i was thinking about this as the fourier space representation in quantum physics and really asked myself whats meant by this /why/how/for what

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 17:36

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 17:36

Jun 22, 2026, 17:36

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi Oh! No more like, if you train a model with its own internal representation of concepts, could a human learn a similar structure by listening to the embedding vectors translated to something like intensities of different harmonies. e.g. given a 1d embedding, each float would be the intensity of a specific frequency.

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 17:49

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 17:49

Jun 22, 2026, 17:49

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve What is meant by a similar structure in that case? This is to abstract for me to grasp TBH.

I understand the vector embedngs of an LLM as representation of human written social concepts.
In the end every word written down by a human is already an abstracted representation of the world (some so fare from reality not even worth a single thought) in some way and that gets further abstracted one layer by putting this abstract generalized data into vector embedding.

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 17:51

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 17:51

Jun 22, 2026, 17:51

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi I was imagining that after listening to the audio representations of embedding vectors for a while a person could get a grasp of the "vibe" of some data just from the sound. Then they could get information out of embeddings in a similar way to a machine trying to do the usual cosine distance between two vectors. Like, what if we could navigate our timeline with sounds before we bother converting them to speech or actually reading the text.

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 17:56 *

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 17:56 *

Jun 22, 2026, 17:56 *

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve Ahh interesting, any #neuroscience or #sociology ppl here having an explanation if we could get a "vibe" of a social construct based on abstract audio? I have strong doubts and I still do not really understand how to replace the "vibe" aspect with a more scientific description but there might be something to it.

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 18:04

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 18:04

Jun 22, 2026, 18:04

Mauve 👁💜 @mauve@mastodon.mauve.moe

@fredy_pferdi cc @jonny :P

**jonny (nonvenomous)** @jonny@neuromatch.social · Jun 22, 2026, 18:25

**jonny (nonvenomous)** @jonny@neuromatch.social · Jun 22, 2026, 18:25

Jun 22, 2026, 18:25

jonny (nonvenomous) @jonny@neuromatch.social

@mauve
@fredy_pferdi
Very interesting question. Audio representations are usually useful when the signal contains information encoded as relative positions that can be perceived as structure in time. Encoding embedding activations as intensities of frequencies would be encoding "vibe" of data as timbre, but thats challenging because timbres are the consequence of physical law and are not generic, like we can tell woodwinds from brass but we can't perceive differences in arbitrary spectral structure as well without training. However auditory prostheses that e.g. encode visual information as sound or even as patterned electrical stimulation on the tongue or skin can be learned, and there could be some interesting mapping between the activations and a spectrum that could make their structure perceivable.

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 18:29 *

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 18:29 *

Jun 22, 2026, 18:29 *

Profile13115 @fredy_pferdi@social.linux.pizza

@jonny @mauve One can also be creative with frequencies, amplitude and resonances (witch are also connected to social concepts and feelings (wich are not purely social concepts but also a more biological question)). So there would be something to explore on the experimental artistic front here too.

**jonny (nonvenomous)** @jonny@neuromatch.social · Jun 22, 2026, 18:30

**jonny (nonvenomous)** @jonny@neuromatch.social · Jun 22, 2026, 18:30

Jun 22, 2026, 18:30

jonny (nonvenomous) @jonny@neuromatch.social

@fredy_pferdi @mauve exactly, ya, "try it and see" would be my general attitude

**jonny (nonvenomous)** @jonny@neuromatch.social · Jun 22, 2026, 18:36

**jonny (nonvenomous)** @jonny@neuromatch.social · Jun 22, 2026, 18:36

Jun 22, 2026, 18:36

jonny (nonvenomous) @jonny@neuromatch.social

@fredy_pferdi @mauve e.g. check out https://www.seeingwithsound.com/webvoice/webvoice.htm from @seeingwithsound

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 18:40

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 22, 2026, 18:40

Jun 22, 2026, 18:40

Mauve 👁💜 @mauve@mastodon.mauve.moe

@jonny @fredy_pferdi @seeingwithsound Oh yeah I tried using that recently! It was cool to read some blog posts of fully blind people using it to look around at buildings and descriving what they percieved

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 18:17 *

**Profile13115** @fredy_pferdi@social.linux.pizza · Jun 22, 2026, 18:17 *

Jun 22, 2026, 18:17 *

Profile13115 @fredy_pferdi@social.linux.pizza

@mauve Now it clicked form me! Why i was so confused on this is because of the #quantummind #orchor hypothesis by the #epstin related #stuarthameroff and #rogerpenrose

https://jmail.world/search?q=Stuart+Hameroff
https://jmail.world/search?q=Roger+Penrose

**Ⓥincentpants** @vincentpants@treehouse.systems · Jun 22, 2026, 19:54

**Ⓥincentpants** @vincentpants@treehouse.systems · Jun 22, 2026, 19:54

Jun 22, 2026, 19:54

Ⓥincentpants @vincentpants@treehouse.systems

@mauve you should check out https://allosphere.ucsb.edu/. Did a lot of data vis/aur development for it in my 20s. Was such a great idea!

Resources

Developers

What is Mastodon?

mastodon.mauve.moe

More…