[P] Visualizing LM’s Architecture and data flow with Q subspace projection

[P] Visualizing LM's Architecture and data flow with Q subspace projection

Hey guys, I did something hella entertaining. With some black magic and vodoo I was able to extract pretty cool images that are like an MRI from the model. I’m not stating anything, I have some hypothesis about it… It is mostly because it is just so pretty and mind bogging.

I stumbled up a way to visualize LM’s structure of structure structures in a 3D volume.

Here is the Gist Link with a speed run of the idea.

Some images:

y3i12/Prisma (my research model)

Qwen/Qwen3.5-0.8B

HuggingFaceTB/SmolLM-360M

RWKV/rwkv-4-430m-pile

state-spaces/mamba-370m-hf

At the present moment I’m looking for a place where I can upload the interactive HTML. If you know of something, let me know that I’ll link them. It is very much a lot mesmerizing to keep looking at them at different angles.

The mediator surface that comes out of this is also pretty interesting:

https://preview.redd.it/zbbvba1m9mqg1.png?width=749&format=png&auto=webp&s=48f2a44273bdba30176b89d8057c0e9880cb9401

I wonder if this one of many possible interpretations of “loss landscape”.

submitted by /u/y3i12
[link] [comments]

Liked Liked