it is so extremely weird to me that so many systems that are explicitly graphs (semweb/linked data, graphql, etc.) don't do the obvious, convenient, and well defined thing of making the interface a freaking *graph*

@jonny this might be a surprise given my graph evangelism but I actually don't like graphs as an interface. In principle they could be useful but in practice I usually find they just become a mess for anything even mildly complicated. There are some tricks if you have extra structure (like bundled connections), but I've never seen anything really convincing.

@neuralreckoning thinking about that over here: social.coop/@jonny/11043226541

Anything can be hyper complex! but we develop tools to deal with that complexity, show it, hide it, etc. that I also have not seen done convincingly with graphs. To be clear i'm not saying 'make everything the typical force-directed layout of balls and sticks' without much order, but making tools to manipulate and view graphs in lots of different ways - as a tree! comparing two subgraphs! etc.

@jonny yes absolutely! It's on my to do list (in the section of things I'll probably never get around to) to play around with better ways of interacting with graphs.

@neuralreckoning i have the blessing and the curse of having set myself up to need to do this pretty imminently lmao

@neuralreckoning tangent/not-tangent, i have felt the same way about our conceptual vocabulary for graphlike things in neuroscience, and specifically for dynamical phenomena over graphs (which is what the brain does) in the same way that we have conceptual vocabulary for much simpler phenomena like 'inhibition' or 'neuromodulation' and so on. Like there *isn't* a good way to think and talk about the configuration of activity through time, but that's all we have to talk about.

@jonny agreed. Dynamics plus graphs is just incredibly mind bending. No idea how to even get started on that, but agree it could be important.

@jonny it's actually a live question for me. We're trying to think about the dynamics of specialisation, namely how modules could specialise on different things at different times. Basically impossible to visualise in even quite simple cases.

@neuralreckoning this is one of the cases where the extremely fucked state of our communication mediums poses a real and tangible limitation to our ability to understand things. Because we have to reduce everything down to a static two-dimensional representation to fit in PDF, we can't do things like "look there is no 2d representation of this, you have to interact with it and explore it and see it change over time." TO SAY NOTHING of how we d=2 with t-SNE and PCA regardless of dimensionality

@jonny yes absolutely. I am 100% convinced it needs to be a dynamic representation not a static 2d thing. Have tried a few things but nothing convincing yet.

@neuralreckoning @jonny Are your adventures documented anywhere? This has been on my mind (ha ha) a loy lately too. Mostly thinking 3d spatial and using the third dimenaion for time with layers interlinked in stacks. Then moving it around to see the connections better with your hands

@mauve @neuralreckoning @jonny While not directly (I think) what you're speaking of, the topic seems to overlap with a persistent problem I have yet to find a solution for:

How can I digitize, classify, tag/organize 100+ years' worth of family #Geneology - correspondences/images/media in such a way that it can be traversed and explored. The dataset is SO rich and encompasses several lifetimes.

I have yet to find a model that can capture, let alone allow complex exploration and discovery.

A *general* solution for my particular problem could be applicable to many other long-lived and intertwining conversations. Any ideas?

@david @neuralreckoning @jonny Hmm, I was thinking about this over the weekend. What sort of stuff do you wish to explore through it?

IMO shoving everything into postgres with the proper relational links between data and exploring it via queries might get you pretty far. I've been afflicted with programmer-brain so I'm not sure if it's a reasonable suggestion.

@mauve @neuralreckoning @jonny

Hey, I appreciate the response. I've been struggling with a "proper model" for many years, and it is stopping me from beginning digitization.

The goal would be to answer questions such as "what was person X like", "what were the milestones of importance to them", "visualize 150 years of semi-overlapping parallel lifelines". "map the main places Z spent their life"

In the Chat*** world, you could imagine eventually being able to just ask questions of the corpus, and this will probably end up there, the way things are moving. A future-proofed model could anticipate an "auto-summary/classification" step.

Hand classifying dimensions/tagging feels impractical ("this letter's topics include X, Y, Z, and mentions A, B, C" - yikes!)

More dimensions will be discovered during the digitization process (oh! here's a book that X published, shoot, now I have to adjust my schema - again).

As I say, I have other projects like this - based in the physical world, rich in shared connections that are only discovered post-ingestion via traversing the data. If not captured, then will never be captured.

------

This is wordy, but there's a lot of meat here to get to a useful endpoint: help make sense of a big pile of related material. The richness suggests an immersive UX.
Follow

@david @neuralreckoning @jonny I see. 🤔 It sounds like rather than exploring and visualizing data the biggest problem is how to ingest the data in the first place. Sadly that's way harder (especially if your data isn't already structured) and outside of my personal realm of expertise. Might be worth it to look at some of the SemanticWeb stuff the Chat*** plugin ecosystem has though.

· · Web · 1 · 0 · 0

@mauve @david @neuralreckoning
oh boy i have thoughts, lemme reply from my neuromatchstodon account so i can have more than 500 characters @jonny@neuromatch.social

@mauve @david @neuralreckoning

Genealogy is a graph with people as nodes and relationships as edges! This is actually one thing that semantic web/linked data technologies are already really good at, so for example see the wikidata project on Genealogy.

Genealogy actually is the most common thing in their list of examples of visualizing wikidata.

So for example if you go here entitree.com/en/family_tree/Q4 and keep clicking the "down" arrows, you can trace Confucius' family tree to the present day.

You can see how this works at the wikidata page - he has a child property, in addition to many others.

Accommodating growing/changing/incomplete/overlapping schemas is exactly what #SemanticWeb and #LinkedData technologies are good at - I explore that and the complex politics that surround these technologies, particularly corporate #KnowledgeGraphs in this piece: jon-e.net/surveillance-graphs/
specifically here and here though as I said it's one of the basic topics of the whole piece. That naturally contrasts with traditional relational databases, which, as u all are saying, can be brittle and difficult to refactor.

I also discuss the complex and fraught relationship between knowledge graphs and chat interfaces in that piece: jon-e.net/surveillance-graphs/
and spoiler alert automatically populated graphs are, uh, pretty bad, but KGs + #LLMs is the direction that the information conglomerates are headed in a much deeper and terrifying way than I have seen in the general discourse (that's why i wrote the piece in the first place i guess).

Interface design has been one of the major problems in the history of semweb/LD (in my opinion, next to the social systems and politics, it is the largest problem). I explore that history and the constraints a bit in a prior piece, here, and talk a bit about some potential roads forward here.

Ingestion is also a huge problem! and that is partially because most of the tooling has been designed for massive information conglomerates to guzzle down the semi-structured information on the web rather than for individual people to make sense of information they care about. So eg. existing tools like Neo4j or other graph databases are pretty dang tricky to get set up, but imo i'm pretty sure the target data format you're looking for is a graph of triples, and once that's in a serializable format like turtle or n-triples or whatever I'm pretty confident you won't "regret it" or that the effort will have been wasted. You might have to shop around a bit for the appropriate visualization/storage/query tech stack, but that's the neighborhood to look in. One example is wbstack which is intended to make hosting your own instance of wikidata (wikibase) more straightforward if you don't want to shove all your data in public onto wikidata.

Marking this public so i can search for it later because these questions come up frequently :)

#SurveillanceGraphs

@mauve @david @neuralreckoning
I went to find the link to the post where i was like 'how come we don't represent things that are graphs as graphs' but i guess that is the top of this very thread lmao

@jonny @mauve @neuralreckoning I will explore your links and really appreciate the input.

I just want to clarify that this is not *just* a genealogy problem for me. I'm dealing with artifacts, and those artifacts have verifiable provenance bits that need to be tacked onto each ingested artifact. The artifacts just happen to be genealogy related.

Think: a museum collection. Object that exist, were created, were collected, traded - have history. Now think of their movement into the digital world where there can be collaborative discussion about aspects of that object, translations, citations.

"Show me everything known of an object" "let me explore all those attributes as they relate to the museum's collections"

That sort of thing.

Thanks again. I'll dig into your research.

@david @mauve @neuralreckoning
absolutely :) browse around some wikidata pages and you'll also see heterogeneous data and provenance - that's sort of the name of the game with linked data. Wikidata is specifically very good with translation and source references. They are also (somewhat implicitly) good with tracing the evolution through time (same "view history" page as is present on wikipedia/mediawiki)

wikipedia/wikidata have a complicated relationship with surfacing the kind of dialogical process you're describing, even though they are of course built on it. Sorry for bombing you with more links but if it's useful i have written about that here: jon-e.net/infrastructure/#the-

but i couldn't agree more how important it is to represent as a first-class part of the data. You're also right on that modeling documents is definitely less well-defined than genealogical relationships (eg. saying above how tedious it would be to manually annotate all the topics in some writing, totally right), and wikidata doesn't necessarily provide the perfect set of tools for doing that, but yes I plan to work on that problem after I get a draft of a p2p backend for sharing and communicating and negotiating over these kinds of packetized-graphlike information blobs <3
#SemanticWeb #LinkedData #WikiData

@david @mauve @neuralreckoning
put another way, semweb/LD approach the problem very generally - rather than "a database for genealogy," thinking about data structures that allow arbitrarily relating information to other information. Genealogy just happens to be a very well-defined graph.

There are a lot of substrains of thought on what that should look like (should it be a perfectly verifiable, logically complete schema or something more freewheeling? it is more important to make global or local sense? and so on) but yes I think you are in the right place in exploring linked data and semweb tooling, even though as said above it is the interfaces for dealing with this kind of data that have historically suffered, with wikidata being I think the flagship 'public mass infrastructure' tooling at the moment

@jonny @mauve @neuralreckoning Your https://jon-e.net/surveillance-graphs/ article is just staggering in the density of information presented and the horrors it surfaces. Still reading..

@david
@mauve @neuralreckoning
thank you, im honored you're taking the time with it. feel free to share any thoughts as you read, make annotations on the page, etc. (or don't, also fine.) it's something better made sense of together I think because of yes the horrors and also yes the information density❤️

@david
@mauve @neuralreckoning
@joelchan86 add another mark on the counter until we have to do something about these #DiscourseGraphs together

Sign in to participate in the conversation
Mauvestodon

Escape ship from centralized social media run by Mauve.