Skip to main content

On the use and abuse of Hairballs. What might graph-theory mean for cognitive neuroscience?

Olaf Sporns
Discovering the Human Connectome
Having just returned from the OHBM (Organization for Human Brain Mapping) it's not hard to be phenomenally ecstatic about the state of research and raring to apply new techniques and theories to my data.  

There is much to be hopeful for; I encountered many brilliant people who are excited about their research and how neuroscience is helping to unlock the secrets of the mind.


That said, there did seem to be a great deal of research that was following the coat-tails of last year's big ideas.  This year the "big thing" was graph theory - the branch of mathematics and computer science that was previously applied to other fields such as social network visualization and biology  by Olaf Sporns and applied to brain networks.  

Graph theory works on the notion that you can reduce a matrix (normally a correlation matrix) to a set of nodes (which represent specific brain regions) and edges (which represent either physical or functional connections between the nodes).  Once this is done, you can then analyze the resulting graph to get metrics about each of the nodes in order to discern something about the relative significance of a location and perhaps get an idea of how much information is flowing through it relative to other nodes.  

People tend to get output that resembles this...


From Insights into multimodal imaging classification of ADHD by  Colby et al., 2012
These images are pretty cool, but also a bit difficult to parse.  Less flattering descriptors are "hairballs", "fuzzballs" etc.  The clarity of the image depends almost entirely on the researcher picking a threshold that reduces the graph to a restricted set of nodes and edges that do not overwhelm the viewer.  Needless to say, this can be a bit of an arbitrary exercise - what threshold is a "good" threshold - 0.2, 0.3?  Your guess is as good as mine.  Probably a cynic would argue that it's the threshold that makes the picture look good.  

It's easy to see how the new fascination with graph theory is a natural expression of brain connectivity - which has been the emerging topic of interest for the past few years.  For a while now, researchers have noted that the brain is generally not modular.  That is to say, when you stick someone in a scanner and ask them to perform a demanding cognitive task, their entire brain is active to a certain extent.  One can then talk about functional connectivity - which areas are connected to a critical region across time while we ask you to perform a task.   The areas that are functionally connected are said to be "networks", and as it turns out, these networks can be visualized with graphs.  

So, how do we visualize networks without graph theory?  Generally, people would display their images on high resolution t-1 images (see below)


A now famous image of the default mode by Michael D. Fox  and Michael Greicius 
The advantage of the above mode of display is that the activity is situated in anatomy - one can look at the representation of the brain and be spatially oriented, it also meshes with the previous generation of research which would display discrete activations in a similar fashion.  

By contrast, the graphs displayed below show a representation of the brain's networks across the decades in spring-loaded graphs that are unconstrained by anatomy.  It is here that I think that graphs are truly useful - the ability to describe network dynamics and change is one of their peculiar strengths.  They become even more useful if one allows the edges to encode information such as strength of connection by either colour or line breadth   Nodes can similarly be coded by colour; a group of nodes that are highly interconnected with each other but not much else could be thought of as a "module" and uniformly coloured.  Similarly, nodes that connect to many other regions and thus are critical conveyors of information might be coded by size - larger nodes are both visually and figuratively more important.  


Fair, Damien A. et al., (2009). "Functional Brain Networks Develop from a 'Local to Distributed' Organization"\
Despite the obvious usefulness of graph-theory for comparing network dynamics, many people shy away from viewing graphs which are completely unconstrained in two-dimensions, which is why some researchers have made an effort to place nodes in approximate anatomical space.


Martijn P. van den Heuvel and Olaf Sporns Rich-Club Organization of the Human Connectome
The above representation not only respects anatomy, but also goes beyond traditional functional connectivity displays by allowing the viewer to assess relative strength and importance of areas to the emergent networks.  

"But hang on a minute", you might be thinking, "I can get all this information from a correlation matrix - no need for the pretty pictures" - and so far, you would be right...


Tal et al., Caffeine-induced global reductions in resting-state BOLD connectivity reflect widespread decreases in MEG connectivity

The above graph gives you all the information you might need to know about functional connectivity - each column and row is a node, the colour of a cell represents the strength of any two connections with each other.  So far, we've duplicated the information contained in the graph, which is hardly surprising given that graphs take matrices (including correlation matrices as input).  

But do we gain anything beyond ease of readability and interpretation by using graphs?  What can they tell us that correlation matrices cannot?  

The answer of course is yes of course they do.  Graphs can provide us with a series of metrics that tell us more about the flow of information than a correlation matrix can on its own.  

We can calculate the degree of a node - which is to say how many connections it has.  We can also look at the distance between two nodes, and calculate the shortest path or the most efficient path between them - which is to say, we can look at the probabilistic flow of information.  Closely related is the concept of betweeness which indicates how often a node acts as a bridge along a path between other nodes etc...

There are other metrics, and this is where I feel that the literature is still catching up.  We have a marvelous new tool available to us; it makes visualizing the networks of the brain feel almost intuitive.  But the new language of graph theory (at least to many neuroscientists) is something we need to figure out.  How important is betweeness-centrality?  Should we report both that and relative degree?  How do we interpret distance?  How do any of these metrics map onto what we know about cognition?  

Graph theory is a wonderful advance for the field of behavioural neuroscience, but as with many new advances, I get the feeling that many are seduced by the pretty pictures without spending the requisite time understanding the metrics behind them, and without the metrics, graph-theory offers little beyond displaying a more aesthetic version of a correlation matrix. 

*Update*
Since writing this post I've come across this paper by Fornito et al., on Schizophrenia and connectonomics (released in 2012).  From a cursory overview, the paper looks like it starts to address not only the process of making graph theory transparent to the reader, but also makes an earnest attempt to incorporate graph metrics to help classify normal vs. schizotypal brains.  I'll be reviewing this in the near future so stay tuned...

Comments

  1. You might take a look at BioFabric (www.BioFabric.org) as another way of visualizing networks that avoids the hairball by representing nodes as lines instead of as points. Quick demo: http://www.biofabric.org/gallery/pages/SuperQuickBioFabric.html

    ReplyDelete
  2. Very interesting Bill - I like that this method cuts down on visual clutter and certainly looks visually appealing - thanks for sharing this. Another attempt at this was created a while back by Martin Krzywinski who has advocated for using "hive-plots" http://www.hiveplot.net/ - though you've probably seen this already.

    A preliminary concern I have with BioFabric is whether or not the strength of connections can be represented in a similar way to typical weighted graphs. For example, in the van den Heuvel & Sporns figure I inserted above, nodes vary in size to give graphical weight to network metrics (in this case it represents the degree or number of connections of the node). That said, looking at the figures on your site again, I guess these sorts of metrics are tied to *position* with nodes with greater degree being located in the top-left quadrant, and less important nodes being located in the bottom-right.

    This method would appear to be excellent for visualizing smaller networks (or zooming in on section of larger networks), but does still seem a bit unwieldy at the large scale (but then what method isn't) - I had difficulty intuitively grasping at a glance how all the nodes related to one another, but the method excels when focusing at the node level.

    Perhaps some combination of visualizations would be most useful? Classic graphs/correlation matrices for larger-scale networks to visualize the overall pattern, and BioFabric in-depth exploration of details which can be obscured at the former level of remove? Just a thought - I'd be interested in your opinion.

    Thanks again for sharing!

    ReplyDelete
  3. John, I'm glad you found it interesting; thanks! I'm sorry I haven't replied earlier, but I've been traveling.

    It's still an open question for me as to the best way to present metrics such as edge weights or node properties, though one thing that comes for free (when using the shadow links feature) is node degree, since the length of the "node zones" along the diagonal are directly proportional to degree, and the typically prominent edge wedge of high-degree nodes makes them stand out naturally. For other metrics, decorating the node zone labels in some fashion might work well. As for edge weights, I have an example up in the BioFabric gallery (http://www.biofabric.org/gallery/index.html#Minerals) which suggests that displaying edge weights via thickness should be workable. In that example, I represent the magnitude of the relationship by showing multiple edges, where the edge count is proportional to the base 2 log of the edge weight. Extending the software to create wide links should not be too difficult to achieve.

    It's also true that I like the approach that I use in my other project, BioTapestry (www.BioTapestry.org), which uses a model hierarchy to allow the user to create a set of submodels that can be used to break out and highlight important features and metrics of the network within the context of the whole network. That feature will be an important future enhancement of BioFabric.

    But your mentioning of position as a possible solution is also spot-on. Placing important nodes at the top of the network, or in some other meaningful order, can convey lots of useful information. And I'm convinced that the complete freedom in placing links allows the user to provide meaningful link groupings is a feature that will be an essential tool for gaining insights into a network. As the World Bank network I discussed on my blog (e.g. http://biofabric.blogspot.com/2013/06/this-only-is-wedge-craft-i-have-used.html) demonstrates, being able to craft meaningful "edge wedge" shapes, which arises from the ability to precisely specify node and edge orders, also seems to prove very useful.

    I agree, to some extent, with your observation that BioFabric can be unwieldy at the global scale, and completely concur that "what method isn't". After all, if you are looking at 100,000 edges (to pick a small size), you have to make some compromises. BioFabric addresses the problem by producing basically a linear, sequential presentation of very large networks, and people have been organizing information in that fashion for a long time: encyclopedias, long novels, musical scores, timelines, etc. So there are metaphors that could perhaps be applied to navigating networks, such as e.g. chapters. About the only metaphor I can think of for navigating a hairball is a machete.

    Finally, it's interesting that you find that BioFabric works best with smaller networks, since I created it as a way to be able to show useful network structure at the global scale for the large, intractable hairball networks. And when a large network with many nodes is very sparse, adjacency matrices don't necessarily do well either. But I do think that being able to move between the different representations would be valuable. Between the three representations, I see BioFabric as the midpoint between the traditional pointNode-link diagram and adjacency matrices.

    Thanks for your comments!

    Bill

    ReplyDelete

Post a Comment

Popular posts from this blog

Video games allow for multitasking? A review of "Improving multi-tasking ability through action videogames"

W ould you believe that playing video games could make you more likely to succeed as a pilot? There's been quite a bit of colloquial evidence on multitasking and task-switching floating around the internet recently, thanks in no small part to work by Ophir et al., (2009) who showed that people who are habitual media-multitaskers (i.e. those annoying people who text while watching a movie or even worse while driving) tend to perform worse on a wide variety of cognitive measures of attention.  They concluded that people who are chronic media multitaskers have a "leaky filter", and as such cannot block out irrelevant (and distracting) information.  This agrees nicely other theories of attention, and particularly inhibitory theory (Hasher & Zacks, Michael C. Anderson, Gazzaley). One reason that multitasking has come under such scrutiny is that it is now encouraged and even expected in many professions and by many employers.  Even in situations where multitasking...

Dopamine helps the brain change gears: A review of "Dopamine-supports coupling of attention-related networks."

Before we begin, some background information... It seems that networks and the idea of connectivity is on everybody's minds these days (pardon the pun).  It's a fairly safe bet to say that nearly everyone affiliated with neuroscience has more than a passing familiarity with the default mode and dorsal attention networks (i.e. see Raichle et al., 2001).  These networks are anticorrelated which is to say that when one is being more heavily accessed, the activity in the other is suppressed.  Broadly, the default mode is active at rest, during mind wandering and when thinking about the autobiographical past - it has been described as being related to internally directed attention.  The dorsal attention or task positive network on the other hand is engaged when attention needs to be directed outwards at a cognitively demanding task or situation.  From Dang et al., 2012 What about the frontoparietal cognitive control network?  This network was discove...