Graphs are to Multi-Cellular Organisms as Databases are to Bacteria

Helena Deus, PhD

Published Dec 2, 2022

I’ve been reading The Song of the Cell and, like all other books by Siddhartha Mukherjee, it is wonderful!

At the same time, I’ve been thinking about the slides that I’m supposed to prepare on how to build a Biomedical knowledge graph (kgraph). As I carried these two ideas in my head, one clicked with the other and got me to wonder: “Are graphs like multi-cellular organisms that support and collaborate with each other to create a larger self?”.

In other words, even thought each kgraph is an individual unit—the kgraph at Uniprot or the kgraph at pathway commons (BioPAX)—which is only consistent and sustainable when it’s complete and passes the quality and completeness checks from its designers, it has some inter-dependencies with other kgraphs (e.g CheBI, Ensembl, HGNC). Without them, it cannot thrive. Similarly, other graphs have inter-dependencies with them.

The medium that these graphs co-inhabit, or the “inter-cellular matrix,” is the Web. As long as it’s possible to do a secure transaction of data over the web, these graphs can exchange “data” via the SPARQL SERVICE keyword (ie. federated querying) .

Recommended by LinkedIn

Filament count in activated sludge

Ahmad Omar 10 years ago

How To Calculate Microbial Diversity? Easy Step by…

Naeem Mahmood Ashraf (PhD) 7 months ago

The Discovery Series: Powered by myBaits®

Daicel Arbor Biosciences 2 months ago

No alt text provided for this image — Thank you Uniprot for always working when I need you to!!

Note: Yes, I know—those among you who have tried SPARQL federation only to notice in frustration that the remote endpoint is down and your query failed due to time-out—I hear you. But hey, it’s just like in real life! If you were a cell and needed some nutrients from a neighboring cell in order to survive, you would be dependent on that cells’ willingness to help you and share nutrients with you. And she’d only do that if sharing nutrients were mutually beneficial, such as it is in a multi-cellular organism.

A relational database, on the other hand, is like a bacteria. Self-sufficient, ruthless (because she has to), pursues her own nutrients from her surroundings, does not depend on anyone else and once it’s done it leaves her waste chemical byproducts behind.

Like a bacteria, a relational database is a self-contained, self-sufficient unit: we engineers build them for a purpose, they are meant to perform some very important tasks for a set of business users, and that purpose is what gives them energy (i.e funding) to survive (i.e. hire other engineers to maintain it and grow it) and thrive (i.e. create backup copies and evolve into something new and more adapted to future needs).

All of this to say: To decide whether to collect our data in a relational database or a knowledge graph, we can ask ourselves “Is what I am trying to accomplish like a small individual and self-supporting bacteria or is it a larger multi-cellular organism where the individual cells have inter-dependencies such that they can grow into a much larger, must more robust and future proof organism?" If the answer is "the latter" then a knowledge graph is the way to go.

Just a passing thought I had. Would love to hear what you think about this idea!

Milind Pol 3y

What if we compare knowledge graphs with noSQL databases. May be a nested json which could be nested under other json. FHIR is in a way designed with such ideology. Yes, On the other side, you may look at it as protein molecules which are used by cells to collaborate...

Christopher Larkin 3y

interesting Lena. I like the growing and building inter relationships part of the analogy. Hope you are well.

1 Reaction

Gina Donato 3y

Interesting analogy, Helena Deus, that I will use (with attribution, of course) to explain the difference to colleagues!! Really great.

1 Reaction

Miguel C. 3y

Interesting analogy. A key component of building a multicellular system is cooperation (positive and negative). So the question is how to quantify if two graphs interact positively or negatively? In evolutionary terms, when two cells have co-dependencies they optimize individual function and support each other to survive, possibly giving rise to multicellularity (network). But cheaters will always occurs and get a "free-ride". If cheaters grow too much in a population, the whole network collapses.

Graphs are to Multi-Cellular Organisms as Databases are to Bacteria

Helena Deus, PhD

Recommended by LinkedIn

More articles by Helena Deus, PhD

Others also viewed

Can we predict what gene grives organisms will do in nature? Here's how models help

GTDB: Bridging the Gap Between Genomics and Taxonomy for Microbes 🧬

Primitive multicellular clusters used metabolically driven flows for exponential growth

Synergy and biosynthesis: History and potential applications of Lichens

BioCommons quarterly digest

🌱 Genetic Variation in Plants: Mutations, Polymorphisms, and Diversity 🌱

The Role of HapMap Files in GWAS: Unlocking Genetic Potential for Crop Improvement

Biotechnology #18 Project to Sequence the Genomes of Every Complex Species

AI for Root Phenotyping: Distinguishing Axial and Lateral Roots

CLASSIFICATION OF BACTERIA

Explore content categories

Recommended by LinkedIn

More articles by Helena Deus, PhD

The Computer Is Trying to Help You. Are You Listening?

Time & Autonomous Agents

Tests & Metrics - how good is my chatbot at evaluating drug discovery hypothesis?

Decision Modeling in Drug Discovery

Neurosymbolic AI: Knowledge Graphs for systems with bounded rationality

The "That's Funny..." Moment: How Accidental Connections Lead to New Worlds (and What AI Can Learn)

Toward Ontologies of Biomedical Research (Part 2)

Momentum for Foundational Language Models in Pharma

Linking health care and life sciences at the Knowledge Graph Conference

Toward Ontologies of Biomedical Research (Part 1)

Others also viewed

Can we predict what gene grives organisms will do in nature? Here's how models help

GTDB: Bridging the Gap Between Genomics and Taxonomy for Microbes 🧬

Primitive multicellular clusters used metabolically driven flows for exponential growth

Synergy and biosynthesis: History and potential applications of Lichens

BioCommons quarterly digest

🌱 Genetic Variation in Plants: Mutations, Polymorphisms, and Diversity 🌱

The Role of HapMap Files in GWAS: Unlocking Genetic Potential for Crop Improvement

Biotechnology #18 Project to Sequence the Genomes of Every Complex Species

AI for Root Phenotyping: Distinguishing Axial and Lateral Roots

CLASSIFICATION OF BACTERIA

Explore content categories