Data Mining Your Body

Machine learning and data science will do more to improve healthcare than all the biological sciences combined.

This blog is about how we're going to gather that health data, mine it and improve the lives of billions.

This blog is also a search for the best data scientists and programmers in the world, who want to join Human Longevity Inc. and work on the most epic challenge – extending the healthy human lifespan. (See bottom of blog for details).

What's the Big Idea?

Your genome consists of approximately 3.2 billion base pairs (your DNA) that literally code for "you."

Your genes code for what diseases you might get, whether you are good at math or music, how good your memory is, what you look like, what you sound like, how you feel, how long you'll likely live, and more.

This means that if we can decipher this genomic "code," we can predict your biological future and proactively work to anticipate and improve your health.

It's a data problem, and if you are a data scientist or machine-learning expert, it is the most challenging, interesting and important problem you could ever try to tackle.

In the simplest terms, sequencing a genome means turning DNA into a series of four letters that looks like this:

ACAAGATGCCATTGTCCCCCGGCCTCCTGCTG
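
To make the four-letter idea concrete, here is a minimal Python sketch that treats the example sequence above as plain text and tallies its letters. The function names `base_counts` and `gc_content` are illustrative choices for this post, not part of any HLI tooling.

```python
from collections import Counter

# The example sequence shown above, stored as an ordinary string.
sequence = "ACAAGATGCCATTGTCCCCCGGCCTCCTGCTG"


def base_counts(seq):
    """Count how often each of the four nucleotide letters appears."""
    counts = Counter(seq)
    return {base: counts.get(base, 0) for base in "ACGT"}


def gc_content(seq):
    """Fraction of letters that are G or C, a common summary statistic."""
    return (seq.count("G") + seq.count("C")) / len(seq)


counts = base_counts(sequence)
print(len(sequence), counts, gc_content(sequence))
```

Once DNA is text, ordinary string and counting tools apply to it, which is what makes this a data problem.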

Each person's genome produces a text file that is about 300 gigabytes.

When we compare your sequenced genome with millions of other people's genomes AND other health data sets (see below), we can use machine learning and data mining techniques to correlate certain traits (eye color, what your face looks like) or diseases (Alzheimer's, Huntington's) to factors in the data and begin to develop diagnostics/therapies around them.
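
As a rough illustration of what "correlating a trait to a factor in the data" can mean, the sketch below builds a 2x2 table from a handful of invented records and computes an odds ratio. Everything here is an assumption for illustration: the records are made up, the names `contingency` and `odds_ratio` are hypothetical, and a real study would need far more samples, significance testing and correction for confounders.

```python
# Hypothetical toy records, invented for illustration only:
# each tuple is (carries_variant, has_trait).
records = [
    (True, True), (True, True), (True, True), (True, False),
    (False, True), (False, False), (False, False), (False, False),
]


def contingency(rows):
    """Return the 2x2 counts (a, b, c, d):
    a = variant and trait, b = variant and no trait,
    c = no variant and trait, d = no variant and no trait."""
    a = b = c = d = 0
    for has_variant, has_trait in rows:
        if has_variant and has_trait:
            a += 1
        elif has_variant:
            b += 1
        elif has_trait:
            c += 1
        else:
            d += 1
    return a, b, c, d


def odds_ratio(a, b, c, d):
    """Odds ratio with 0.5 added to every cell, a common correction
    that avoids division by zero when a cell is empty."""
    return ((a + 0.5) * (d + 0.5)) / ((b + 0.5) * (c + 0.5))


table = contingency(records)
ratio = odds_ratio(*table)
print(table, round(ratio, 2))
```

An odds ratio well above 1 hints that the variant and the trait tend to appear together; with millions of genomes, the same counting idea is repeated across a very large number of candidate factors.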

It's a Translation Problem, Like Google Translate

HLI is creating an "integrated health record" for everyone entering its database. The data sets created will include the following:

  • Genomic: The 3.2 billion nucleotides from your mother, and the 3.2 billion nucleotides from your father.
  • Microbiome: The genome of the 100 trillion+ microorganisms living in our bodies. There are 10 times as many microbial cells as human cells, and their effects on our bodies are enormous and massively understudied.
  • Imaging/MRI: High resolution detailed imagery of our brain, organs and body.
  • Metabolome: The 2,300 small molecule chemicals in your bloodstream.
  • Physiological Health Data: All of the data we can collect on ourselves. Our vital signs, blood glucose levels, microRNAs in the bloodstream, heart rate, VO2…

Translating between all of this data and your health outcome is, metaphorically, similar to how Google Translate works.

Google Translate (GT) uses a process called statistical machine translation, which means that GT generates translations based on patterns found in large amounts of written text.

Rather than attempt to teach the computer every rule of every language, this approach lets the computer discover the rules for itself based on statistically significant patterns in the data.

Once it finds these patterns (patterns that are unlikely to occur by chance), it can use this "model" to translate similar text in the future.
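
The pattern-finding idea can be shown in miniature. The sketch below is a deliberately tiny stand-in, not how Google Translate is actually built: it counts which words co-occur across a four-sentence parallel corpus and keeps the most frequent pairing as its "model." The corpus and the names `learn_word_table` and `translate` are invented for illustration, and real statistical translation also handles alignment, word order and fluency.

```python
from collections import Counter, defaultdict

# Hypothetical English/German parallel corpus, invented for illustration.
parallel_corpus = [
    ("the house", "das haus"),
    ("the book", "das buch"),
    ("a book", "ein buch"),
    ("a house", "ein haus"),
]


def learn_word_table(corpus):
    """For each source word, keep the target word it co-occurs with most often."""
    cooccur = defaultdict(Counter)
    for source, target in corpus:
        for s_word in source.split():
            for t_word in target.split():
                cooccur[s_word][t_word] += 1
    return {s_word: counts.most_common(1)[0][0]
            for s_word, counts in cooccur.items()}


def translate(sentence, table):
    """Translate word by word, leaving unknown words unchanged."""
    return " ".join(table.get(word, word) for word in sentence.split())


table = learn_word_table(parallel_corpus)
print(translate("the book", table))
```

No grammar rule was written down anywhere; the pairing of "the" with "das" falls out of the counts alone, which is the same spirit as letting patterns in health data surface on their own.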

With millions and millions of documents/websites/publications online that were already translated, and a crowd of 500 million users to correct and "teach" the algorithm, GT can quickly and accurately translate between 90 different languages.

Our challenge now is applying similar techniques to all of these genomic and integrated health records… and we found the perfect person to lead this effort: Franz Och – the man responsible for building Google Translate.

Meet Franz Och, HLI's Chief Data Scientist

Franz is a renowned expert in machine learning and machine translation.

He spent 10 years at Google as a Distinguished Research Scientist and the Chief Architect of Google Translate, literally building the system from the ground up.

Now, Franz is Human Longevity Inc.'s Chief Data Scientist, responsible for developing new computational methods to translate between all of the human biological information.

… and he's building one of the most impressive teams I've seen.

When you ask Franz why he's so excited about HLI, his answer is twofold: the mission and the challenge.

Franz explains, "The big thing is the mission -- the ability to affect humanity in a positive way. If you are a data scientist, why focus on making a better messaging app or better Internet advertising, when you could be advancing the understanding of disease to make sick people better and of aging to make people live longer, healthier lives?"

As far as the challenge, he goes on: "The big mission is to learn how to interpret the human genome -- to be able to predict anything that can be predicted from the source code that runs us."

HLI is Looking for World-class Talent

HLI is looking for the following:

  1. Very strong software engineers who can build reliable, high-quality, high-performance software
  2. Machine learning experts
  3. Statistics experts

People with these areas of expertise often don’t know much about biology and health, and that is okay. We need the outside perspectives to help us tackle this problem in new ways.

At the same time, we are also looking for people with the relevant biology knowledge (genomics, immunology) who are excited to approach their domain in a new way as a massive data and machine-learning problem.

If you think you have what it takes, I encourage you to check out the open positions here and apply.

The Machine Learning Team is based out of Mountain View, CA; the genomics team is based in La Jolla.

Here is the link again: http://www.humanlongevity.com/careers/open-positions/

If you know of anyone who you believe fits the description above, please share this blog with them and help us move humanity forward.

Join Me

This is also the sort of conversation we have at my 250-person executive mastermind group, Abundance 360. The program is highly selective and has ~97% of the spots filled. You can apply here. BTW, my Abundance 360 members are the first individuals to benefit from HLI, and have signed up early to have their genomes sequenced and their health data uploaded.

Share this email with your friends, especially if they are interested in any of the areas outlined above.

P.S. Every weekend I send out a "Tech Blog" like this one. If you want to sign up, go to PeterDiamandis.com and sign up for this and my Abundance blogs.

P.P.S. Please forward this to your best clients, colleagues and friends — especially those who could use some encouragement as they pursue big, bold dreams.

don't have to see any job posting to be excited about working there!

The human body is the ultimate big data system. I seldom encounter a big data problem that does not have an application to the human body or brain function. Our progress in these fields will have wide-ranging applications, but unfortunately very few specialists have a broad enough education to enable such cross-fertilization. We need fewer statisticians and mathematicians on Wall Street and more in applied sciences, starting with medical sciences.

Machine learning is just a math tool for managing data; it can't be efficient at discovering factors affecting longevity. For example, many publications and studies indicate that a lower GGT level in the blood relates to longevity and reduces the risk of all diseases. Clinical data relate directly to health improvement outcomes, and there is no need for machine learning, as it is only a math tool, like statistics. http://www.healtheiron.com/ggt-science-library
