nVidia is doubling down on AI workloads with the Turing GPU architecture

nVidia is doubling down on AI workloads with the Turing GPU architecture

Yesterday at SIGGRAPH 2018, nVidia announced their next generation GPU architecture called Turing. Here is my preliminary analysis of the architecture.

Besides the usual improvement in graphics performance and capabilities, the thing to marvel at, if you are working with AI, is that they are dedicating a large portion of the die for machine learning workloads with their Tensor cores.

What is amazing is that a generic GPU architecture can almost catch up with the latest in dedicated TPUs or Tensor Processing Units like the Google TPUv2 with 125 TFLOPS vs Google’s 180 TFLOPS at FP16 precision.

Furthermore, they have added an INT4 format for very-low precision inference workloads but with half a Peta OPS of performance. An increasing amount of machine learning models will cope just fine with very-low precision for ‘good enough’ accuracy and will on the other hand gain tremendously in performance.

A top end Turing GPU with 48 GB of VRAM will likely set you back $10K but the equivalent CPU-based horsepower will cost you one or two orders of magnitude more.

It will not take long before the likes of SuperMicro with announce servers with 1-2 CPU sockets and 8 Turing GPUs for unheard of performance for the buck.

Imagining real-time analytics on massive amounts of data or very large-scale deep learning models being processed on a cluster of these compute nodes is mind blowing and ushers in a new dawn for artificial intelligence.

John Fabienke is the founder of arqitekta, an enterprise architecture consulting company specializing in infrastructure strategy and design, big data and AI.


To view or add a comment, sign in

More articles by John Vindahl Fabienke

  • BCM vs APO: The Strategic Showdown Reshaping IT Landscapes

    Discover how Business Capability Management and Application Portfolio Optimization are transforming enterprise…

  • The Enterprise Architecture ”Big Five”

    Building Better Architectures Aligned to Business Needs Many organizations struggle to build IT architectures that are…

  • Mastering the Art of API Design

    As the world becomes more connected, the importance of well-designed and well-documented APIs cannot be overstated…

    1 Comment
  • Rust: The Language That is Taking Over the World of Systems Programming

    Rust is a modern systems programming language that has rapidly gained popularity among developers in recent years. It…

  • Who needs 192 cores and 64 TB RAM?!

    IBM recently announced their Power System E980, which is crowning achievement of the new generation of servers based on…

    1 Comment
  • Intel's new AI features

    At its Data-Centric Innovation Summit in Santa Clara, Intel laid out the roadmap for the Cooper Lake and Ice Lake…

  • Coming out from the skunkworks…

    Last year I announced that I had quit my otherwise extensive career with CSC (now DXC) to pursuit a new idea and that…

    2 Comments
  • A New Chapter...

    After more than two decades in various roles with CSC I have chosen to move on. Looking back I have only gratitude to a…

    21 Comments

Explore content categories