The Magic of Azure Functions
One of the more interesting aspects of building solutions from different Azure components is that there are often multiple ways of solving the same problem. Take Event Hubs vs. Azure Functions vs. Event Grid: all three can sit at the heart of an event-driven pipeline, taking in data, processing it according to a set of rules, and spitting out a result. However, the more I compare the results from each of these options, the more I believe Azure Functions represent a truly revolutionary way of solving these kinds of problems.
What makes Azure Functions (AFs) so great? First, AFs are (theoretically) infinitely scalable when called asynchronously. A process can spin up as many AF instances as desired; once completed, the AFs return their data and shut down. You are charged simply for the number of calls you make and the compute each call consumes (memory multiplied by execution time). The second reason AFs are so great is that they are...well...cheap! The cost of processing is like nothing I have ever seen. Hundreds of thousands of records can be processed for fractions of a penny. In fact, the cost of a database feeding AFs often far exceeds the cost of the AFs themselves.
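To make that billing model concrete, here is a back-of-the-envelope cost estimator. The per-execution and per-GB-second rates below are illustrative placeholders I am assuming for the sketch, not published prices; substitute the current consumption-plan rates for a real estimate.

```python
# Back-of-the-envelope cost model for consumption-plan function calls.
# NOTE: these rates are illustrative placeholders, not real prices.
PRICE_PER_MILLION_EXECUTIONS = 0.20   # assumed $ per 1M calls
PRICE_PER_GB_SECOND = 0.000016        # assumed $ per GB-second

def estimate_cost(executions, memory_gb, seconds_per_call):
    """Estimate the charge for a batch of function executions."""
    execution_cost = executions / 1_000_000 * PRICE_PER_MILLION_EXECUTIONS
    compute_cost = executions * memory_gb * seconds_per_call * PRICE_PER_GB_SECOND
    return execution_cost + compute_cost

# 500,000 records, one record per call, 128 MB used for 100 ms each:
print(f"${estimate_cost(500_000, 0.125, 0.1):.4f}")
```

Even before any monthly free grant is subtracted, half a million calls at these assumed rates lands around twenty cents, which is why the database feeding the functions usually dominates the bill.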
So why are AFs so inexpensive? My theory is that Microsoft charges less for AFs because they consume compute that is either older hardware nearing the end of its life (after all, Microsoft deploys pre-packaged containers of compute and storage that age and need to be refreshed), or "opportunistic" compute running on infrastructure with excess capacity that isn't easily monetized. Think of it as "scrap compute". Whatever the reason, AFs represent incredibly inexpensive compute that developers can leverage to build extremely inexpensive computational systems. As long as you can live with the time needed to spin up a function asynchronously (sometimes a second or more, the so-called cold start), stay within the memory constraints of a single function call (currently 1.5 GB), and tolerate variable response times (e.g., noisy-neighbor issues), you can be rewarded with the lowest-cost server compute infrastructure available on the market today.
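Those variable response times are worth planning for in client code. A minimal sketch of one common mitigation, retry with exponential backoff, where a plain Python callable stands in for the remote function invocation (`call_with_retries` and `flaky` are hypothetical names for this illustration, not an Azure SDK API):

```python
import time

def call_with_retries(fn, *args, retries=3, backoff=0.5):
    """Call fn, retrying on failure with exponential backoff.

    A generic client-side guard against cold starts and noisy-neighbor
    slowdowns; fn stands in for an HTTP call to a function endpoint.
    """
    delay = backoff
    for attempt in range(retries):
        try:
            return fn(*args)
        except Exception:
            if attempt == retries - 1:
                raise                  # out of attempts, surface the error
            time.sleep(delay)          # give the platform time to warm up
            delay *= 2                 # back off further each round

# Example: a flaky stand-in that fails twice, then succeeds.
state = {"calls": 0}
def flaky(x):
    state["calls"] += 1
    if state["calls"] < 3:
        raise RuntimeError("transient timeout")
    return x * 2

print(call_with_retries(flaky, 21, backoff=0.01))  # prints 42
```

The same wrapper works unchanged whether the slow call is a cold start, a congested host, or an ordinary network hiccup.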
Even better, the Azure Functions 2.0 runtime now runs on .NET Core and broadens your language options. Since the 2.0 runtime supports Python 3.6, this raises an interesting question: is it cheaper to run data science calculations in Azure Functions, or on more traditional Hadoop-esque Big Data infrastructure like HDInsight or Azure Databricks/Spark clusters? My next project will be to benchmark these options, but if the pattern holds from other benchmarks I have done, my guess is Azure Functions will win the day on price/performance. And this is in spite of the fact that HDInsight/Databricks likely run on GPU-optimized servers. No doubt each of those servers is much faster at processing data, but there is simply no substitute for distributing your job across numerous generic, ultra-low-cost servers.
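The distribution idea above can be sketched locally: split the job into small chunks, fan them out across many cheap workers, then aggregate. Here a thread pool merely stands in for parallel function instances, and `process_chunk` is a placeholder workload, not a real data science routine.

```python
from concurrent.futures import ThreadPoolExecutor

def process_chunk(chunk):
    """Stand-in for one function invocation: score a slice of records."""
    return sum(x * x for x in chunk)  # placeholder "data science" work

def fan_out(records, chunk_size=1000, workers=32):
    """Split the job into chunks and map them across many small workers."""
    chunks = [records[i:i + chunk_size]
              for i in range(0, len(records), chunk_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = pool.map(process_chunk, chunks)  # fan-out
    return sum(partials)                            # fan-in / aggregate

print(fan_out(list(range(10_000))))
```

In a real deployment, each chunk would become one asynchronous function invocation (via HTTP or a queue trigger) rather than a thread, but the fan-out/fan-in shape is the same.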
It will be an interesting bake-off!