Model Selection for Optimization Under Uncertainty: Cutting Through the Noise with Representative Models

Mehrdad G.

Published Feb 14, 2017

The landscape of statistics has been shifting rapidly. Gone are the days when "inference" was the main player; the emphasis has turned towards learning and forecasting. If your goal is forecasting or optimizing decisions, a deep dive into inference is not always essential.

Here's an analogy: Imagine you're listening to a symphony with hundreds of instruments. But what if you could identify just a handful of those instruments that define the entire melody, allowing you to enjoy the essence of the music without the full orchestra?

Enter the two main branches of modern statistical learning:

Supervised Learning: Think of it as a tutor guiding a student. Here, the aim is to establish a relationship between given inputs and their outputs and then predict outputs for new, unseen inputs.
Unsupervised Learning: This is more like self-study. Without specific outputs, the objective is to unearth patterns and relationships among inputs. Popular techniques? PCA (principal component analysis) and clustering.
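To make the unsupervised side concrete, here is a minimal sketch of PCA computed from a singular value decomposition with NumPy. The function name `pca_project` and the toy data are assumptions made for illustration only; they do not come from the research described in this article.

```python
import numpy as np


def pca_project(X, n_components):
    """Project the rows of X onto their first principal components.

    Returns the projected scores and the fraction of total variance
    explained by each retained component.
    """
    X = np.asarray(X, dtype=float)
    centered = X - X.mean(axis=0)
    # Rows of vt are the principal directions, ordered by decreasing variance.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    explained = singular_values**2 / np.sum(singular_values**2)
    scores = centered @ vt[:n_components].T
    return scores, explained[:n_components]


# Toy data: two inputs that move almost in lockstep, so one component
# should capture nearly all of the variance.
rng = np.random.default_rng(0)
t = rng.normal(size=200)
X = np.column_stack([t, 2.0 * t + 0.01 * rng.normal(size=200)])
scores, ratio = pca_project(X, n_components=1)
print(scores.shape, ratio)
```

No outputs are supplied anywhere: the pattern (here, a single dominant direction) is recovered from the inputs alone.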

In operations research and optimization under uncertainty, decisions depend heavily on simulating a large number of computational models. With computational resources often at a premium, a pressing question arises: Can we pinpoint a handful of models that adequately represent the whole set?

This was the challenge I addressed during my research at Stanford. I crafted a methodology that uses unsupervised statistical learning to select the "representative models" needed for decision-making under uncertainty. The magic lies in clustering the computational models into segments and then choosing an ambassador from each segment. To make it work, 80% of the effort goes into choosing features that represent each model well for clustering-based selection.
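The cluster-then-pick-an-ambassador idea can be sketched in a few lines of NumPy. This is an illustrative sketch under stated assumptions, not the methodology from the paper: it uses plain k-means with a deterministic farthest-point start, takes the model closest to each centroid as the representative, and the names `farthest_point_init` and `select_representatives` are hypothetical.

```python
import numpy as np


def farthest_point_init(X, n_clusters):
    """Deterministic start: model 0, then repeatedly the model farthest from those chosen."""
    chosen = [0]
    min_dist = np.linalg.norm(X - X[0], axis=1)
    while len(chosen) < n_clusters:
        nxt = int(min_dist.argmax())
        chosen.append(nxt)
        min_dist = np.minimum(min_dist, np.linalg.norm(X - X[nxt], axis=1))
    return X[chosen].copy()


def select_representatives(features, n_clusters, n_iter=100):
    """Cluster models with k-means and return one representative index per cluster.

    features: (n_models, n_features) array, one row of features per model.
    Returns (representative_indices, cluster_labels).
    """
    X = np.asarray(features, dtype=float)
    centroids = farthest_point_init(X, n_clusters)
    for _ in range(n_iter):
        # Distance from every model to every centroid: (n_models, n_clusters).
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        updated = centroids.copy()
        for k in range(n_clusters):
            if np.any(labels == k):
                updated[k] = X[labels == k].mean(axis=0)
        if np.allclose(updated, centroids):
            break
        centroids = updated
    # Final assignment against the converged centroids.
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    labels = dists.argmin(axis=1)
    representatives = []
    for k in range(n_clusters):
        members = np.flatnonzero(labels == k)
        if members.size:
            # The "ambassador" is the actual model nearest the cluster centre.
            representatives.append(int(members[dists[members, k].argmin()]))
    return representatives, labels


# Toy ensemble: 60 models described by 2 made-up features, in 3 groups of 20.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
features = np.vstack([c + rng.normal(scale=0.3, size=(20, 2)) for c in centers])
reps, labels = select_representatives(features, n_clusters=3)
print(reps)
```

The representative is always a real model from the ensemble rather than the centroid itself, so it can be simulated directly. In practice the feature columns would likely need scaling first, since Euclidean distance is sensitive to units.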

But how do we discern the right features? For this, a novel statistical method was introduced to compare and evaluate various "representative subsets". The outcome? A roadmap to identify pivotal features for model clustering and selection.
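The article does not spell out the statistical method used to compare representative subsets, so the sketch below is only a stand-in to show what such a comparison could look like. It assumes each model has a single scalar response (for example, a simulated objective value) and scores a subset by the Kolmogorov-Smirnov distance between its response distribution and that of the full ensemble. The names `ks_distance` and `rank_subsets`, and the choice of metric, are assumptions and not the method from the paper.

```python
import numpy as np


def ks_distance(sample_a, sample_b):
    """Largest gap between the empirical CDFs of two 1-D samples."""
    a = np.sort(np.asarray(sample_a, dtype=float))
    b = np.sort(np.asarray(sample_b, dtype=float))
    # The largest gap between two step functions occurs at a data point.
    grid = np.concatenate([a, b])
    cdf_a = np.searchsorted(a, grid, side="right") / a.size
    cdf_b = np.searchsorted(b, grid, side="right") / b.size
    return float(np.abs(cdf_a - cdf_b).max())


def rank_subsets(responses, candidate_subsets):
    """Rank candidate subsets, best first, by closeness to the full ensemble.

    responses: one scalar response per model in the full ensemble.
    candidate_subsets: dict mapping a label to a list of model indices.
    Returns a list of (distance, label) pairs sorted by increasing distance.
    """
    responses = np.asarray(responses, dtype=float)
    scored = [
        (ks_distance(responses[list(indices)], responses), label)
        for label, indices in candidate_subsets.items()
    ]
    return sorted(scored)


# Hypothetical ensemble of 100 models with responses 0..99.
responses = np.arange(100.0)
candidates = {
    "spread": [5, 25, 45, 65, 85],  # e.g. chosen using informative features
    "low": [0, 1, 2, 3, 4],         # e.g. chosen using uninformative features
}
for distance, label in rank_subsets(responses, candidates):
    print(label, round(distance, 2))
```

A feature set whose clustering yields subsets with consistently small distances would, under this stand-in criterion, be a better basis for model selection.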

For the deep-divers, the intricacies of this methodology and its application to subsurface flow processes are described in the paper published in Computers & Geosciences.
