Data science and deployment
Credits - Scott Adams (Dilbert.com)

Data science and deployment

Back in the old days when I was a Mainframe programmer, when I used to code, I always knew how the end user would see the product. Over the years, programmers have mostly known and understood how their effort would translate into a product and how that product would be in the hands of the end user.

Enter #DataScience, the current crop of data scientists, talk mostly of jupyter-notebooks, python in an IDE and create models, which start on a laptop and stay there. These models generate results, which are then packed off to the requesting clients/customers. How long will this continue?

Can we call data scientists as programmers? After all, all that the customer needs is a working product. In addition, the definition of a product in the software industry has for long remained a working piece of software which provides results to aid the business and make it flourish. Hence, I do think that data scientists are programmers in a different dimension. In addition, that dimension is quickly maturing, with its own set of processes starting to set in. Data gathering must follow a strict process (a.k.a GDPR and regulations alike), Models that are created must not have an inherent bias, which they could have picked up from the underlying data (a loan default prediction program mostly indicating a default for a particular precinct), and results must be reproducible in the hands of the customer.

Finally, the critical component of making that software reach the user is the deployment. How long can data scientists remain aloof of the deployment conditions that they must adhere. We must start to emphasize and inculcate these in the recruitment processes in every data driven decision organization. Granted that AWS and Azure takes the headache out of creating services for your models and then you can embed those services in products, but we as programmers must surely know more in depth about that and care to uncover that last piece of layering that converts our models into a usable product.

The time to think of data science as a standalone cog is over. It must integrate with the processes that comes together with a software services product driven mentality.

#MLdeployment #Datascience #Datascienceproducts

Dilbert ultimate source of inspiration Suzie ? - even after all these years 😀 How you been doing man ?

Like
Reply

To view or add a comment, sign in

More articles by Sujith S.

  • Inspiration

    Many a times, we hear stories of people around us who cross insurmountable odds to finally realize their passion and do…

    6 Comments
  • Deployment using light weight git server

    In the world of deployment of platforms, this one is a bit different In this post, I will be talking about my…

  • Non-deterministic programming

    Coding for machine learning. How does it differ from the code that I have always done, COBOL, Java or setting up rules…

    2 Comments

Others also viewed

Explore content categories