Collection of blogs
I started to review the blogs I have written in past few years. Unfortunately they get lost in all the data available out there. Yet many of them are still very much interesting and have good content. So here is a collection that I picked. Some of them are selected as they are important to me personally and others because purely of the content they have.
English Blogs
A billion rows in Power BI. Testing out how Databricks Photon accelerated cluster performs as a direct query source for Power BI.
Data profiling is one of the core tools for data developers. And it is overlooked. Here is an blog how to do it easily in Databricks using Pandas Profiling.
Databricks Data Lakehouse without virtually any code by utilizing Azure Data Factory and Spark SQL.
Data quality is and will remain an issue. A technical blog about utilizing Great Expextations open source library in data platform context.
A blog about Azure Synapse on-demand SQL feature (when it was just published).
Blogit suomeksi (Blogs in Finnish)
Tiedonlaatu blogi sarjan avaava artikkeli. Mitä on tiedon laatu?
Data governance ja data management. Termit jotka sotkeentuvat helposti toisiinsa. Blogissa puntaroidaan näiden kahden termin yhteyttä.
Purviewin laajentamisen tarpeet tulevat monille esiin. Tapahtumajonoon pohjautuvat massapäivitykset ovat vaihtoehto, kun API rajapinta ei enää riitä.
Blogit DataOpsista. Tärkeitä asioita, joita pitäisi huimioida kun data kehitystä halutaan tehdä oikein.