Data Modeling and its relevance in the Cloud Era

Since 2005, I’ve immersed myself in the dynamic world of data and its modeling. It’s been fascinating to witness how data modeling has evolved over the years, navigating through its challenges and successes. Throughout this journey, advocating for its significance has been a constant endeavour, emphasizing its crucial role not just as a practice, but as a cornerstone for ensuring accurate and actionable data.

“Data models are techniques for representing information, sufficiently structured and simplistic to fit well into computer technology.” This definition by William Kent from 1978 still resonates today, capturing the essence of data modeling as a powerful communication and reflective tool for understanding organizational processes and information exchange.

However, the rise of new technologies in recent years has led some to question the relevance of data modeling. There’s a misconception that storing data indiscriminately is sufficient, relying on future machine learning algorithms to make sense of it. Yet, this overlooks the foundational principles of effective software development, often resulting in failed projects. The importance of “garbage in, garbage out” becomes evident in the era of cloud computing and AI, highlighting the risks of accumulating unnecessary data.

Consider the costs associated with maintaining terabytes of irrelevant information. Beyond tangible expenses, there are opportunity costs, especially when data lakes turn into data swamps. It’s crucial to move away from the mentality of storing everything and instead focus on purposeful data storage. Even with a schema-on-read approach, it’s essential to consider transforming data into refined zones through modeling, enabling efficient analytics and decision-making.

Despite technological advancements, including the exciting world of opportunities that imply the Cloud, the essence of data modeling remains unchanged. It continues to play a vital role in converting data into actionable insights, fostering communication, and promoting reflection within organizations. Kent’s definition still holds true, reflecting the aspirations of organizations aiming for robust data governance.

Data modeling, like other pillars of IT and software development, is here to stay. Its enduring nature is rooted in human cognition; all it requires is paper, pencil, and collective expertise to depict reality accurately. As we look ahead, data modeling will continue to shape the landscape of software development for years to come.

geohernandez

Recent Posts

Getting Started with Snowflake’s Snowpipe for Data Ingestion on Azure

In this post, I'll explain how to take full advantage of Snowflake's Snowpipe for reliable…

2 weeks ago

Integrating Azure Blob Storage with Snowflake: A Guide to Storage Integration and Staging Data

This article offers a comprehensive, step-by-step guide for integrating Azure Blob Storage with a Snowflake…

2 weeks ago

Getting Started with SnowSQL: Connecting to Your Snowflake Account

In this quick guide, we’ll walk through the essential steps to connect to Snowflake using…

3 months ago

A new step in my career as a Senior Data Architect

I am thrilled to share that I have embarked on a new professional journey as…

7 months ago

Formatting our Postgres scripts with pgformatter in DBeaver

Are you a PostgreSQL enthusiast using DBeaver on a Windows Platform? If you find yourself…

1 year ago

Looking back to Kimball’s approach to Data Warehousing

Over time, it's fascinating to witness how certain concepts, approaches, or visions age. Time, in…

1 year ago