The origins, purpose, and practice of data observability

Data Observability (DO) is an emerging category that proposes to help organizations identify, resolve, and prevent data quality issues by continuously monitoring the state of their data over time. This talk is a deep dive into DO, starting from its origins (why it matters), defining the scope and components of DO (what it is), and finally closing with actionable advice for putting observability into practice (how to do it).

We’ll rigorously define data observability to understand why it is different from software observability and existing data quality monitoring. We will derive the four pillars of DO (metrics, metadata, lineage, and logs) then describe how these pillars can be tied to common use cases encountered by teams using popular data architectures, especially on cloud data stacks.

Finally, we’ll close with pointers for how to put observability into practice, drawing from our experience helping teams across sizes, from fast-growing startups to large enterprises, successfully implement DO. Successfully implementing observability throughout an organization involves not only using the right technology, whether that be a commercial solution, an in-house initiative, or an open source project, but implementing the correct processes with the right people responsible for specific jobs. Talk participants can expect to leave with new concepts to understand how DO can help their organizations and ideas for how to implement DO.

Kevin-Hu-Metaplane.jpg

Kevin Hu

Co-founder

metaplane-logo-transparent.svg

What is the cost to attend and watch the virtual sessions?

Data Team Summit is always free and open for all to attend.

What is Data Teams Summit?

Data Team Summit is the official DataOps peer-to-peer community.

It's a time each year for everyone, from DataOps, CloudOps, AIOps, MLOps, to other technology professionals, to gather virtually to share the latest trends and best practices for running, managing, and monitoring data pipelines and data-intensive analytics workloads.

Sessions include talks by DataOps professionals at leading organizations, detailing how they’re establishing data predictability, increasing reliability, and reducing costs.

New to DataOps?

DataOps is a holistic approach to the creation, deployment, monitoring, management, and optimization of data-driven applications. It describes the culture and rules of engagement that allow data teams to deliver and maintain high-quality, on-time data products, often powered by AI and machine learning, in an agile and cost-effective way.

DataOps defines how data teams work and also affects data consumers and those whose work causes new data to be created and used within the organization. Their work enables the entire organization to access data efficiently for data-driven decision-making and for the creation and delivery of data-driven applications.

Organizations with well-developed DataOps strategies, governance, and processes can expedite the delivery of data-driven workflows and results faster and better than others.

Who comes to Data Teams Summit?

DataOps professionals and experts including data administrators, data architects, data engineers, data analysts, AI/ML professionals, and data technology leadership.

Join us for sessions on:

  • Data teams & best practices
  • Data pipelines & applications
  • DataOps observability
  • Data quality & data governance
  • Operations observability
  • MLOps
  • Data modernization & architecture
  • Biz/FinOps observability

Want to submit a session for Data Teams Summit 2023?

Send us a note to astronaut@solutionmonday.com or submit a talk proposal here: datateamssummit.com/cfp

Sign up below for a free ticket to Data Teams Summit on January 25th, 2023!

Interested in speaking at Data Teams Summit or participating as a community sponsor?

Please contact astronaut@solutionmonday.com.