Tackling data quality problems requires more than a series of tactical, one off improvement projects. By their nature, many data quality problems extend across and often beyond an organization. Addressing these issues requires a holistic architectural approach combining people, process and technology. Join Donna Burbank and Nigel Turner as they provide practical ways to control data quality issues in your organization.
Tackling Data Quality problems requires more than a series of tactical, one-off improvement projects. By their nature, many Data Quality problems extend across and often beyond an organization. Addressing these issues requires a holistic architectural approach combining people, process, and technology. Join Nigel Turner and Donna Burbank as they provide practical ways to control Data Quality issues in your organization.
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
Enterprise Architecture (EA) provides a visual blueprint of the organization, and shows key interrelationships between data, process, applications, and more. By abstracting these assets in a graphical view, it’s possible to see key interrelationships, particularly as they relate to data and its business impact across the organization. Join us for a discussion on how Data Architecture is a key component of an overall Enterprise Architecture for enhanced business value and success.
In this lecture we discuss data quality and data quality in Linked Data. This 50 minute lecture was given to masters student at Trinity College Dublin (Ireland), and had the following contents:
1) Defining Quality
2) Defining Data Quality - What, Why, Costs
3) Identifying problems early - using a simple semantic publishing process as an example
4) Assessing Linked (big) Data quality
5) Quality of LOD cloud datasets
References can be found at the end of the slides
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 (CC-BY-SA-40) International License.
This document discusses data governance and data architecture. It introduces data governance as the processes for managing data, including deciding data rights, making data decisions, and implementing those decisions. It describes how data architecture relates to data governance by providing patterns and structures for governing data. The document presents some common data architecture patterns, including a publish/subscribe pattern where a publisher pushes data to a hub and subscribers pull data from the hub. It also discusses how data architecture can support data governance goals through approaches like a subject area data model.
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
Organizations with governed metadata made available through their data catalog can answer questions their people have about the organization’s data. These organizations get more value from their data, protect their data better, gain improved ROI from data-centric projects and programs, and have more confidence in their most strategic data.
Join Bob Seiner for this lively webinar where he will talk about the value of a data catalog and how to build the use of the catalog into your stewards’ daily routines. Bob will share how the tool must be positioned for success and viewed as a must-have resource that is a steppingstone and catalyst to governed data across the organization.
To take a “ready, aim, fire” tactic to implement Data Governance, many organizations assess themselves against industry best practices. The process is not difficult or time-consuming and can directly assure that your activities target your specific needs. Best practices are always a strong place to start.
Join Bob Seiner for this popular RWDG topic, where he will provide the information you need to set your program in the best possible direction. Bob will walk you through the steps of conducting an assessment and share with you a set of typical results from taking this action. You may be surprised at how easy it is to organize the assessment and may hear results that stimulate the actions that you need to take.
In this webinar, Bob will share:
- The value of performing a Data Governance best practice assessment
- A practical list of industry Data Governance best practices
- Criteria to determine if a practice is best practice
- Steps to follow to complete an assessment
- Typical recommendations and actions that result from an assessment
Improving Data Literacy Around Data ArchitectureDATAVERSITY
Data Literacy is an increasing concern, as organizations look to become more data-driven. As the rise of the citizen data scientist and self-service data analytics becomes increasingly common, the need for business users to understand core Data Management fundamentals is more important than ever. At the same time, technical roles need a strong foundation in Data Architecture principles and best practices. Join this webinar to understand the key components of Data Literacy, and practical ways to implement a Data Literacy program in your organization.
Data Governance and Metadata ManagementDATAVERSITY
Metadata is a tool that improves data understanding, builds end-user confidence, and improves the return on investment in every asset associated with becoming a data-centric organization. Metadata’s use has expanded beyond “data about data” to cover every phase of data analytics, protection, and quality improvement. Data Governance and metadata are connected at the hip in every way possible. As the song goes, “You can’t have one without the other.”
In this RWDG webinar, Bob Seiner will provide a way to renew your energy by focusing on the valuable asset that can make or break your Data Governance program’s success. The truth is metadata is already inherent in your data environment, and it can be leveraged by making it available to all levels of the organization. At issue is finding the most appropriate ways to leverage and share metadata to improve data value and protection.
Throughout this webinar, Bob will share information about:
- Delivering an improved definition of metadata
- Communicating the relationship between successful governance and metadata
- Getting your business community to embrace the need for metadata
- Determining the metadata that will provide the most bang for your bucks
- The importance of Metadata Management to becoming data-centric
Tackling Data Quality problems requires more than a series of tactical, one-off improvement projects. By their nature, many Data Quality problems extend across and often beyond an organization. Addressing these issues requires a holistic architectural approach combining people, process, and technology. Join Nigel Turner and Donna Burbank as they provide practical ways to control Data Quality issues in your organization.
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
Enterprise Architecture (EA) provides a visual blueprint of the organization, and shows key interrelationships between data, process, applications, and more. By abstracting these assets in a graphical view, it’s possible to see key interrelationships, particularly as they relate to data and its business impact across the organization. Join us for a discussion on how Data Architecture is a key component of an overall Enterprise Architecture for enhanced business value and success.
In this lecture we discuss data quality and data quality in Linked Data. This 50 minute lecture was given to masters student at Trinity College Dublin (Ireland), and had the following contents:
1) Defining Quality
2) Defining Data Quality - What, Why, Costs
3) Identifying problems early - using a simple semantic publishing process as an example
4) Assessing Linked (big) Data quality
5) Quality of LOD cloud datasets
References can be found at the end of the slides
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 (CC-BY-SA-40) International License.
This document discusses data governance and data architecture. It introduces data governance as the processes for managing data, including deciding data rights, making data decisions, and implementing those decisions. It describes how data architecture relates to data governance by providing patterns and structures for governing data. The document presents some common data architecture patterns, including a publish/subscribe pattern where a publisher pushes data to a hub and subscribers pull data from the hub. It also discusses how data architecture can support data governance goals through approaches like a subject area data model.
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
Organizations with governed metadata made available through their data catalog can answer questions their people have about the organization’s data. These organizations get more value from their data, protect their data better, gain improved ROI from data-centric projects and programs, and have more confidence in their most strategic data.
Join Bob Seiner for this lively webinar where he will talk about the value of a data catalog and how to build the use of the catalog into your stewards’ daily routines. Bob will share how the tool must be positioned for success and viewed as a must-have resource that is a steppingstone and catalyst to governed data across the organization.
To take a “ready, aim, fire” tactic to implement Data Governance, many organizations assess themselves against industry best practices. The process is not difficult or time-consuming and can directly assure that your activities target your specific needs. Best practices are always a strong place to start.
Join Bob Seiner for this popular RWDG topic, where he will provide the information you need to set your program in the best possible direction. Bob will walk you through the steps of conducting an assessment and share with you a set of typical results from taking this action. You may be surprised at how easy it is to organize the assessment and may hear results that stimulate the actions that you need to take.
In this webinar, Bob will share:
- The value of performing a Data Governance best practice assessment
- A practical list of industry Data Governance best practices
- Criteria to determine if a practice is best practice
- Steps to follow to complete an assessment
- Typical recommendations and actions that result from an assessment
Improving Data Literacy Around Data ArchitectureDATAVERSITY
Data Literacy is an increasing concern, as organizations look to become more data-driven. As the rise of the citizen data scientist and self-service data analytics becomes increasingly common, the need for business users to understand core Data Management fundamentals is more important than ever. At the same time, technical roles need a strong foundation in Data Architecture principles and best practices. Join this webinar to understand the key components of Data Literacy, and practical ways to implement a Data Literacy program in your organization.
Data Governance and Metadata ManagementDATAVERSITY
Metadata is a tool that improves data understanding, builds end-user confidence, and improves the return on investment in every asset associated with becoming a data-centric organization. Metadata’s use has expanded beyond “data about data” to cover every phase of data analytics, protection, and quality improvement. Data Governance and metadata are connected at the hip in every way possible. As the song goes, “You can’t have one without the other.”
In this RWDG webinar, Bob Seiner will provide a way to renew your energy by focusing on the valuable asset that can make or break your Data Governance program’s success. The truth is metadata is already inherent in your data environment, and it can be leveraged by making it available to all levels of the organization. At issue is finding the most appropriate ways to leverage and share metadata to improve data value and protection.
Throughout this webinar, Bob will share information about:
- Delivering an improved definition of metadata
- Communicating the relationship between successful governance and metadata
- Getting your business community to embrace the need for metadata
- Determining the metadata that will provide the most bang for your bucks
- The importance of Metadata Management to becoming data-centric
Data-Ed Slides: Best Practices in Data Stewardship (Technical)DATAVERSITY
In order to find value in your organization's data assets, heroic data stewards are tasked with saving the day- every single day! These heroes adhere to a data governance framework and work to ensure that data is: captured right the first time, validated through automated means, and integrated into business processes. Whether its data profiling or in depth root cause analysis, data stewards can be counted on to ensure the organization's mission critical data is reliable. In this webinar we will approach this framework, and punctuate important facets of a data steward’s role.
Learning Objectives:
- Understand the business need for a data governance framework
- Learn why embedded data quality principles are an important part of system/process design
- Identify opportunities to help drive your organization to a data driven culture
This document discusses the importance of data quality and data governance. It states that poor data quality can lead to wrong decisions, bad reputation, and wasted money. It then provides examples of different dimensions of data quality like accuracy, completeness, currency, and uniqueness. It also discusses methods and tools for ensuring data quality, such as validation, data merging, and minimizing human errors. Finally, it defines data governance as a set of policies and standards to maintain data quality and provides examples of data governance team missions and a sample data quality scorecard.
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...DATAVERSITY
A solid data architecture is critical to the success of any data initiative. But what is meant by “data architecture”? Throughout the industry, there are many different “flavors” of data architecture, each with its own unique value and use cases for describing key aspects of the data landscape. Join this webinar to demystify the various architecture styles and understand how they can add value to your organization.
Activate Data Governance Using the Data CatalogDATAVERSITY
This document discusses activating data governance using a data catalog. It compares active vs passive data governance, with active embedding governance into people's work through a catalog. The catalog plays a key role by allowing stewards to document definition, production, and usage of data in a centralized place. For governance to be effective, metadata from various sources must be consolidated and maintained in the catalog.
DAS Slides: Data Quality Best PracticesDATAVERSITY
Tackling Data Quality problems requires more than a series of tactical, one-off improvement projects. By their nature, many Data Quality problems extend across and often beyond an organization. Addressing these issues requires a holistic architectural approach combining people, process, and technology. Join Nigel Turner and Donna Burbank as they provide practical ways to control Data Quality issues in your organization.
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceDATAVERSITY
Data catalogs, business glossaries, and data dictionaries house metadata that is important to your organization’s governance of data. People in your organization need to be engaged in leveraging the tools, understanding the data that is available, who is responsible for the data, and knowing how to get their hands on the data to perform their job function. The metadata will not govern itself.
Join Bob Seiner for the webinar where he will discuss how glossaries, dictionaries, and catalogs can result in effective Data Governance. People must have confidence in the metadata associated with the data that you need them to trust. Therefore, the metadata in your data catalog, business glossary, and data dictionary must result in governed data. Learn how glossaries, dictionaries, and catalogs can result in Data Governance in this webinar.
Bob will discuss the following subjects in this webinar:
- Successful Data Governance relies on value from very important tools
- What it means to govern your data catalog, business glossary, and data dictionary
- Why governing the metadata in these tools is important
- The roles necessary to govern these tools
- Governance expected from metadata in catalogs, glossaries, and dictionaries
This introduction to data governance presentation covers the inter-related DM foundational disciplines (Data Integration / DWH, Business Intelligence and Data Governance). Some of the pitfalls and success factors for data governance.
• IM Foundational Disciplines
• Cross-functional Workflow Exchange
• Key Objectives of the Data Governance Framework
• Components of a Data Governance Framework
• Key Roles in Data Governance
• Data Governance Committee (DGC)
• 4 Data Governance Policy Areas
• 3 Challenges to Implementing Data Governance
• Data Governance Success Factors
This presentation reports on data governance best practices. Based on a definition of fundamental terms and the business rationale for data governance, a set of case studies from leading companies is presented. The content of this presentation is a result of the Competence Center Corporate Data Quality (CC CDQ) at the University of St. Gallen, Switzerland.
Data Architecture Strategies: Data Architecture for Digital TransformationDATAVERSITY
MDM, data quality, data architecture, and more. At the same time, combining these foundational data management approaches with other innovative techniques can help drive organizational change as well as technological transformation. This webinar will provide practical steps for creating a data foundation for effective digital transformation.
Gartner: Master Data Management FunctionalityGartner
MDM solutions require tightly integrated capabilities including data modeling, integration, synchronization, propagation, flexible architecture, granular and packaged services, performance, availability, analysis, information quality management, and security. These capabilities allow organizations to extend data models, integrate and synchronize data in real-time and batch processes across systems, measure ROI and data quality, and securely manage the MDM solution.
How to Build & Sustain a Data Governance Operating Model DATUM LLC
Learn how to execute a data governance strategy through creation of a successful business case and operating model.
Originally presented to an audience of 400+ at the Master Data Management & Data Governance Summit.
Visit www.datumstrategy.com for more!
1. It is important to define data quality metrics that are purpose-fit and meaningful to customers. Dashboards should focus more on driving outcomes than just design.
2. Commonly used data quality dimensions include completeness, conformity, consistency, duplication, integrity, and accuracy. Specific metrics are then defined within each dimension tied to business objectives and rules.
3. Targets and trends provide valuable insights, with traffic light targets highlighting priority areas in red and trends showing progress over time.
Data Modeling, Data Governance, & Data QualityDATAVERSITY
Data Governance is often referred to as the people, processes, and policies around data and information, and these aspects are critical to the success of any data governance implementation. But just as critical is the technical infrastructure that supports the diverse data environments that run the business. Data models can be the critical link between business definitions and rules and the technical data systems that support them. Without the valuable metadata these models provide, data governance often lacks the “teeth” to be applied in operational and reporting systems.
Join Donna Burbank and her guest, Nigel Turner, as they discuss how data models & metadata-driven data governance can be applied in your organization in order to achieve improved data quality.
DAS Slides: Data Governance - Combining Data Management with Organizational ...DATAVERSITY
Data Governance is both a technical and an organizational discipline, and getting Data Governance right requires a combination of Data Management fundamentals aligned with organizational change and stakeholder buy-in. Join Nigel Turner and Donna Burbank as they provide an architecture-based approach to aligning business motivation, organizational change, Metadata Management, Data Architecture and more in a concrete, practical way to achieve success in your organization.
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task. The opportunity in getting it right can be significant, however, as data drives many of the key initiatives in today’s marketplace: digital transformation, marketing, customer centricity, and more. This webinar will help de-mystify Data Strategy and Data Architecture and will provide concrete, practical ways to get started.
• History of Data Management
• Business Drivers for implementation of data governance • Building Data Strategy & Governance Framework
• Data Management Maturity Models
• Data Quality Management
• Metadata and Governance
• Metadata Management
• Data Governance Stakeholder Communication Strategy
The first step towards understanding what data assets mean for your organization is understanding what those assets mean for each other. Metadata—literally, data about data—is one of many data management disciplines inherent in good systems development, and is perhaps the most mislabeled and misunderstood out of the lot. Understanding metadata and its associated technologies as more than just straightforward technological tools can provide powerful insight, the efficiency of organizational practices, and can also enable you to combine more sophisticated data management techniques in support of larger and more complex business initiatives.
In this webinar, we will:
Illustrate how to leverage metadata in support of your business strategy
Discuss foundational metadata concepts based on the DAMA Guide to Data Management Book of Knowledge (DAMA DMBOK)
Enumerate guiding principles for and lessons previously learned from metadata and its practical uses
Master Data Management – Aligning Data, Process, and GovernanceDATAVERSITY
Master Data Management (MDM) provides organizations with an accurate and comprehensive view of their business-critical data such as customers, products, vendors, and more. While mastering these key data areas can be a complex task, the value of doing so can be tremendous – from real-time operational integration to data warehousing and analytic reporting. This webinar will provide practical strategies for gaining value from your MDM initiative, while at the same time assuring a solid architectural and governance foundation that will ensure long-term, enterprise-wide success.
The Business Value of Metadata for Data GovernanceRoland Bullivant
In today’s digital economy, data drives the core processes that deliver profitability and growth - from marketing, to finance, to sales, supply chain, and more. It is also likely that for many large organizations much of their key data is retained in application packages from SAP, Oracle, Microsoft, Salesforce and others. In order to ensure that their foundational data infrastructure runs smoothly, most organizations have adopted a data governance initiative. These typically focus on the people and processes around managing data and information. Without an actionable link to the physical systems that run key business processes, however, governance programs can often lack the ‘teeth’ to effectively implement business change.
Metadata management is a process that can link business processes and drivers with the technical applications that support them. This makes data governance actionable and relevant in today’s fast-paced and results-driven business environment. One of the challenges facing data governance teams however, is the variety in format, accessibility and complexity of metadata across the organization’s systems.
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
Enterprise Architecture (EA) provides a visual blueprint of the organization, and shows key interrelationships between data, process, applications, and more. By abstracting these assets in a graphical view, it’s possible to see key interrelationships, particularly as they relate to data and its business impact across the organization. Join us for a discussion on how data architecture is a key component of an overall enterprise architecture for enhanced business value and success.
Data-Ed Slides: Best Practices in Data Stewardship (Technical)DATAVERSITY
In order to find value in your organization's data assets, heroic data stewards are tasked with saving the day- every single day! These heroes adhere to a data governance framework and work to ensure that data is: captured right the first time, validated through automated means, and integrated into business processes. Whether its data profiling or in depth root cause analysis, data stewards can be counted on to ensure the organization's mission critical data is reliable. In this webinar we will approach this framework, and punctuate important facets of a data steward’s role.
Learning Objectives:
- Understand the business need for a data governance framework
- Learn why embedded data quality principles are an important part of system/process design
- Identify opportunities to help drive your organization to a data driven culture
This document discusses the importance of data quality and data governance. It states that poor data quality can lead to wrong decisions, bad reputation, and wasted money. It then provides examples of different dimensions of data quality like accuracy, completeness, currency, and uniqueness. It also discusses methods and tools for ensuring data quality, such as validation, data merging, and minimizing human errors. Finally, it defines data governance as a set of policies and standards to maintain data quality and provides examples of data governance team missions and a sample data quality scorecard.
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...DATAVERSITY
A solid data architecture is critical to the success of any data initiative. But what is meant by “data architecture”? Throughout the industry, there are many different “flavors” of data architecture, each with its own unique value and use cases for describing key aspects of the data landscape. Join this webinar to demystify the various architecture styles and understand how they can add value to your organization.
Activate Data Governance Using the Data CatalogDATAVERSITY
This document discusses activating data governance using a data catalog. It compares active vs passive data governance, with active embedding governance into people's work through a catalog. The catalog plays a key role by allowing stewards to document definition, production, and usage of data in a centralized place. For governance to be effective, metadata from various sources must be consolidated and maintained in the catalog.
DAS Slides: Data Quality Best PracticesDATAVERSITY
Tackling Data Quality problems requires more than a series of tactical, one-off improvement projects. By their nature, many Data Quality problems extend across and often beyond an organization. Addressing these issues requires a holistic architectural approach combining people, process, and technology. Join Nigel Turner and Donna Burbank as they provide practical ways to control Data Quality issues in your organization.
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceDATAVERSITY
Data catalogs, business glossaries, and data dictionaries house metadata that is important to your organization’s governance of data. People in your organization need to be engaged in leveraging the tools, understanding the data that is available, who is responsible for the data, and knowing how to get their hands on the data to perform their job function. The metadata will not govern itself.
Join Bob Seiner for the webinar where he will discuss how glossaries, dictionaries, and catalogs can result in effective Data Governance. People must have confidence in the metadata associated with the data that you need them to trust. Therefore, the metadata in your data catalog, business glossary, and data dictionary must result in governed data. Learn how glossaries, dictionaries, and catalogs can result in Data Governance in this webinar.
Bob will discuss the following subjects in this webinar:
- Successful Data Governance relies on value from very important tools
- What it means to govern your data catalog, business glossary, and data dictionary
- Why governing the metadata in these tools is important
- The roles necessary to govern these tools
- Governance expected from metadata in catalogs, glossaries, and dictionaries
This introduction to data governance presentation covers the inter-related DM foundational disciplines (Data Integration / DWH, Business Intelligence and Data Governance). Some of the pitfalls and success factors for data governance.
• IM Foundational Disciplines
• Cross-functional Workflow Exchange
• Key Objectives of the Data Governance Framework
• Components of a Data Governance Framework
• Key Roles in Data Governance
• Data Governance Committee (DGC)
• 4 Data Governance Policy Areas
• 3 Challenges to Implementing Data Governance
• Data Governance Success Factors
This presentation reports on data governance best practices. Based on a definition of fundamental terms and the business rationale for data governance, a set of case studies from leading companies is presented. The content of this presentation is a result of the Competence Center Corporate Data Quality (CC CDQ) at the University of St. Gallen, Switzerland.
Data Architecture Strategies: Data Architecture for Digital TransformationDATAVERSITY
MDM, data quality, data architecture, and more. At the same time, combining these foundational data management approaches with other innovative techniques can help drive organizational change as well as technological transformation. This webinar will provide practical steps for creating a data foundation for effective digital transformation.
Gartner: Master Data Management FunctionalityGartner
MDM solutions require tightly integrated capabilities including data modeling, integration, synchronization, propagation, flexible architecture, granular and packaged services, performance, availability, analysis, information quality management, and security. These capabilities allow organizations to extend data models, integrate and synchronize data in real-time and batch processes across systems, measure ROI and data quality, and securely manage the MDM solution.
How to Build & Sustain a Data Governance Operating Model DATUM LLC
Learn how to execute a data governance strategy through creation of a successful business case and operating model.
Originally presented to an audience of 400+ at the Master Data Management & Data Governance Summit.
Visit www.datumstrategy.com for more!
1. It is important to define data quality metrics that are purpose-fit and meaningful to customers. Dashboards should focus more on driving outcomes than just design.
2. Commonly used data quality dimensions include completeness, conformity, consistency, duplication, integrity, and accuracy. Specific metrics are then defined within each dimension tied to business objectives and rules.
3. Targets and trends provide valuable insights, with traffic light targets highlighting priority areas in red and trends showing progress over time.
Data Modeling, Data Governance, & Data QualityDATAVERSITY
Data Governance is often referred to as the people, processes, and policies around data and information, and these aspects are critical to the success of any data governance implementation. But just as critical is the technical infrastructure that supports the diverse data environments that run the business. Data models can be the critical link between business definitions and rules and the technical data systems that support them. Without the valuable metadata these models provide, data governance often lacks the “teeth” to be applied in operational and reporting systems.
Join Donna Burbank and her guest, Nigel Turner, as they discuss how data models & metadata-driven data governance can be applied in your organization in order to achieve improved data quality.
DAS Slides: Data Governance - Combining Data Management with Organizational ...DATAVERSITY
Data Governance is both a technical and an organizational discipline, and getting Data Governance right requires a combination of Data Management fundamentals aligned with organizational change and stakeholder buy-in. Join Nigel Turner and Donna Burbank as they provide an architecture-based approach to aligning business motivation, organizational change, Metadata Management, Data Architecture and more in a concrete, practical way to achieve success in your organization.
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task. The opportunity in getting it right can be significant, however, as data drives many of the key initiatives in today’s marketplace: digital transformation, marketing, customer centricity, and more. This webinar will help de-mystify Data Strategy and Data Architecture and will provide concrete, practical ways to get started.
• History of Data Management
• Business Drivers for implementation of data governance • Building Data Strategy & Governance Framework
• Data Management Maturity Models
• Data Quality Management
• Metadata and Governance
• Metadata Management
• Data Governance Stakeholder Communication Strategy
The first step towards understanding what data assets mean for your organization is understanding what those assets mean for each other. Metadata—literally, data about data—is one of many data management disciplines inherent in good systems development, and is perhaps the most mislabeled and misunderstood out of the lot. Understanding metadata and its associated technologies as more than just straightforward technological tools can provide powerful insight, the efficiency of organizational practices, and can also enable you to combine more sophisticated data management techniques in support of larger and more complex business initiatives.
In this webinar, we will:
Illustrate how to leverage metadata in support of your business strategy
Discuss foundational metadata concepts based on the DAMA Guide to Data Management Book of Knowledge (DAMA DMBOK)
Enumerate guiding principles for and lessons previously learned from metadata and its practical uses
Master Data Management – Aligning Data, Process, and GovernanceDATAVERSITY
Master Data Management (MDM) provides organizations with an accurate and comprehensive view of their business-critical data such as customers, products, vendors, and more. While mastering these key data areas can be a complex task, the value of doing so can be tremendous – from real-time operational integration to data warehousing and analytic reporting. This webinar will provide practical strategies for gaining value from your MDM initiative, while at the same time assuring a solid architectural and governance foundation that will ensure long-term, enterprise-wide success.
The Business Value of Metadata for Data GovernanceRoland Bullivant
In today’s digital economy, data drives the core processes that deliver profitability and growth - from marketing, to finance, to sales, supply chain, and more. It is also likely that for many large organizations much of their key data is retained in application packages from SAP, Oracle, Microsoft, Salesforce and others. In order to ensure that their foundational data infrastructure runs smoothly, most organizations have adopted a data governance initiative. These typically focus on the people and processes around managing data and information. Without an actionable link to the physical systems that run key business processes, however, governance programs can often lack the ‘teeth’ to effectively implement business change.
Metadata management is a process that can link business processes and drivers with the technical applications that support them. This makes data governance actionable and relevant in today’s fast-paced and results-driven business environment. One of the challenges facing data governance teams however, is the variety in format, accessibility and complexity of metadata across the organization’s systems.
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
Enterprise Architecture (EA) provides a visual blueprint of the organization, and shows key interrelationships between data, process, applications, and more. By abstracting these assets in a graphical view, it’s possible to see key interrelationships, particularly as they relate to data and its business impact across the organization. Join us for a discussion on how data architecture is a key component of an overall enterprise architecture for enhanced business value and success.
Data Governance & Data Architecture - Alignment and SynergiesDATAVERSITY
The definition of Data Governance can vary depending on the audience. To many, Data Governance consists of committees and stewardship roles. To others, it focuses on technical Data Management and controls. Holistic Data Governance combines both aspects, and a robust Data Architecture can be the “glue” that binds business and IT governance together. Join this webinar for practical tips and hands-on exercises for aligning Data Architecture and Data Governance for business and IT success.
Data Integration is a key part of many of today’s data management challenges: from data warehousing, to MDM, to mergers & acquisitions. Issues can arise not only in trying to align technical formats from various databases and legacy systems, but in trying to achieve common business definitions and rules.
Join this webinar to see how a data model can help with both of these challenges – from ‘bottom-up’ technical integration, to the ‘top-down’ business alignment.
This document provides an overview of best practices in metadata management. It discusses what metadata is, why it is important, and how it adds context and definition to data. Metadata management is part of an overall data strategy. The document outlines different types of metadata and how it is used by various roles like developers, business people, auditors, and data architects. It discusses challenges like inconsistent metadata that can lead to issues. It also provides examples of metadata sources, architectural options, and how metadata enables capabilities like data lineage, impact analysis, and semantic relationships.
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...DATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task. The opportunity in getting it right can be significant, however, as data drives many of the key initiatives in today’s marketplace from digital transformation, to marketing, to customer centricity, population health, and more. This webinar will help de-mystify data strategy and data architecture and will provide concrete, practical ways to get started.
DAS Slides: Data Quality Best PracticesDATAVERSITY
Tackling data quality problems requires more than a series of tactical, one off improvement projects. By their nature, many data quality problems extend across and often beyond an organization. Addressing these issues requires a holistic architectural approach combining people, process and technology. Join Nigel Turner and Donna Burbank as they provide practical ways to control data quality issues in your organization.
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task – but it’s worth the effort. Getting your Data Strategy right can provide significant value, as data drives many of the key initiatives in today’s marketplace, from digital transformation to marketing, customer centricity, population health, and more. This webinar will help demystify Data Strategy and its relationship to Data Architecture and will provide concrete, practical ways to get started.
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...DATAVERSITY
Achieving a ‘single version of the truth’ is critical to any MDM, DW, or data integration initiative. But have you ever tried to get people to agree on a single definition of “customer”? Or to get Sales, Marketing, and IT to agree on a target audience?
This webinar will discuss how a conceptual data model can be used as a powerful communication tool for data-intensive initiatives. It will cover how to build a high-level data model, how the core concepts in a data model can have significant business impact on an organization, and will provide some easy-to-use templates and guidelines for a step-by-step approach to implementing a conceptual data model in your organization.
Data modeling continues to be a tried-and-true method of managing critical data aspects from both the business and technical perspective. Like any tool or methodology, there is a “right tool for the right job”, and specific model types exist for both business and technical users across operational, reporting, analytic, and other use cases. This webinar will provide an overview of the various data modeling techniques available, and how to use each for maximum value to the organization.
Data Integrity: From speed dating to lifelong partnershipPrecisely
Governance has little to do with governance…it’s about delivering and demonstrating value. It’s one thing for your colleagues to intellectually believe in the value of data, good data, and governed data, but it’s another thing entirely to have them emotionally engaged and excited to be involved. In this presentation from the CDO Sit-Down series, Shaun Connolly, Vice President of International Strategic Services, shares his thoughts and experience on approaches to win over reluctant leaders and business teams and describe the key components of successful programs.
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task – but it’s worth the effort. Getting your Data Strategy right can provide significant value, as data drives many of the key initiatives in today’s marketplace, from digital transformation to marketing, customer centricity, population health, and more. This webinar will help demystify Data Strategy and its relationship to Data Architecture and will provide concrete, practical ways to get started.
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...DATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task. The opportunity in getting it right can be significant, however, as data drives many of the key initiatives in today’s marketplace from digital transformation, to marketing, to customer centricity, population health, and more. This webinar will help de-mystify data strategy and data architecture and will provide concrete, practical ways to get started.
What is the value of data? Data governance must look beyond master data to deliver real value.
Visit www,masterdata.co.za/index.php/data-governance-solutions
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...DATAVERSITY
The majority of successful organizations in today’s economy are data-driven, and innovative companies are looking at new ways to leverage data and information for strategic advantage. While the opportunities are vast, and the value has clearly been shown across a number of industries in using data to strategic advantage, the choices in technology can be overwhelming. From Big Data to Artificial Intelligence to Data Lakes and Warehouses, the industry is continually evolving to provide new and exciting technological solutions.
This webinar will help make sense of the various data architectures & technologies available, and how to leverage them for business value and success. A practical framework will be provided to generate “quick wins” for your organization, while at the same time building towards a longer-term sustainable architecture. Case studies will also be provided to show how successful organizations have successfully built a data strategies to support their business goals.
Data Governance — Aligning Technical and Business ApproachesDATAVERSITY
Data Governance can have a varied definition, depending on the audience. To many, data governance consists of committee meetings and stewardship roles. To others, it focuses on technical data management and controls. Holistic data governance combines both of these aspects, and a robust data architecture and associated diagrams can be the “glue” that binds business and IT governance together. Join this webinar for practical tips and hands-on exercises for aligning data architecture & data governance for business and IT success.
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...DATAVERSITY
Data can provide tremendous value to an organization in today’s information-driven economy. New customer insights, better efficiency, and new product innovation are just some of the ways organizations are obtaining value through data. But in order to achieve this value, a strong data architecture is required to ensure that the data infrastructure runs smoothly, while at the same time aligning with business needs and corporate culture. A Data Strategy can assist in building a data architecture foundation through:
Identifying business requirements, rules & definitions via a business-centric data model
Creating a data inventory & integrating disparate data sources
Building a technical data architecture through data models & related artifacts
Coordinating the people, processes and culture necessary for success
Identifying tools & technology needed for creating & maintaining high quality data
LDM Webinar: Data Modeling & Business IntelligenceDATAVERSITY
Business Intelligence (BI) is a valuable way to use information to show the overall health and performance of the organization. At its core is quality, well-structured data that allows for successful reporting and analytics. A data model helps provide both the business definitions as well as the structural optimization needed for successful BI implementations.
Join this webinar to see how a data model underpins business intelligence and analytics in today’s organization.
Data Modeling Best Practices - Business & Technical ApproachesDATAVERSITY
Data Modeling is hotter than ever, according to a number of recent surveys. Part of the appeal of data models lies in their ability to translate complex data concepts in an intuitive, visual way to both business and technical stakeholders. This webinar provides real-world best practices in using Data Modeling for both business and technical teams.
DAS Slides: Data Modeling Case Study — Business Data Modeling at KiewitDATAVERSITY
The document discusses data modeling efforts at Kiewit, a large construction company. It describes how Kiewit created conceptual data models to gain clarity on key business data assets and drive their data strategy. The models were organized by business capability and resonated well with stakeholders. This allowed Kiewit to identify data issues and gain insights that informed activities like data literacy training and system decommissioning planning. The modeling process highlighted differences between IT and business views of data and helped improve communication.
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
Organizations today need a broad set of enterprise data cloud services with key data functionality to modernize applications and utilize machine learning. They need a comprehensive platform designed to address multi-faceted needs by offering multi-function data management and analytics to solve the enterprise’s most pressing data and analytic challenges in a streamlined fashion.
In this research-based session, I’ll discuss what the components are in multiple modern enterprise analytics stacks (i.e., dedicated compute, storage, data integration, streaming, etc.) and focus on total cost of ownership.
A complete machine learning infrastructure cost for the first modern use case at a midsize to large enterprise will be anywhere from $3 million to $22 million. Get this data point as you take the next steps on your journey into the highest spend and return item for most companies in the next several years.
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
Do you ever wonder how data-driven organizations fuel analytics, improve customer experience, and accelerate business productivity? They are successful by governing and mastering data effectively so they can get trusted data to those who need it faster. Efficient data discovery, mastering and democratization is critical for swiftly linking accurate data with business consumers. When business teams can quickly and easily locate, interpret, trust, and apply data assets to support sound business judgment, it takes less time to see value.
Join data mastering and data governance experts from Informatica—plus a real-world organization empowering trusted data for analytics—for a lively panel discussion. You’ll hear more about how a single cloud-native approach can help global businesses in any economy create more value—faster, more reliably, and with more confidence—by making data management and governance easier to implement.
What is data literacy? Which organizations, and which workers in those organizations, need to be data-literate? There are seemingly hundreds of definitions of data literacy, along with almost as many opinions about how to achieve it.
In a broader perspective, companies must consider whether data literacy is an isolated goal or one component of a broader learning strategy to address skill deficits. How does data literacy compare to other types of skills or “literacy” such as business acumen?
This session will position data literacy in the context of other worker skills as a framework for understanding how and where it fits and how to advocate for its importance.
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task – but it’s worth the effort. Getting your Data Strategy right can provide significant value, as data drives many of the key initiatives in today’s marketplace – from digital transformation, to marketing, to customer centricity, to population health, and more. This webinar will help demystify Data Strategy and its relationship to Data Architecture and will provide concrete, practical ways to get started.
Uncover how your business can save money and find new revenue streams.
Driving profitability is a top priority for companies globally, especially in uncertain economic times. It's imperative that companies reimagine growth strategies and improve process efficiencies to help cut costs and drive revenue – but how?
By leveraging data-driven strategies layered with artificial intelligence, companies can achieve untapped potential and help their businesses save money and drive profitability.
In this webinar, you'll learn:
- How your company can leverage data and AI to reduce spending and costs
- Ways you can monetize data and AI and uncover new growth strategies
- How different companies have implemented these strategies to achieve cost optimization benefits
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
Organizations with governed metadata made available through their data catalog can answer questions their people have about the organization’s data. These organizations get more value from their data, protect their data better, gain improved ROI from data-centric projects and programs, and have more confidence in their most strategic data.
Join Bob Seiner for this lively webinar where he will talk about the value of a data catalog and how to build the use of the catalog into your stewards’ daily routines. Bob will share how the tool must be positioned for success and viewed as a must-have resource that is a steppingstone and catalyst to governed data across the organization.
In this webinar, Bob will focus on:
-Selecting the appropriate metadata to govern
-The business and technical value of a data catalog
-Building the catalog into people’s routines
-Positioning the data catalog for success
-Questions the data catalog can answer
Because every organization produces and propagates data as part of their day-to-day operations, data trends are becoming more and more important in the mainstream business world’s consciousness. For many organizations in various industries, though, comprehension of this development begins and ends with buzzwords: “Big Data,” “NoSQL,” “Data Scientist,” and so on. Few realize that all solutions to their business problems, regardless of platform or relevant technology, rely to a critical extent on the data model supporting them. As such, data modeling is not an optional task for an organization’s data effort, but rather a vital activity that facilitates the solutions driving your business. Since quality engineering/architecture work products do not happen accidentally, the more your organization depends on automation, the more important the data models driving the engineering and architecture activities of your organization. This webinar illustrates data modeling as a key activity upon which so much technology and business investment depends.
Specific learning objectives include:
- Understanding what types of challenges require data modeling to be part of the solution
- How automation requires standardization on derivable via data modeling techniques
- Why only a working partnership between data and the business can produce useful outcomes
Analytics play a critical role in supporting strategic business initiatives. Despite the obvious value to analytic professionals of providing the analytics for these initiatives, many executives question the economic return of analytics as well as data lakes, machine learning, master data management, and the like.
Technology professionals need to calculate and present business value in terms business executives can understand. Unfortunately, most IT professionals lack the knowledge required to develop comprehensive cost-benefit analyses and return on investment (ROI) measurements.
This session provides a framework to help technology professionals research, measure, and present the economic value of a proposed or existing analytics initiative, no matter the form that the business benefit arises. The session will provide practical advice about how to calculate ROI and the formulas, and how to collect the necessary information.
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
Data Mesh is a trending approach to building a decentralized data architecture by leveraging a domain-oriented, self-service design. However, the pure definition of Data Mesh lacks a center of excellence or central data team and doesn’t address the need for a common approach for sharing data products across teams. The semantic layer is emerging as a key component to supporting a Hub and Spoke style of organizing data teams by introducing data model sharing, collaboration, and distributed ownership controls.
This session will explain how data teams can define common models and definitions with a semantic layer to decentralize analytics product creation using a Hub and Spoke architecture.
Attend this session to learn about:
- The role of a Data Mesh in the modern cloud architecture.
- How a semantic layer can serve as the binding agent to support decentralization.
- How to drive self service with consistency and control.
Enterprise data literacy. A worthy objective? Certainly! A realistic goal? That remains to be seen. As companies consider investing in data literacy education, questions arise about its value and purpose. While the destination – having a data-fluent workforce – is attractive, we wonder how (and if) we can get there.
Kicking off this webinar series, we begin with a panel discussion to explore the landscape of literacy, including expert positions and results from focus groups:
- why it matters,
- what it means,
- what gets in the way,
- who needs it (and how much they need),
- what companies believe it will accomplish.
In this engaging discussion about literacy, we will set the stage for future webinars to answer specific questions and feature successful literacy efforts.
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
Change is hard, especially in response to negative stimuli or what is perceived as negative stimuli. So organizations need to reframe how they think about data privacy, security and governance, treating them as value centers to 1) ensure enterprise data can flow where it needs to, 2) prevent – not just react – to internal and external threats, and 3) comply with data privacy and security regulations.
Working together, these roles can accelerate faster access to approved, relevant and higher quality data – and that means more successful use cases, faster speed to insights, and better business outcomes. However, both new information and tools are required to make the shift from defense to offense, reducing data drama while increasing its value.
Join us for this panel discussion with experts in these fields as they discuss:
- Recent research about where data privacy, security and governance stand
- The most valuable enterprise data use cases
- The common obstacles to data value creation
- New approaches to data privacy, security and governance
- Their advice on how to shift from a reactive to resilient mindset/culture/organization
You’ll be educated, entertained and inspired by this panel and their expertise in using the data trifecta to innovate more often, operate more efficiently, and differentiate more strategically.
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
With technological innovation and change occurring at an ever-increasing rate, it’s hard to keep track of what’s hype and what can provide practical value for your organization. Join this webinar to see the results of a recent DATAVERSITY survey on emerging trends in Data Architecture, along with practical commentary and advice from industry expert Donna Burbank.
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
As DATAVERSITY’s RWDG series hurdles into our 12th year, this webinar takes a quick look behind us, evaluates the present, and predicts the future of Data Governance. Based on webinar numbers, hot Data Governance topics have evolved over the years from policies and best practices, roles and tools, data catalogs and frameworks, to supporting data mesh and fabric, artificial intelligence, virtualization, literacy, and metadata governance.
Join Bob Seiner as he reflects on the past and what has and has not worked, while sharing examples of enterprise successes and struggles. In this webinar, Bob will challenge the audience to stay a step ahead by learning from the past and blazing a new trail into the future of Data Governance.
In this webinar, Bob will focus on:
- Data Governance’s past, present, and future
- How trials and tribulations evolve to success
- Leveraging lessons learned to improve productivity
- The great Data Governance tool explosion
- The future of Data Governance
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
1) The document discusses best practices for data protection on Google Cloud, including setting data policies, governing access, classifying sensitive data, controlling access, encryption, secure collaboration, and incident response.
2) It provides examples of how to limit access to data and sensitive information, gain visibility into where sensitive data resides, encrypt data with customer-controlled keys, harden workloads, run workloads confidentially, collaborate securely with untrusted parties, and address cloud security incidents.
3) The key recommendations are to protect data at rest and in use through classification, access controls, encryption, confidential computing; securely share data through techniques like secure multi-party computation; and have an incident response plan to quickly address threats.
It is a fascinating, explosive time for enterprise analytics.
It is from the position of analytics leadership that the enterprise mission will be executed and company leadership will emerge. The data professional is absolutely sitting on the performance of the company in this information economy and has an obligation to demonstrate the possibilities and originate the architecture, data, and projects that will deliver analytics. After all, no matter what business you’re in, you’re in the business of analytics.
The coming years will be full of big changes in enterprise analytics and data architecture. William will kick off the fifth year of the Advanced Analytics series with a discussion of the trends winning organizations should build into their plans, expectations, vision, and awareness now.
Too often I hear the question “Can you help me with our data strategy?” Unfortunately, for most, this is the wrong request because it focuses on the least valuable component: the data strategy itself. A more useful request is: “Can you help me apply data strategically?” Yes, at early maturity phases the process of developing strategic thinking about data is more important than the actual product! Trying to write a good (must less perfect) data strategy on the first attempt is generally not productive –particularly given the widespread acceptance of Mike Tyson’s truism: “Everybody has a plan until they get punched in the face.” This program refocuses efforts on learning how to iteratively improve the way data is strategically applied. This will permit data-based strategy components to keep up with agile, evolving organizational strategies. It also contributes to three primary organizational data goals. Learn how to improve the following:
- Your organization’s data
- The way your people use data
- The way your people use data to achieve your organizational strategy
This will help in ways never imagined. Data are your sole non-depletable, non-degradable, durable strategic assets, and they are pervasively shared across every organizational area. Addressing existing challenges programmatically includes overcoming necessary but insufficient prerequisites and developing a disciplined, repeatable means of improving business objectives. This process (based on the theory of constraints) is where the strategic data work really occurs as organizations identify prioritized areas where better assets, literacy, and support (data strategy components) can help an organization better achieve specific strategic objectives. Then the process becomes lather, rinse, and repeat. Several complementary concepts are also covered, including:
- A cohesive argument for why data strategy is necessary for effective data governance
- An overview of prerequisites for effective strategic use of data strategy, as well as common pitfalls
- A repeatable process for identifying and removing data constraints
- The importance of balancing business operation and innovation
Who Should Own Data Governance – IT or Business?DATAVERSITY
The question is asked all the time: “What part of the organization should own your Data Governance program?” The typical answers are “the business” and “IT (information technology).” Another answer to that question is “Yes.” The program must be owned and reside somewhere in the organization. You may ask yourself if there is a correct answer to the question.
Join this new RWDG webinar with Bob Seiner where Bob will answer the question that is the title of this webinar. Determining ownership of Data Governance is a vital first step. Figuring out the appropriate part of the organization to manage the program is an important second step. This webinar will help you address these questions and more.
In this session Bob will share:
- What is meant by “the business” when it comes to owning Data Governance
- Why some people say that Data Governance in IT is destined to fail
- Examples of IT positioned Data Governance success
- Considerations for answering the question in your organization
- The final answer to the question of who should own Data Governance
This document summarizes a research study that assessed the data management practices of 175 organizations between 2000-2006. The study had both descriptive and self-improvement goals, such as understanding the range of practices and determining areas for improvement. Researchers used a structured interview process to evaluate organizations across six data management processes based on a 5-level maturity model. The results provided insights into an organization's practices and a roadmap for enhancing data management.
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
MLOps is a practice for collaboration between Data Science and operations to manage the production machine learning (ML) lifecycles. As an amalgamation of “machine learning” and “operations,” MLOps applies DevOps principles to ML delivery, enabling the delivery of ML-based innovation at scale to result in:
Faster time to market of ML-based solutions
More rapid rate of experimentation, driving innovation
Assurance of quality, trustworthiness, and ethical AI
MLOps is essential for scaling ML. Without it, enterprises risk struggling with costly overhead and stalled progress. Several vendors have emerged with offerings to support MLOps: the major offerings are Microsoft Azure ML and Google Vertex AI. We looked at these offerings from the perspective of enterprise features and time-to-value.
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...DATAVERSITY
This document discusses the importance of data observability for improving data quality. It begins with an introduction to data observability and how it works by continuously monitoring data to detect anomalies and issues. This is unlike traditional reactive approaches. Examples are then provided of how unexpected data values or volumes could negatively impact downstream processes but be resolved quicker with data observability alerts. The document emphasizes that data observability allows issues to be identified and addressed before they become costly problems. It promotes data observability as a way to proactively improve data integrity and ensure accurate, consistent data for confident decision making.
202406 - Cape Town Snowflake User Group - LLM & RAG.pdfDouglas Day
Content from the July 2024 Cape Town Snowflake User Group focusing on Large Language Model (LLM) functions in Snowflake Cortex. Topics include:
Prompt Engineering.
Vector Data Types and Vector Functions.
Implementing a Retrieval
Augmented Generation (RAG) Solution within Snowflake
Dive into the details of how to leverage these advanced features without leaving the Snowflake environment.
Do People Really Know Their Fertility Intentions? Correspondence between Sel...Xiao Xu
Fertility intention data from surveys often serve as a crucial component in modeling fertility behaviors. Yet, the persistent gap between stated intentions and actual fertility decisions, coupled with the prevalence of uncertain responses, has cast doubt on the overall utility of intentions and sparked controversies about their nature. In this study, we use survey data from a representative sample of Dutch women. With the help of open-ended questions (OEQs) on fertility and Natural Language Processing (NLP) methods, we are able to conduct an in-depth analysis of fertility narratives. Specifically, we annotate the (expert) perceived fertility intentions of respondents and compare them to their self-reported intentions from the survey. Through this analysis, we aim to reveal the disparities between self-reported intentions and the narratives. Furthermore, by applying neural topic modeling methods, we could uncover which topics and characteristics are more prevalent among respondents who exhibit a significant discrepancy between their stated intentions and their probable future behavior, as reflected in their narratives.
Do People Really Know Their Fertility Intentions? Correspondence between Sel...
Data Quality Best Practices
1. Copyright Global Data Strategy, Ltd. 2021
Data Quality Best Practices
Donna Burbank and Nigel Turner
Global Data Strategy, Ltd.
August 26th, 2021
Follow on Twitter @donnaburbank, @nigelturner8
@GlobalDataStrat
Twitter Event hashtag: #DAStrategies
2. Global Data Strategy, Ltd. 2021
Donna Burbank
2
• Recognized industry expert in information
management with over 25 years of
experience in data strategy, information
management, data modeling, metadata
management, and enterprise architecture
• Managing Director at Global Data Strategy,
Ltd., an international information
management consulting company that
specializes in the alignment of business
drivers with data-centric technology
• Worked with dozens of Fortune 500
companies worldwide in the Americas,
Europe, Asia, and Africa and speaks
regularly at industry conferences
• Excellence in Data Management Award
from DAMA International
• Past President and Advisor to the DAMA
Rocky Mountain chapter
• Co-author of several books on data
management
• Regular contributor to industry
publications
• She can be reached at
donna.burbank@globaldatastrategy.com
Donna is based in Boulder, Colorado, US
Follow on Twitter @donnaburbank
@GlobalDataStrat
3. Global Data Strategy, Ltd. 2021
Nigel Turner
• Worked in Information Management
(IM) and related areas for over 25
years. Experience has embraced Data
Governance, Information Strategy,
Data Quality, Data Governance, Master
Data Management & Business
Intelligence.
• Spent much of his career in British
Telecommunications Group (BT)
where he led a series of enterprise-
wide IM & data governance initiatives.
• Also been VP of Information
Management Strategy at Harte Hanks
Trillium Software, and Principal
Consultant at FromHereOn and IPL.
• Nigel is very active in professional Data
Management organizations and is an
elected Data Management Association
(DAMA) UK Committee member.
• He was the joint winner of DAMA
International’s 2015 Community Award
for the work he initiated and led in
setting up a mentoring scheme in the
UK where experienced DAMA
professionals coach and support newer
data management professionals.
• Nigel is based in Cardiff, Wales, UK.
Follow on Twitter @NigelTurner8
Today’s hashtag: # DAStrategies
4. Global Data Strategy, Ltd. 2021
DATAVERSITY Data Architecture Strategies
• January Emerging Trends in Data Architecture – What’s the Next Big Thing?
• February Building a Data Strategy - Practical Steps for Aligning with Business Goals
• March Data Modeling Case Study – Business Data Modeling at Kiewit
• April Master Data Management – Aligning Data, Process, and Governance
• May Data Architecture, Solution Architecture, Platform Architecture – What’s the Difference?
• June Enterprise Architecture vs. Data Architecture
• July Best Practices in Metadata Management
• August Data Quality Best Practices (with guest Nigel Turner)
• September Data Modeling Techniques
• October Data Governance: Aligning Technical & Business Approaches
• December Data Architecture for Digital Transformation
4
This Year’s Lineup
5. Global Data Strategy, Ltd. 2021 5
What We’ll Cover Today
• Tackling data quality problems requires more than a
series of tactical, one off improvement projects.
• By their nature, many data quality problems extend
across and often beyond an organization.
• Addressing these issues requires a holistic architectural
approach combining people, process and technology.
6. Global Data Strategy, Ltd. 2021
Agenda
6
• Discuss how to deliver data quality improvements in the Baseline & Develop
phases of the A2E methodology
• Highlight the critical role of Business Rules in improving Data Quality
• Illustrate why getting Business Rules right is critical
• Outline how to use Business Rules to correct poor data quality and sustain
improved data quality
7. Global Data Strategy, Ltd. 2021 7
A Successful Data Strategy links Business Goals with Technology Solutions
“Top-Down” alignment with
business priorities
“Bottom-Up” management &
inventory of data sources
Managing the people, process,
policies & culture around data
Coordinating & integrating
disparate data sources
Leveraging & managing data for
strategic advantage
Data Quality is Part of a Wider Data Strategy
www.globaldatastrategy.com
8. Global Data Strategy, Ltd. 2021
Tackling Data Quality: the A2E approach
8
Assess
Baseline
Converge
Develop
Evaluate
Cycle of Continuous
Data Quality Improvement
Step Purpose
Assess Business
Usage
Understand what data exists and how it is used
within the organization
Baseline Data
Sources
Baseline the current quality of the data and
assess how well it is meeting business needs
Converge on
Business Critical Areas
Focus priorities to optimise early business
benefits and set ‘fit for purpose’ quality targets
to guide improvement activities
Develop
Improvements
Design & deploy improvement initiatives
(encompassing people, process, and technology)
and measure the impact against targets
Evaluate Benefits &
ROI
Regularly measure the data and continue to
improve it so that it continues to meet current
and future business needs
9. Global Data Strategy, Ltd. 2021
Data Quality Improvement: The Importance of Business Rules
9
”A Business Rule is a criterion
used to guide day-to-day
business activity, shape
operational business judgments,
or make operational business
decisions.”
Ronald Ross, quoted in
architectureandgovernance.com
• In a data context, business rules are used to define and
enforce the standards that data must conform to
• Have a key role in assessing, baselining and improving data
quality
• Can be used to:
• Cleanse and enhance existing data
• Become standards which new data must conform to
• Guide data design in new developments
• Enforce data standards in existing applications and platforms
• Stop poor quality data being entered at source, e.g. via drop
down lists, screen entry validation etc.
10. Global Data Strategy, Ltd. 2021
How Do You Classify Business Rules?
• Many different ways to classify business rules – can be very complex
• A simple classification is:
10
FORMAT BUSINESS RULES CONTENT BUSINESS RULES
Specify the format standards data
should comply with
Include:
• Field length
(fixed, variable etc.)
• Character format
(e.g. Alphabetic, Numeric,
Alphanumeric etc.)
Specify the allowable content
of records or fields
Include:
• Allowable values
• Whether mandatory or
optional
• Relationships with other
fields or records
11. Global Data Strategy, Ltd. 2021
Example Data Related Business Rules
11
FORMAT RULES
• A UK National Insurance Number must be in the format: aa nn nn nn a
• An employee must have a unique Employee ID in the format: aa nnnn
• Date of birth should be in North American format of MM/DD/YYYY
• A full US zip code must be in the format nnnnn-nnnn
• Internet router identifier must be in the format Aaa_Nan_Naa
12. Global Data Strategy, Ltd. 2021
Example Data Related Business Rules
12
CONTENT RULES
• Every Sales Representative must be assigned to one and only one Sales Region
• A valid email address must be entered by a customer to enable a customer’s
order to be accepted
• Gender codes must have the valid value of Male, Female or Unknown
• A supplier must have at least one associated geographical address
• Product Price should be Product Unit Cost + 25%
CONTENT
13. Global Data Strategy, Ltd. 2021
How Do You Identify Business Rules?
• Business rules can be discovered or derived from:
• Data models (Business / Logical / Physical)
• Business documentation (e.g. Process Descriptions, User Instructions)
• IT Documentation (e.g. requirements specifications, system manuals)
• Source code (e.g. If ‘A Then B’ statements)
• Master and / or Reference Data Sources (e.g. currency codes, product
master data)
• Documented metadata (e.g. Business Glossaries, Data Dictionaries,
Metadata Repositories)
• Data profiling outputs
• Talking to key stakeholders:
• Data owners and data stewards (if in place)
• Data producers and consumers
• Other business and IT subject matter experts
13
VITAL IMPORTANCE OF STAKEHOLDER
ENGAGEMENT:
• Business rules are frequently implicit (i.e. locked
in people’s heads) and not formally documented
• Where business rules are documented,
documentation is often out of date and not
updated in line with system changes
14. Global Data Strategy, Ltd. 2021
Data Models Describe the Organization
• Relationships define the data-centric Business Rules of an organization
• You should be able to “read” a data model like a sentence
• The Entities / Concepts are the “nouns” – the boxes on a data model
• It’s often helpful to start by taking some text describing the organization (or transcripts
from stakeholder interviews) and draw boxes around the nouns to find the core entities
• An employee can work for more than one department.
• A customer can have more than one account.
• A department can contain more than one employee.
Customer
Employee
Account
Department
14
BUSINESS
RULES
15. Global Data Strategy, Ltd. 2021
Deriving Business Rules: Business Data Model
• A business data
model provides
core definitions
of key data
objects.
• It also shows key
relationships
between data
objects.
• Even a simple
diagram as the
one on the right
can tell a
powerful “story”
…. And
uncover key
business rules
• Communication & definition of core data concepts & their definitions
BUSINESS RULE:
A COMPANY must
contain 1 or more
customers with an
active account
BUSINESS RULE:
An EMPLOYEE must be
on the active payroll
BUSINESS RULE:
A CUSTOMER is a
current or former client
who must have had an
account active within
the last 6 months
16. Global Data Strategy, Ltd. 2021 16
REAL
QUALITY
DATA
LIFE
STORIES
HORROR
2021
When Business Rules Go Wrong or Go Missing
17. Global Data Strategy, Ltd. 2021
Why Do Business Rules Matter? DQ ‘Short’comings
• Liam Thorp made headline news in the UK in Feb 2021
• Received a priority invite for a Covid-19 vaccination because
he was medically classed as ‘morbidly obese’
• The reason – his local health board had recorded his height as
6.2 centimetres and not his real height of 6 feet 2 inches
• This made his Body Mass Index (BMI) 28,000, calculated by his
weight / height ratio
• A BMI of 40 and above is classed as ‘morbidly obese’
• Now corrected, and he was put back in his rightful place in the
vaccine queue
17
Liam Thorp
32 years old
Liverpool
resident
“I can see the funny
side of this story but
also recognise there is
an important issue for
us to address”
Chair of the Liverpool
Clinical Commissioning
Group (leading the city’s
vaccine roll out)
Beatles statue
City of Liverpool
KEY PROBLEM - ABSENCE
OF BUSINESS RULES TO
SPECIFY:
• Minimum Height
• Maximum BMI
(Content)
18. Global Data Strategy, Ltd. 2021
Why Do Business Rules Matter? ‘Miss’ing weight
• UK Air Accidents Investigation Branch (AAIB) report (April 2021)
declared a ‘Serious Incident’ at Birmingham airport, UK
• Report highlighted that 3 flights to Europe in July 2020 had taken off with
the weight of the plane load underestimated by an average 1,200kg
• This miscalculation could have caused a ‘serious incident’ on take off as it
determines take off speed, thrust etc.
• Problem happened because all passengers with the title ‘Miss’ were
automatically assumed by outsourced IT suppliers to be children and not
adults
• A child’s standard estimated weight is 35kg; an adult 69kg
• The airline described it as ‘ a simple flaw in its IT system’
• In reality, there was a serious problem with its business rules!
• The airline has now introduced manual validation of all passengers at
check in to ensure adults titled ‘Miss’ are changed to ‘Ms’ on the
passenger roster (?)
KEY PROBLEMS:
• Reliance on IT, and not the business,
to specify the business rules
• Making cultural assumptions that
were incorrect
19. Global Data Strategy, Ltd. 2021
Four Step Process: Using Business Rules for Data Quality Improvement
19
STEP 1:
Profile
data
sources
STEP 2:
Agree
priority DQ
problems &
design
Business
Rules
STEP 3:
Deploy
Business
Rules
STEP 4:
Monitor &
report
adherence
to Business
Rules
CYCLE OF CONTINUOUS
DATA QUALITY
IMPROVEMENT
20. Global Data Strategy, Ltd. 2021
Step 1: Quantifying Data Problems - The Value of Data Profiling
20
• The benefits of data profiling include:
• Checks conformance of the dataset with
business rules
• Enables fact-based discussion of the causes and
impacts of data problems
• Great starting point for Data Quality
improvement workshops
• Automatic generation of metadata
• Supports both data quality focus &
improvement and metadata capture
• Data profiling tools automate the process
of assessing and reporting on the quality
of data sources
• Data profiling can also be done via SQL,
without purchasing a tool
Example partial Data Profiling report
21. Global Data Strategy, Ltd. 2021
Step 1: An Alternative Approach to Quantifying Data Problems
21
Source:
Only 3% of Companies’ Data
Meets Basic Quality Standards
Tadhg Nagle, Thomas C. Redman
& David Sammon
Harvard Business Review
September 11 2017
21
22. Global Data Strategy, Ltd. 2021
EMPLOYEE NO SURNAME FIRST NAME GENDER DATE OF BIRTH
ROLE
CODE
802540 Smith Brian Female 31/01/56 PM16
YN4176B Gregg Male 07/09/80 9999
811609 Patel Priya XXXX 25/12/78 AL60
22298 Bothroyd Bridget Female 28/08/09 TBD
802540 Smith Bryan Male 31/01/56 PM10
855265 Hayes Leslie Female 00/00/00 AL76
Taylor Kevin Unknown 12/30/69 US18
22
Note: Records extracted and anonymized from an actual HR database
Step 1: Data Profiling & Potential Data Quality Problem Identification
23. Global Data Strategy, Ltd. 2021
EMPLOYEE NO SURNAME FIRST NAME GENDER
DATE OF
BIRTH
ROLE CODE
802540 Smith Brian Female 31/01/56 PM16
YN4176B Gregg Male 07/09/80 9999
811609 Patel Priya XXXX 25/12/78 AL60
22298 Bothroyd Bridget Female 28/08/09 TBD
802540 Smith Bryan Male 31/01/56 PM10
855265 Hayes Leslie Female 00/00/00 AL76
Taylor Kevin Unknown 12/30/69 US18
ANSWER: Total number of potential Data Quality problems is 13 or 19, depending on
whether Smith is a duplicate record
23
23
Step 1: Data Profiling & Potential DQ Problem Identification
Key:
Potential
Duplicate
Record
Potential
Data Quality
Problem
24. Global Data Strategy, Ltd. 2021
Step 2: Business Review & Validation
• Data profiling findings should be reviewed by appropriate business & IT
stakeholders
• If formal Data Governance in place, this should ideally led by the Data Stewards
responsible for the specific data domains
• Aim to reach consensus on what the business impact is
• Ways of doing this:
• Workshops and / or meetings (virtual or F2F)
• By workflows, seeking views on the potential problem areas
• For priority areas, agree Business Rules which should be in place to drive and
enforce data quality improvement
• Create and deploy Business Rules
• Test rules first in case of unforeseen downstream impacts
• Embed in appropriate operational systems or Data Quality Rules Engine (see later)
24
25. Global Data Strategy, Ltd. 2021
Step 3: Using Business Rules to steer and enforce Data Quality standards
25
Example potential format
business rules
Example potential
content business rules
Employee No. must be in format
nnnnnn. Blank Employee Numbers
are allowed if new starter awaiting
Emp. No. allocation
Gender should align with First
Name derived from Common
Names Reference file
First Name must not be blank Allowable Genders are FEMALE,
MALE, SELF-DETERMINED or
UNKNOWN
Role code must be in format AAnn Date of Birth must be expressed
as DD/MM/YY and in the range
01/01/1940 to 12/12/2005
Date of Birth must be in format
nn/nn/nn
Employee No. should be unique.
Only one Emp. No. should be
allocated to any individual
employee
26. Global Data Strategy, Ltd. 2021
Step 3: Deploying Business Rules - Approaches
26
Data Quality Tool:
DQ Business Rules
Engine
Master & Reference
Data Management
Application Code
(e.g. data input
validation)
Data Entry
Guidelines,
Business Glossary
& Training
27. Global Data Strategy, Ltd. 2021
Step 3: Automating Data Quality Business Rules via a DQ Rules Engine
DATA
INPUT
DATA
WAREHOUSE
STAGING / ETL
LAYER
SOURCE
SYSTEMS
REPORTING
LAYER
DATA
MARTS
Real Time Data Validation
Batch
Validation
DATA QUALITY
RULES ENGINE
28. Global Data Strategy, Ltd. 2021
Step 4: Monitor & Report Adherence
• When Business Rules are implemented can be used to:
• Check continued adherence of existing data
• Enforce the rules on new data to prevent new problems
• Best monitored via Data Quality Dashboards
• Provide regular reports on adherence of data to Business Rules
• Set KPIs to drive continuous data improvement
• Identify data quality trends
• Highlight areas where corrective action required
• Indicate where / if Business Rules may need to be amended to
meet changing business needs
• When reporting always try to relate data quality to business
outcomes
• Address the ‘so what’ objection
• Puts a financial or other benefit on continued data quality
improvement
28
Data Quality Dashboard
29. Global Data Strategy, Ltd. 2021
Summary
• Business Rules are key to uncovering data quality
problems and driving data quality improvement
• Business Rules can be explicit or implicit so have to be
discovered and created in a variety of ways
• Follow the simple 4 Step process outlined to ensure you
optimize the value of Business Rules in your data quality
initiatives
• Remember that Business Rules are not set in stone and
need to be monitored and amended in line with changing
organizational needs and requirements
• With data quality the business always ultimately rules, so
Business Rules provide the means to enable this
29
30. Global Data Strategy, Ltd. 2021
Who We Are: Business-Focused Data Strategy
Maximize the Organizational Value of Your Data Investment
In today’s business environment, showing rapid time to value for
any technical investment is critical.
But technology and data can be complex. At Global Data Strategy,
we help demystify technical complexity to help you:
• Demonstrate the ROI and business value of data to your
management
• Build a data strategy at your pace to match your unique culture
and organizational style.
• Create an actionable roadmap for “quick wins”, which building
towards a long-term scalable architecture.
Global Data Strategy’s shares experience from some of the largest
international organizations scaled to the pace of your unique team.
www.globaldatastrategy.com
Global Data Strategy has worked with organizations globally in the
following industries:
Finance · Retail · Social Services · Health Care · Education · Manufacturing
· Government · Public Utilities · Construction · Media & Entertainment ·
Insurance …. and more
31. Global Data Strategy, Ltd. 2021
DATAVERSITY Data Architecture Strategies
• January Emerging Trends in Data Architecture – What’s the Next Big Thing?
• February Building a Data Strategy - Practical Steps for Aligning with Business Goals
• March Data Modeling Case Study – Business Data Modeling at Kiewit
• April Master Data Management – Aligning Data, Process, and Governance
• May Data Architecture, Solution Architecture, Platform Architecture – What’s the Difference?
• June Enterprise Architecture vs. Data Architecture
• July Best Practices in Metadata Management
• August Data Quality Best Practices (with guest Nigel Turner)
• September Data Modeling Techniques
• October Data Governance: Aligning Technical & Business Approaches
• December Data Architecture for Digital Transformation
31
This Year’s Lineup