尊敬的 微信汇率:1円 ≈ 0.046239 元 支付宝汇率:1円 ≈ 0.04633元 [退出登录]
SlideShare a Scribd company logo
`
Enterprise Data Catalog
Workshop
Data
is the Critical Foundation
Improve Customer
Engagement
Improve Patient
Outcomes
Reduce Fraud
Reduce
Compliance Risk
© Informatica. Proprietary and Confidential.33
Governance
& Compliance
Because data drives all digital transformation priorities…
Self-Service /
Advanced Analytics
Cloud
Migration
Customer
Experience
© Informatica. Proprietary and Confidential.44
Governance
& Compliance
…And organization rely on data to drive digital transformation
Self-Service /
Advanced Analytics
Cloud
Migration
Customer
Experience
Discovery of critical enterprise
data provided foundation for data
governance program
Global Financial
Services Company
Flipped the 80-20% rule for finding
data to analyzing effort for data
analysts
Enterprise-scale metadata
understanding allows them to
simplify cloud migration projects
Opening up data visibility is fueling
development of new services and
improving quality of patient care
Regional Health
Care Company
Large National
Retailer
Large
Shipping Company
5 © Informatica. Proprietary and Confidential.5
An AI-Powered data catalog is essential to digital transformation
Share your
data knowledge
Understand &
trust your data
Catalog ALL
enterprise data
6 © Informatica. Proprietary and Confidential.6 © Informatica. Proprietary and Confidential.
Data
Steward
How can I manage
metadata for key
enterprise data
assets?
How do I assess
and manage data
quality through the
lifecycle?
Data
Governance Office
How can we
validate and
enforce our data
governance
policies and
definitions?
How can I ensure
data managed
within application
and supporting
processes deliver
value to the
business?
Data
Owner
How can I
discover,
understand and
trust data required
for my analysis?
Data
Consumer
How can IT enable
business discover
data assets with
verified data
quality and
traceability?
Data
Architect
TechnicalBusiness
The questions a data catalog solves
7 © Informatica. Proprietary and Confidential.7 © Informatica. Proprietary and Confidential.7 © Informatica. Proprietary and Confidential.
Enterprise Data Catalog
• Find the data you need with simple,
powerful semantic search
• Understand your enterprise data
with a holistic view
• AI-powered automatic discovery,
classification and business context
• Comprehensive metadata
connectivity for all enterprise data
• Big Data scale and flexible
deployment options
• Open framework for custom
extensions and integrations
8 © Informatica. Proprietary and Confidential.8 © Informatica. Proprietary and Confidential.8 © Informatica. Proprietary and Confidential.
Enterprise Data Catalog
Broad
Metadata
Sources
• Technical
• Operational
• Usage
Business
Context
• Glossary
• Policies
• Process
Wisdom
of Crowd
• Comments
• Ratings
• Behavior Knowledge Graph
Business & Crowd
Sourced Curation
AI Curated Catalog
Enterprise Data Catalog
Data Governance
[Data Stewards, Data Architects]
• Associate Business glossary to
technical objects
• Verify business to technical lineage
• Track key data elements compliance
Self Service Analytics
[Data Analysts, Data Scientists]
• Google for enterprise data assets
• Data Lineage, holistic relationship view
• Trust with data profile
• Access to data
Data Asset Management
[Architects, Developers]
• Analyze column-level Lineage &
Change Impact
• View transformation Logic
• Data asset and BI usage
Structure Discovery, Profiling
and Domain Discovery,
Similarity Clustering,
Recommendations
Business Glossary
Associations, Business
Classifications, Annotations,
Comments
9 © Informatica. Proprietary and Confidential.9
Smart Discovery
Semantic Search
Search datasets using business terminology, synonyms,
across related objects and data flows. Example: when
you type “grade”, EDC can suggest “tier” aligning with the
terminology used in the organization.
Data Profiling and Domain Discovery
View data profiling statistics alongside data assets to
understand data quality before using data for analysis.
Profiling statistics include value distributions, patterns,
and data type and data domain inference. Example: you
can search for “tables with emails” to get all tables that
have email information, regardless of the underlying
column names
Facets
Intelligent facets, based on the search results, allow
users to narrow the search to the data sets of interest.
Facets include both system and custom classifications
10 © Informatica. Proprietary and Confidential.
Automatic Data Lineage
…Upstream and Downstream
• Lineage Discovery from:
- Informatica Big Data Management
- Cloudera Navigator
- HW Atlas
- HiveQL
- + Other Enterprise Sources
- Informatica Powercenter
- Microsoft SSIS, Datastage, SAP BO,
Oracle Data Integrator
- BI
11 © Informatica. Proprietary and Confidential.
Relationship Discovery
360 Relationships View
Get a 360-degree view of data in a knowledge graph that lets you quickly
search, discover, and understand enterprise data and meaningful data
relationships. Automatically discover related data sets, technical,
business, semantic and usage based relationships.
Data Lineage
Interactively trace data origin through business-friendly summarized
lineage views that highlight the end points and all the complex details in
between. A drill-down lineage view expands any lineage path to show
columns and lineage diagram metrics.
Impact Analysis
Perform detailed impact analysis on upstream and downstream data
assets. See impact across data asset, resources and users.
12 © Informatica. Proprietary and Confidential.
But cataloging and curating
all the enterprise data doesn’t
sound easy!!
Enterprise
Unified
Metadata
Usage Operational
Technical Business
PowerCenter | DQ
MDM | BDM | DIH
BG | ILM | Axon | Informatica
Cloud
Informatica
Oracle | DB2 | DB2 for z/OS
SQL Server | Sybase | Teradata
Netezza | JDBC | SQL Scripts |
SAP HANA | Stored Procedures
Databases
SAP R/3 | Salesforce
Oracle | Workday
Applications
HIVE (Cloudera, Hortonworks, MapR,
IBM BigInsights, EMR, HDI)
HDFS | MapRFS |
Cloudera Navigator | Atlas
Big Data
AWS S3 | AWS Redshift | Azure
SQL DB | Azure SQL DW | Azure
ADLS | Azure Blob | Google
BigQuery | ADLS Gen 2
Cloud Platforms
CSV | Delimited | XML | JSON |
Avro | Parquet | MS Excel |
Adobe PDF | Flat File | MS
PowerPoint | MS Word
File Formats
Tableau | IBM Cognos |
SAP BusinessObjects
MicroStrategy | OBIEE
Business Intelligence
Microsoft SSIS | Erwin Models |
PowerDesigner | Oracle Data
Integrator | IBM DataStage |
Custom Scanner Framework
Other
Enterprise
Data
Catalog
=
CLAIRE
The Intelligence in the Intelligent Data Platform
Simplify Data
Stewardship
Automate Integration and
data migration
Amplify Analytics
Spotlight Data Risk
Accelerate Governance
Improve Operations
Metadata-driven
Artificial Intelligence
17 © Informatica. Proprietary and Confidential.
Smart Domains
Uses Column Similarity
Like photo tagging CLAIRE for Columns
Clustering based on column
metadata and Jaccard
Coefficient and Bray Curtis
Similarity
Column similarity based on
data overlap. Large overlap of
distinct values
Similar value frequencies for
overlapping columns
18 © Informatica. Proprietary and Confidential.
True Type Discovery
…for Entities and Elements
Intelligently recommends other
data sets that are similar to what
they are working on
Discovers data domains
(name, phone, email…) and data
entities (purchase order,
health record…)
Automatically tags data by
learning from users tagging fields
and columns, etc.
19 © Informatica. Proprietary and Confidential.
…Across Structured, Semi-Structured and
Unstructured
Person(6)Location(3)Date(1) …
Passage from “Chapter 2: The Science of Deduction” from “A Study in Scarlet” by Arthur Conan Doyle
Support for rule based
data domains
Identifying and classifying
entities from both structured
and unstructured data
New unstructured sources
supported: Excel, Word, PDF, text
files, PPT and more formats
20 © Informatica. Proprietary and Confidential.
Open Metadata APIs
Access metadata knowledge graph with REST APIs
20 © Informatica. Proprietary and Confidential.
21 © Informatica. Proprietary and Confidential.21 © Informatica. Proprietary and Confidential.21 © Informatica. Proprietary and Confidential.
Intelligent Business Term Associations
• CLAIRETM powered automatic
association of business terms
with physical data
• Built on top of auto-data domain
discovery and data similarity
capabilities
• Uses NLP techniques to relate
business terms to field and
column names
• Reduces a tedious manual step in
data governance
Name Asset Type Business Term Recommendation
Mil_ID Column Military ID Number
Med_Nmbr Column Practitioner Medicare number
C_ADJ_KEY Column Claim Adjustment Key
NH_Plan_ID Column National Health Plan Identifier
DSCH_STAT_CD Column Discharge Status Code
NatEmp_ID Column National Employee Identifier
22 © Informatica. Proprietary and Confidential.22 © Informatica. Proprietary and Confidential.22 © Informatica. Proprietary and Confidential.
Dataset Certifications
• Data Owners, Data Stewards and
Subject Matter Experts can certify
datasets and data elements
• Includes certification comments
and keyword specification
• Search rankings prioritize certified
data assets
• New search facet for showing
certified datasets only in searches
Certify fit for use datasets
with right business context
to enable high quality
analytics
Data Steward
Use the most relevant and
trusted data for my analysis
Data Consumer
23 © Informatica. Proprietary and Confidential.23 © Informatica. Proprietary and Confidential.23 © Informatica. Proprietary and Confidential.
Ratings and Reviews – Yelp for Data
• Catalog users can rate data assets
and write reviews
• User ratings with summaries are
displayed for each dataset
• Reviewer or Data Owner can edit or
remove ratings
• Search across data asset user
reviews
• New search facet for showing
results by ratings
Find the most relevant and
trusted data for my analysis
using reviews provided by my
peers
Data Consumer
24 © Informatica. Proprietary and Confidential.24 © Informatica. Proprietary and Confidential.24 © Informatica. Proprietary and Confidential.
Question & Answer – Quora for Data Assets
• Questions can be asked by users
on the data assets
• Answers can be provided by any
user
• Users can mark answers helpful,
which moves the answer up in the
default view
• Question & Answer text is indexed
to help with data asset search
ranking
No more multiple emails and
phone calls for the same
queries on data
Subject Matter Expert
Find experts, ask questions
and get timely answers in the
context of the dataset
Data Consumer
25 © Informatica. Proprietary and Confidential.25 © Informatica. Proprietary and Confidential.25 © Informatica. Proprietary and Confidential.
Change Notifications
• Follow data assets of interest, get
notified of changes
• Follow specific changes to objects
- source changes, enrichments,
collaboration updates
• Watch for changes at individual
asset level or for an entire resource
• Change notifications sent via in
app notification center, event email,
periodic digest email
• New tab at Resource level for a
summary of all changes
Get notified when new
assets are introduced to
apply data standards and
associate business
context
Data Steward
Manage change impact
by tracking changes at
the data source level
DB Admin
Stay on top of changes
to important datasets
and reports
Data Consumer
Lesson #1
Data Catalog Basics
27 © Informatica. Proprietary and Confidential.27
• Duration: 20 minutes
• In this lesson, you will learn the basics of the data catalog. You will learn to search for relevant
data assets using search and dynamic faceting capabilities and explore data assets.
• In this data catalog basics lesson, you will accomplish the following tasks:
• Use the search feature provided by Enterprise Data Catalog to search for an asset and understand the asset details.
• Verify the profiling information displayed for the asset in the Asset Details tab to ensure the quality of data, understand data domains
and similar columns
• Understand the flow of data using Lineage and Impact
• Understand the relationships of the asset
• Read reviews, ratings, or comments on the asset and ask question about the asset.
Lesson#1 The Basics of Enterprise Data Catalog
Lesson #2
Data Catalog for
Self-Service Analytics
29 © Informatica. Proprietary and Confidential.29
Duration: 20 minutes
As a data analyst, you need analyze the customer order report to understand the elements that make up the report as there
are concerns with the report.
In this lesson you will learn how to:
• Search for the customer order report
• Understand how the report was created
• Review and provide feedback on the customer order details table based on different aspects of the asset
• Ask question about the customer order report
• Follow an asset to be informed about the changes that are made to the asset
• Additionally (Optionally)
• In this lesson, you will learn how the Catalog automatically classifies data based on known domains. You will also learn how you can annotate datasets
to further classify data assets along multiple dimensions.
• Search Classified Columns
• Data Domain Overview
• Annotate Data Assets
Lesson #2 Data Catalog for Self-Service Analytics
Lesson #3
Data Catalog for
Data Asset Management
31 © Informatica. Proprietary and Confidential.31
Lesson #3: Data Asset Management
Duration: 20 minutes
In this lesson, you will learn how to use the new drill down lineage views in the
catalog to visualize data provenance. You will also learn how to use the detailed
impact analysis reports in the catalog to understand impact due to change in data
assets or ETL flows.
Objectives
• Understand Drill Down Lineage Views in the Catalog
• Perform Impact Analysis on Data Assets
• Understand Data Domain Curation
Lesson #4
Data Catalog for
Data Governance
33 © Informatica. Proprietary and Confidential.33
Lesson #4: Data Catalog for Data Governance (optional)
Duration: 15 minutes
In this lesson, you will also learn how to associate business terminology with
technical assets. Finally, you will use EDC’s discovery features to search for
technical assets using business vocabulary.
Objectives
• Associate Business Terms with Technical Metadata
• Explore Business Metadata in the Catalog
• Search technical assets using business glossary
Wrap Up
``
Thank You!

More Related Content

What's hot

Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
DATAVERSITY
 
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DATAVERSITY
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management
DATAVERSITY
 
Data Governance
Data GovernanceData Governance
Data Governance
Rob Lux
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
DATAVERSITY
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
Data Catalog as a Business Enabler
Data Catalog as a Business EnablerData Catalog as a Business Enabler
Data Catalog as a Business Enabler
Srinivasan Sankar
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
DataScienceConferenc1
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
Denodo
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture Deliverables
Lars E Martinsson
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data Quality
DATAVERSITY
 
Data Quality & Data Governance
Data Quality & Data GovernanceData Quality & Data Governance
Data Quality & Data Governance
Tuba Yaman Him
 
How to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data QualityHow to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data Quality
DATAVERSITY
 
The Five Pillars of Data Governance 2.0 Success
The Five Pillars of Data Governance 2.0 SuccessThe Five Pillars of Data Governance 2.0 Success
The Five Pillars of Data Governance 2.0 Success
DATAVERSITY
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
Data Governance Powerpoint Presentation Slides
Data Governance Powerpoint Presentation SlidesData Governance Powerpoint Presentation Slides
Data Governance Powerpoint Presentation Slides
SlideTeam
 
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
DATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 

What's hot (20)

Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Catalog as a Business Enabler
Data Catalog as a Business EnablerData Catalog as a Business Enabler
Data Catalog as a Business Enabler
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture Deliverables
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data Quality
 
Data Quality & Data Governance
Data Quality & Data GovernanceData Quality & Data Governance
Data Quality & Data Governance
 
How to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data QualityHow to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data Quality
 
The Five Pillars of Data Governance 2.0 Success
The Five Pillars of Data Governance 2.0 SuccessThe Five Pillars of Data Governance 2.0 Success
The Five Pillars of Data Governance 2.0 Success
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Data Governance Powerpoint Presentation Slides
Data Governance Powerpoint Presentation SlidesData Governance Powerpoint Presentation Slides
Data Governance Powerpoint Presentation Slides
 
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 

Similar to Why an AI-Powered Data Catalog Tool is Critical to Business Success

AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...
AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...
AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...
Amazon Web Services
 
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
Amazon Web Services
 
Empowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsEmpowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog Requirements
Precisely
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DATAVERSITY
 
Modern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail BankingModern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail Banking
Cambridge Semantics
 
What Data Do You Have and Where is It?
What Data Do You Have and Where is It? What Data Do You Have and Where is It?
What Data Do You Have and Where is It?
Caserta
 
Achieving a Single View of Business – Critical Data with Master Data Management
Achieving a Single View of Business – Critical Data with Master Data ManagementAchieving a Single View of Business – Critical Data with Master Data Management
Achieving a Single View of Business – Critical Data with Master Data Management
DATAVERSITY
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
Caserta
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
Databricks
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo
 
ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?
ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?
ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?
Albert Hoitingh
 
Kaizentric Presentation
Kaizentric PresentationKaizentric Presentation
Kaizentric Presentation
Azhagarasan Annadorai
 
Introducing Trillium DQ for Big Data: Powerful Profiling and Data Quality for...
Introducing Trillium DQ for Big Data: Powerful Profiling and Data Quality for...Introducing Trillium DQ for Big Data: Powerful Profiling and Data Quality for...
Introducing Trillium DQ for Big Data: Powerful Profiling and Data Quality for...
Precisely
 
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
Big Data Week
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Denodo
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
CCG
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
Cambridge Semantics
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AI
Gary Allemann
 
Apps
AppsApps
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
DATAVERSITY
 

Similar to Why an AI-Powered Data Catalog Tool is Critical to Business Success (20)

AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...
AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...
AWS Summit Singapore - Accelerate Digital Transformation through AI-powered C...
 
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
Accelerate Digital Transformation Through AI-powered Cloud Analytics Moderniz...
 
Empowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsEmpowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog Requirements
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
 
Modern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail BankingModern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail Banking
 
What Data Do You Have and Where is It?
What Data Do You Have and Where is It? What Data Do You Have and Where is It?
What Data Do You Have and Where is It?
 
Achieving a Single View of Business – Critical Data with Master Data Management
Achieving a Single View of Business – Critical Data with Master Data ManagementAchieving a Single View of Business – Critical Data with Master Data Management
Achieving a Single View of Business – Critical Data with Master Data Management
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
 
ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?
ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?
ExpertsLive NL 2022 - Microsoft Purview - What's in it for my organization?
 
Kaizentric Presentation
Kaizentric PresentationKaizentric Presentation
Kaizentric Presentation
 
Introducing Trillium DQ for Big Data: Powerful Profiling and Data Quality for...
Introducing Trillium DQ for Big Data: Powerful Profiling and Data Quality for...Introducing Trillium DQ for Big Data: Powerful Profiling and Data Quality for...
Introducing Trillium DQ for Big Data: Powerful Profiling and Data Quality for...
 
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AI
 
Apps
AppsApps
Apps
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 

Recently uploaded

Independent Call Girls In Kolkata ✔ 7014168258 ✔ Hi I Am Divya Vip Call Girl ...
Independent Call Girls In Kolkata ✔ 7014168258 ✔ Hi I Am Divya Vip Call Girl ...Independent Call Girls In Kolkata ✔ 7014168258 ✔ Hi I Am Divya Vip Call Girl ...
Independent Call Girls In Kolkata ✔ 7014168258 ✔ Hi I Am Divya Vip Call Girl ...
simmi singh
 
Introduction to Python and Basic Syntax.pptx
Introduction to Python and Basic Syntax.pptxIntroduction to Python and Basic Syntax.pptx
Introduction to Python and Basic Syntax.pptx
GevitaChinnaiah
 
Going AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applicationsGoing AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applications
Alina Yurenko
 
Lightning Talk - Ephemeral Containers on Kubernetes in 10 MInutes.pdf
Lightning Talk -  Ephemeral Containers on Kubernetes in 10 MInutes.pdfLightning Talk -  Ephemeral Containers on Kubernetes in 10 MInutes.pdf
Lightning Talk - Ephemeral Containers on Kubernetes in 10 MInutes.pdf
Natan Yellin
 
Accelerate your Sitecore development with GenAI
Accelerate your Sitecore development with GenAIAccelerate your Sitecore development with GenAI
Accelerate your Sitecore development with GenAI
Ahmed Okour
 
Call Girls Solapur ☎️ +91-7426014248 😍 Solapur Call Girl Beauty Girls Solapur...
Call Girls Solapur ☎️ +91-7426014248 😍 Solapur Call Girl Beauty Girls Solapur...Call Girls Solapur ☎️ +91-7426014248 😍 Solapur Call Girl Beauty Girls Solapur...
Call Girls Solapur ☎️ +91-7426014248 😍 Solapur Call Girl Beauty Girls Solapur...
anshsharma8761
 
Digital Marketing Introduction and conclusion
Digital Marketing Introduction and conclusionDigital Marketing Introduction and conclusion
Digital Marketing Introduction and conclusion
Staff AgentAI
 
The Ultimate Guide to Top 36 DevOps Testing Tools for 2024.pdf
The Ultimate Guide to Top 36 DevOps Testing Tools for 2024.pdfThe Ultimate Guide to Top 36 DevOps Testing Tools for 2024.pdf
The Ultimate Guide to Top 36 DevOps Testing Tools for 2024.pdf
kalichargn70th171
 
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdfSoftware Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
kalichargn70th171
 
Devops Tools Pratical Preparatório LPI
Devops Tools Pratical   Preparatório LPIDevops Tools Pratical   Preparatório LPI
Devops Tools Pratical Preparatório LPI
DborahDmaris
 
Microsoft-Power-Platform-Adoption-Planning.pptx
Microsoft-Power-Platform-Adoption-Planning.pptxMicrosoft-Power-Platform-Adoption-Planning.pptx
Microsoft-Power-Platform-Adoption-Planning.pptx
jrodriguezq3110
 
Orca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container OrchestrationOrca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container Orchestration
Pedro J. Molina
 
Call Girls Bangalore🔥7023059433🔥Best Profile Escorts in Bangalore Available 24/7
Call Girls Bangalore🔥7023059433🔥Best Profile Escorts in Bangalore Available 24/7Call Girls Bangalore🔥7023059433🔥Best Profile Escorts in Bangalore Available 24/7
Call Girls Bangalore🔥7023059433🔥Best Profile Escorts in Bangalore Available 24/7
manji sharman06
 
Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)
wonyong hwang
 
Extreme DDD Modelling Patterns - 2024 Devoxx Poland
Extreme DDD Modelling Patterns - 2024 Devoxx PolandExtreme DDD Modelling Patterns - 2024 Devoxx Poland
Extreme DDD Modelling Patterns - 2024 Devoxx Poland
Alberto Brandolini
 
Beginner's Guide to Observability@Devoxx PL 2024
Beginner's  Guide to Observability@Devoxx PL 2024Beginner's  Guide to Observability@Devoxx PL 2024
Beginner's Guide to Observability@Devoxx PL 2024
michniczscribd
 
Refactoring legacy systems using events commands and bubble contexts
Refactoring legacy systems using events commands and bubble contextsRefactoring legacy systems using events commands and bubble contexts
Refactoring legacy systems using events commands and bubble contexts
Michał Kurzeja
 
TheFutureIsDynamic-BoxLang-CFCamp2024.pdf
TheFutureIsDynamic-BoxLang-CFCamp2024.pdfTheFutureIsDynamic-BoxLang-CFCamp2024.pdf
TheFutureIsDynamic-BoxLang-CFCamp2024.pdf
Ortus Solutions, Corp
 
OpenChain Webinar - Open Source Due Diligence for M&A - 2024-06-17
OpenChain Webinar - Open Source Due Diligence for M&A - 2024-06-17OpenChain Webinar - Open Source Due Diligence for M&A - 2024-06-17
OpenChain Webinar - Open Source Due Diligence for M&A - 2024-06-17
Shane Coughlan
 

Recently uploaded (20)

Independent Call Girls In Kolkata ✔ 7014168258 ✔ Hi I Am Divya Vip Call Girl ...
Independent Call Girls In Kolkata ✔ 7014168258 ✔ Hi I Am Divya Vip Call Girl ...Independent Call Girls In Kolkata ✔ 7014168258 ✔ Hi I Am Divya Vip Call Girl ...
Independent Call Girls In Kolkata ✔ 7014168258 ✔ Hi I Am Divya Vip Call Girl ...
 
Introduction to Python and Basic Syntax.pptx
Introduction to Python and Basic Syntax.pptxIntroduction to Python and Basic Syntax.pptx
Introduction to Python and Basic Syntax.pptx
 
Going AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applicationsGoing AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applications
 
Lightning Talk - Ephemeral Containers on Kubernetes in 10 MInutes.pdf
Lightning Talk -  Ephemeral Containers on Kubernetes in 10 MInutes.pdfLightning Talk -  Ephemeral Containers on Kubernetes in 10 MInutes.pdf
Lightning Talk - Ephemeral Containers on Kubernetes in 10 MInutes.pdf
 
Accelerate your Sitecore development with GenAI
Accelerate your Sitecore development with GenAIAccelerate your Sitecore development with GenAI
Accelerate your Sitecore development with GenAI
 
Call Girls Solapur ☎️ +91-7426014248 😍 Solapur Call Girl Beauty Girls Solapur...
Call Girls Solapur ☎️ +91-7426014248 😍 Solapur Call Girl Beauty Girls Solapur...Call Girls Solapur ☎️ +91-7426014248 😍 Solapur Call Girl Beauty Girls Solapur...
Call Girls Solapur ☎️ +91-7426014248 😍 Solapur Call Girl Beauty Girls Solapur...
 
Digital Marketing Introduction and conclusion
Digital Marketing Introduction and conclusionDigital Marketing Introduction and conclusion
Digital Marketing Introduction and conclusion
 
The Ultimate Guide to Top 36 DevOps Testing Tools for 2024.pdf
The Ultimate Guide to Top 36 DevOps Testing Tools for 2024.pdfThe Ultimate Guide to Top 36 DevOps Testing Tools for 2024.pdf
The Ultimate Guide to Top 36 DevOps Testing Tools for 2024.pdf
 
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdfSoftware Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
 
bgiolcb
bgiolcbbgiolcb
bgiolcb
 
Devops Tools Pratical Preparatório LPI
Devops Tools Pratical   Preparatório LPIDevops Tools Pratical   Preparatório LPI
Devops Tools Pratical Preparatório LPI
 
Microsoft-Power-Platform-Adoption-Planning.pptx
Microsoft-Power-Platform-Adoption-Planning.pptxMicrosoft-Power-Platform-Adoption-Planning.pptx
Microsoft-Power-Platform-Adoption-Planning.pptx
 
Orca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container OrchestrationOrca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container Orchestration
 
Call Girls Bangalore🔥7023059433🔥Best Profile Escorts in Bangalore Available 24/7
Call Girls Bangalore🔥7023059433🔥Best Profile Escorts in Bangalore Available 24/7Call Girls Bangalore🔥7023059433🔥Best Profile Escorts in Bangalore Available 24/7
Call Girls Bangalore🔥7023059433🔥Best Profile Escorts in Bangalore Available 24/7
 
Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)
 
Extreme DDD Modelling Patterns - 2024 Devoxx Poland
Extreme DDD Modelling Patterns - 2024 Devoxx PolandExtreme DDD Modelling Patterns - 2024 Devoxx Poland
Extreme DDD Modelling Patterns - 2024 Devoxx Poland
 
Beginner's Guide to Observability@Devoxx PL 2024
Beginner's  Guide to Observability@Devoxx PL 2024Beginner's  Guide to Observability@Devoxx PL 2024
Beginner's Guide to Observability@Devoxx PL 2024
 
Refactoring legacy systems using events commands and bubble contexts
Refactoring legacy systems using events commands and bubble contextsRefactoring legacy systems using events commands and bubble contexts
Refactoring legacy systems using events commands and bubble contexts
 
TheFutureIsDynamic-BoxLang-CFCamp2024.pdf
TheFutureIsDynamic-BoxLang-CFCamp2024.pdfTheFutureIsDynamic-BoxLang-CFCamp2024.pdf
TheFutureIsDynamic-BoxLang-CFCamp2024.pdf
 
OpenChain Webinar - Open Source Due Diligence for M&A - 2024-06-17
OpenChain Webinar - Open Source Due Diligence for M&A - 2024-06-17OpenChain Webinar - Open Source Due Diligence for M&A - 2024-06-17
OpenChain Webinar - Open Source Due Diligence for M&A - 2024-06-17
 

Why an AI-Powered Data Catalog Tool is Critical to Business Success

  • 2. Data is the Critical Foundation Improve Customer Engagement Improve Patient Outcomes Reduce Fraud Reduce Compliance Risk
  • 3. © Informatica. Proprietary and Confidential.33 Governance & Compliance Because data drives all digital transformation priorities… Self-Service / Advanced Analytics Cloud Migration Customer Experience
  • 4. © Informatica. Proprietary and Confidential.44 Governance & Compliance …And organization rely on data to drive digital transformation Self-Service / Advanced Analytics Cloud Migration Customer Experience Discovery of critical enterprise data provided foundation for data governance program Global Financial Services Company Flipped the 80-20% rule for finding data to analyzing effort for data analysts Enterprise-scale metadata understanding allows them to simplify cloud migration projects Opening up data visibility is fueling development of new services and improving quality of patient care Regional Health Care Company Large National Retailer Large Shipping Company
  • 5. 5 © Informatica. Proprietary and Confidential.5 An AI-Powered data catalog is essential to digital transformation Share your data knowledge Understand & trust your data Catalog ALL enterprise data
  • 6. 6 © Informatica. Proprietary and Confidential.6 © Informatica. Proprietary and Confidential. Data Steward How can I manage metadata for key enterprise data assets? How do I assess and manage data quality through the lifecycle? Data Governance Office How can we validate and enforce our data governance policies and definitions? How can I ensure data managed within application and supporting processes deliver value to the business? Data Owner How can I discover, understand and trust data required for my analysis? Data Consumer How can IT enable business discover data assets with verified data quality and traceability? Data Architect TechnicalBusiness The questions a data catalog solves
  • 7. 7 © Informatica. Proprietary and Confidential.7 © Informatica. Proprietary and Confidential.7 © Informatica. Proprietary and Confidential. Enterprise Data Catalog • Find the data you need with simple, powerful semantic search • Understand your enterprise data with a holistic view • AI-powered automatic discovery, classification and business context • Comprehensive metadata connectivity for all enterprise data • Big Data scale and flexible deployment options • Open framework for custom extensions and integrations
  • 8. 8 © Informatica. Proprietary and Confidential.8 © Informatica. Proprietary and Confidential.8 © Informatica. Proprietary and Confidential. Enterprise Data Catalog Broad Metadata Sources • Technical • Operational • Usage Business Context • Glossary • Policies • Process Wisdom of Crowd • Comments • Ratings • Behavior Knowledge Graph Business & Crowd Sourced Curation AI Curated Catalog Enterprise Data Catalog Data Governance [Data Stewards, Data Architects] • Associate Business glossary to technical objects • Verify business to technical lineage • Track key data elements compliance Self Service Analytics [Data Analysts, Data Scientists] • Google for enterprise data assets • Data Lineage, holistic relationship view • Trust with data profile • Access to data Data Asset Management [Architects, Developers] • Analyze column-level Lineage & Change Impact • View transformation Logic • Data asset and BI usage Structure Discovery, Profiling and Domain Discovery, Similarity Clustering, Recommendations Business Glossary Associations, Business Classifications, Annotations, Comments
  • 9. 9 © Informatica. Proprietary and Confidential.9 Smart Discovery Semantic Search Search datasets using business terminology, synonyms, across related objects and data flows. Example: when you type “grade”, EDC can suggest “tier” aligning with the terminology used in the organization. Data Profiling and Domain Discovery View data profiling statistics alongside data assets to understand data quality before using data for analysis. Profiling statistics include value distributions, patterns, and data type and data domain inference. Example: you can search for “tables with emails” to get all tables that have email information, regardless of the underlying column names Facets Intelligent facets, based on the search results, allow users to narrow the search to the data sets of interest. Facets include both system and custom classifications
  • 10. 10 © Informatica. Proprietary and Confidential. Automatic Data Lineage …Upstream and Downstream • Lineage Discovery from: - Informatica Big Data Management - Cloudera Navigator - HW Atlas - HiveQL - + Other Enterprise Sources - Informatica Powercenter - Microsoft SSIS, Datastage, SAP BO, Oracle Data Integrator - BI
  • 11. 11 © Informatica. Proprietary and Confidential. Relationship Discovery 360 Relationships View Get a 360-degree view of data in a knowledge graph that lets you quickly search, discover, and understand enterprise data and meaningful data relationships. Automatically discover related data sets, technical, business, semantic and usage based relationships. Data Lineage Interactively trace data origin through business-friendly summarized lineage views that highlight the end points and all the complex details in between. A drill-down lineage view expands any lineage path to show columns and lineage diagram metrics. Impact Analysis Perform detailed impact analysis on upstream and downstream data assets. See impact across data asset, resources and users.
  • 12. 12 © Informatica. Proprietary and Confidential. But cataloging and curating all the enterprise data doesn’t sound easy!!
  • 14. PowerCenter | DQ MDM | BDM | DIH BG | ILM | Axon | Informatica Cloud Informatica Oracle | DB2 | DB2 for z/OS SQL Server | Sybase | Teradata Netezza | JDBC | SQL Scripts | SAP HANA | Stored Procedures Databases SAP R/3 | Salesforce Oracle | Workday Applications HIVE (Cloudera, Hortonworks, MapR, IBM BigInsights, EMR, HDI) HDFS | MapRFS | Cloudera Navigator | Atlas Big Data AWS S3 | AWS Redshift | Azure SQL DB | Azure SQL DW | Azure ADLS | Azure Blob | Google BigQuery | ADLS Gen 2 Cloud Platforms CSV | Delimited | XML | JSON | Avro | Parquet | MS Excel | Adobe PDF | Flat File | MS PowerPoint | MS Word File Formats Tableau | IBM Cognos | SAP BusinessObjects MicroStrategy | OBIEE Business Intelligence Microsoft SSIS | Erwin Models | PowerDesigner | Oracle Data Integrator | IBM DataStage | Custom Scanner Framework Other Enterprise Data Catalog
  • 15. =
  • 16. CLAIRE The Intelligence in the Intelligent Data Platform Simplify Data Stewardship Automate Integration and data migration Amplify Analytics Spotlight Data Risk Accelerate Governance Improve Operations Metadata-driven Artificial Intelligence
  • 17. 17 © Informatica. Proprietary and Confidential. Smart Domains Uses Column Similarity Like photo tagging CLAIRE for Columns Clustering based on column metadata and Jaccard Coefficient and Bray Curtis Similarity Column similarity based on data overlap. Large overlap of distinct values Similar value frequencies for overlapping columns
  • 18. 18 © Informatica. Proprietary and Confidential. True Type Discovery …for Entities and Elements Intelligently recommends other data sets that are similar to what they are working on Discovers data domains (name, phone, email…) and data entities (purchase order, health record…) Automatically tags data by learning from users tagging fields and columns, etc.
  • 19. 19 © Informatica. Proprietary and Confidential. …Across Structured, Semi-Structured and Unstructured Person(6)Location(3)Date(1) … Passage from “Chapter 2: The Science of Deduction” from “A Study in Scarlet” by Arthur Conan Doyle Support for rule based data domains Identifying and classifying entities from both structured and unstructured data New unstructured sources supported: Excel, Word, PDF, text files, PPT and more formats
  • 20. 20 © Informatica. Proprietary and Confidential. Open Metadata APIs Access metadata knowledge graph with REST APIs 20 © Informatica. Proprietary and Confidential.
  • 21. 21 © Informatica. Proprietary and Confidential.21 © Informatica. Proprietary and Confidential.21 © Informatica. Proprietary and Confidential. Intelligent Business Term Associations • CLAIRETM powered automatic association of business terms with physical data • Built on top of auto-data domain discovery and data similarity capabilities • Uses NLP techniques to relate business terms to field and column names • Reduces a tedious manual step in data governance Name Asset Type Business Term Recommendation Mil_ID Column Military ID Number Med_Nmbr Column Practitioner Medicare number C_ADJ_KEY Column Claim Adjustment Key NH_Plan_ID Column National Health Plan Identifier DSCH_STAT_CD Column Discharge Status Code NatEmp_ID Column National Employee Identifier
  • 22. 22 © Informatica. Proprietary and Confidential.22 © Informatica. Proprietary and Confidential.22 © Informatica. Proprietary and Confidential. Dataset Certifications • Data Owners, Data Stewards and Subject Matter Experts can certify datasets and data elements • Includes certification comments and keyword specification • Search rankings prioritize certified data assets • New search facet for showing certified datasets only in searches Certify fit for use datasets with right business context to enable high quality analytics Data Steward Use the most relevant and trusted data for my analysis Data Consumer
  • 23. 23 © Informatica. Proprietary and Confidential.23 © Informatica. Proprietary and Confidential.23 © Informatica. Proprietary and Confidential. Ratings and Reviews – Yelp for Data • Catalog users can rate data assets and write reviews • User ratings with summaries are displayed for each dataset • Reviewer or Data Owner can edit or remove ratings • Search across data asset user reviews • New search facet for showing results by ratings Find the most relevant and trusted data for my analysis using reviews provided by my peers Data Consumer
  • 24. 24 © Informatica. Proprietary and Confidential.24 © Informatica. Proprietary and Confidential.24 © Informatica. Proprietary and Confidential. Question & Answer – Quora for Data Assets • Questions can be asked by users on the data assets • Answers can be provided by any user • Users can mark answers helpful, which moves the answer up in the default view • Question & Answer text is indexed to help with data asset search ranking No more multiple emails and phone calls for the same queries on data Subject Matter Expert Find experts, ask questions and get timely answers in the context of the dataset Data Consumer
  • 25. 25 © Informatica. Proprietary and Confidential.25 © Informatica. Proprietary and Confidential.25 © Informatica. Proprietary and Confidential. Change Notifications • Follow data assets of interest, get notified of changes • Follow specific changes to objects - source changes, enrichments, collaboration updates • Watch for changes at individual asset level or for an entire resource • Change notifications sent via in app notification center, event email, periodic digest email • New tab at Resource level for a summary of all changes Get notified when new assets are introduced to apply data standards and associate business context Data Steward Manage change impact by tracking changes at the data source level DB Admin Stay on top of changes to important datasets and reports Data Consumer
  • 27. 27 © Informatica. Proprietary and Confidential.27 • Duration: 20 minutes • In this lesson, you will learn the basics of the data catalog. You will learn to search for relevant data assets using search and dynamic faceting capabilities and explore data assets. • In this data catalog basics lesson, you will accomplish the following tasks: • Use the search feature provided by Enterprise Data Catalog to search for an asset and understand the asset details. • Verify the profiling information displayed for the asset in the Asset Details tab to ensure the quality of data, understand data domains and similar columns • Understand the flow of data using Lineage and Impact • Understand the relationships of the asset • Read reviews, ratings, or comments on the asset and ask question about the asset. Lesson#1 The Basics of Enterprise Data Catalog
  • 28. Lesson #2 Data Catalog for Self-Service Analytics
  • 29. 29 © Informatica. Proprietary and Confidential.29 Duration: 20 minutes As a data analyst, you need analyze the customer order report to understand the elements that make up the report as there are concerns with the report. In this lesson you will learn how to: • Search for the customer order report • Understand how the report was created • Review and provide feedback on the customer order details table based on different aspects of the asset • Ask question about the customer order report • Follow an asset to be informed about the changes that are made to the asset • Additionally (Optionally) • In this lesson, you will learn how the Catalog automatically classifies data based on known domains. You will also learn how you can annotate datasets to further classify data assets along multiple dimensions. • Search Classified Columns • Data Domain Overview • Annotate Data Assets Lesson #2 Data Catalog for Self-Service Analytics
  • 30. Lesson #3 Data Catalog for Data Asset Management
  • 31. 31 © Informatica. Proprietary and Confidential.31 Lesson #3: Data Asset Management Duration: 20 minutes In this lesson, you will learn how to use the new drill down lineage views in the catalog to visualize data provenance. You will also learn how to use the detailed impact analysis reports in the catalog to understand impact due to change in data assets or ETL flows. Objectives • Understand Drill Down Lineage Views in the Catalog • Perform Impact Analysis on Data Assets • Understand Data Domain Curation
  • 32. Lesson #4 Data Catalog for Data Governance
  • 33. 33 © Informatica. Proprietary and Confidential.33 Lesson #4: Data Catalog for Data Governance (optional) Duration: 15 minutes In this lesson, you will also learn how to associate business terminology with technical assets. Finally, you will use EDC’s discovery features to search for technical assets using business vocabulary. Objectives • Associate Business Terms with Technical Metadata • Explore Business Metadata in the Catalog • Search technical assets using business glossary
  翻译: