The First Step in Information Management
looker.com
Produced by:
MONTHLY SERIES
In partnership with:
The Missed Promise of Hadoop and New and Emerging Technologies
August 2, 2018
Welcome to Today’s Discussion
• Evolution of Hadoop
• Current state of Hadoop: pros and cons for big data and analytics
• Role of Hadoop in enterprise architectures
• Successful use cases and lessons learned
• Hadoop alternatives
• Best practices and key takeaways
• Q&A
© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
Evolution of Hadoop
• 15 years – Hadoop is firmly entrenched in the landscape
• Hadoop in the cloud means it’s transparent
• Commoditization
• The use of Hadoop seems to be as the landing zone
• Not mission-critical but still relevant – and still maintains some of its long-term issues
[Diagram: managed cloud services, serverless, microservices, vendor solutions, part of ordinary enterprise architecture fabric]
Scope of Hadoop for Analytics
• Data at rest
• Permanent fixture in enterprise architectures
• Will relational ever go away for BI? – No
• Likewise, Hadoop won’t go away for departmental analytics of data at rest
Analyst Viewpoints on Hadoop
• Well-entrenched
• Spark, while a replacement, is still entering the trough of disillusionment
• Other large-volume options (e.g., in-memory DBMS) are mature
• It is part of the landscape and other tools take over its limitations
• AI may eventually overtake Hadoop (let us explain)
Current State of Hadoop – Not a Focal Point, but Important
Good to know...
• Proven data lake “at rest” technology
• The “only” Big Data store and file management (HDFS)
• Analyzing large data sets easily
• Good availability of supporting “wrapping” technology (Podium, Cloudera, etc.)
• HaaS (Hadoop as a Service) becoming more common
Need to be aware...
• Trend to real-time and low-latency affects relevance
• Being supplanted by Spark and Storm
• Open source but still needs expertise
• Lack of governance and security
• Extended implementation and time-to-market
Role of Hadoop – Integral to Reference Architectures
[Diagram: Data Insight Architecture, divided into a Vintage Area and a Contemporary Area. Labels: Business Strategy; Alignment; Data Monetization; Data Life Cycles; Data Management; Data Usage; Legacy BI and Reporting; Data Warehouse, ODS, Mart; Low latency (Spark); Graph; Advanced Analytics; RDBMS, SQL, In-Memory Appliance; Metadata; Lineage; Reference Data; Visualization; ETL, EAI, Replication; Mobile; Logical DW; Unstructured Data; Hadoop, HDFS (for lakes); ERP, Legacy Applications]
Scenarios for Hadoop as a Main Player and a Focal Point
Use Case 1 – Large Organization
• Very large organization with lots of data and skilled technologists
• Owned data centers and offered as a service
• Located where there is a subpar internet backbone (e.g., Hawaii)
Use Case 2 – Direct Control
• Where economics are less important than direct control
• The safest connection is no connection (e.g., research facilities, atomic/nuclear, military)
Lessons Learned from Hadoop (and Other Technologies)
Pizza Tastes Awful
Square slices (Culture management)
Lost recipe (DG)
Poor preparation (DM)
Poor ingredients (DQ)
*Graphic source: Albert Barron “Pizza as a Service,” published on LinkedIn July 30, 2014
• Delivery and storage architecture are irrelevant if you don’t control the raw ingredients and process
• You are still responsible for above-the-line capabilities
• Regardless where Hadoop lives, there are still significant shortcomings across all industries that affect its usefulness, but aren’t the fault of Hadoop
Alternatives to Hadoop (not Exhaustive)
• Apache Spark
• Apache Storm
• Google BigQuery
• DataTorrent RTS
• Hydra
• Amazon S3
• Snowflake
• Amazon Redshift
• Podium
• Hortonworks
• Cloudera
• Cassandra
• MongoDB
• Apache HBase
Non-standard Uses of Hadoop
• AI and Machine Learning for creating analytic data sets
− Use algorithms to match and identify related data
− Create at-rest data sets (in Hadoop) and use new tools for access
− Data scientists vet results and confirm usefulness
− Use as a Lake to store, access and distribute results
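As a rough illustration of "use algorithms to match and identify related data," the sketch below pairs records from two sources by fuzzy name similarity. It is a minimal stand-in, not the presenters' method: the record layout, the `match_related` helper, the sample data, and the 0.85 threshold are all assumptions made for this example, and a production pipeline would more likely run a trained matching model over data stored in the lake.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio between two normalized strings."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def match_related(left, right, threshold=0.85):
    """Pair each left record with its best right record at or above threshold.

    Both inputs are lists of dicts with hypothetical "id" and "name" keys.
    """
    matches = []
    for l in left:
        # Pick the right-hand record whose name is most similar to this one.
        best = max(right, key=lambda r: similarity(l["name"], r["name"]), default=None)
        if best is not None and similarity(l["name"], best["name"]) >= threshold:
            matches.append((l["id"], best["id"]))
    return matches


# Hypothetical sample records from two source systems.
crm = [{"id": "c1", "name": "Acme Corporation"}, {"id": "c2", "name": "Globex"}]
billing = [{"id": "b7", "name": "ACME Corporation "}, {"id": "b9", "name": "Initech"}]

related = match_related(crm, billing)
print(related)
```

The matched pairs would then be vetted by data scientists, as the slide notes, before the resulting data set is written back to the lake.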
[Diagram: Vintage Area of the Data Insight Architecture. Labels: Data Life Cycles; Data Management; Data Usage; Legacy BI and Reporting; Data Warehouse, ODS, Mart; ETL, EAI, Msg, Copy; Data Lake; Hadoop; RDBMS, SQL, In-Memory Appliance; Unstructured Data; Structured Data]
Non-standard Uses of Hadoop
• AI and Machine Learning for metadata
− Use AI and machine learning to extract metadata
− Provide insight to data stewards and custodians
− Use at-rest Hadoop as “system of record” for metadata
− Use a graph database for metadata access
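To make "extract metadata" concrete, here is a small sketch that profiles sample records and infers a type and null count per column. It is a simple rule-based stand-in for the AI and machine learning extraction the slide describes, not an implementation of it; the `infer_type` and `profile` helpers and the sample rows are hypothetical.

```python
from collections import Counter


def infer_type(value):
    """Classify one raw value as 'null', 'integer', 'float', or 'string'."""
    if value is None or value == "":
        return "null"
    for caster, name in ((int, "integer"), (float, "float")):
        try:
            caster(value)
            return name
        except (TypeError, ValueError):
            pass
    return "string"


def profile(records):
    """Return {column: {"type": dominant non-null type, "nulls": null count}}."""
    columns = {}
    for record in records:
        for column, value in record.items():
            columns.setdefault(column, Counter())[infer_type(value)] += 1
    metadata = {}
    for column, counts in columns.items():
        non_null = {t: n for t, n in counts.items() if t != "null"}
        dominant = max(non_null, key=non_null.get) if non_null else "null"
        metadata[column] = {"type": dominant, "nulls": counts.get("null", 0)}
    return metadata


# Hypothetical raw rows as they might land in the lake.
rows = [
    {"id": "1", "amount": "9.50", "city": "Honolulu"},
    {"id": "2", "amount": "", "city": "Hilo"},
]

catalog = profile(rows)
print(catalog)
```

Output like this is the kind of insight that could be surfaced to data stewards and stored as metadata of record.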
[Diagram: Vintage Area of the Data Insight Architecture. Labels: Data Life Cycles; Data Management; Data Usage; Legacy BI and Reporting; Data Warehouse, ODS, Mart; ETL, EAI, Msg, Copy; Data Lake; Hadoop; Advanced Analytics; RDBMS, SQL, In-Memory Appliance; Metadata; Logical DW; Data Sources; Graph; BI; Visualization]
The Future of Hadoop
*Image source: The Globe and Mail “The Maytag Man through the years” published June 19, 2017
Staple/mainstay of reliable technology…
Best Practices
• Stick with Hadoop and a product that addresses shortcomings for data at rest
• Design a data architecture, whether or not you are using cloud
• Consider if your organization has unique characteristics that may place Hadoop center stage
Key Takeaways
KEEP IN MIND…
• If you have streaming data, you need to get serious about other types of tech, or just place it in the cloud
• There is no avoiding data governance and data management, regardless of your approach
• Hadoop will remain a part of data supply chains and analytics ecosystems for data at rest and lakes
Hadoop is technology – which ALWAYS commoditizes or disappears – it is not a decision of IF, but HOW
Please Share Your Questions and Comments
MONTHLY SERIES
Thank you for joining us today!
Our Thursday, September 6 #DIAnalytics webinar is:
Advanced Databases and Knowledge Management
John Ladley @jladley
john@firstsanfranciscopartners.com
Kelle O’Neal @kellezoneal
kelle@firstsanfranciscopartners.com

More Related Content

What's hot

Business Value Metrics for Data Governance
Business Value Metrics for Data GovernanceBusiness Value Metrics for Data Governance
Business Value Metrics for Data Governance
DATAVERSITY
 
Real-World Data Governance Webinar: Using Data Governance to Achieve Data Qua...
Real-World Data Governance Webinar: Using Data Governance to Achieve Data Qua...Real-World Data Governance Webinar: Using Data Governance to Achieve Data Qua...
Real-World Data Governance Webinar: Using Data Governance to Achieve Data Qua...
DATAVERSITY
 
Advanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and StewardshipAdvanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and Stewardship
DATAVERSITY
 
A Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKANA Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKAN
Dinothan Muthulingam
 
Data Quality Strategies
Data Quality StrategiesData Quality Strategies
Data Quality Strategies
DATAVERSITY
 
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
DATAVERSITY
 
Do-It-Yourself Metadata Framework
Do-It-Yourself Metadata FrameworkDo-It-Yourself Metadata Framework
Do-It-Yourself Metadata Framework
DATAVERSITY
 
RWDG Slides: Data Governance Roles and Responsibilities
RWDG Slides: Data Governance Roles and ResponsibilitiesRWDG Slides: Data Governance Roles and Responsibilities
RWDG Slides: Data Governance Roles and Responsibilities
DATAVERSITY
 
Balancing Data and Processes to Achieve Organizational Maturity
Balancing Data and Processes to Achieve Organizational MaturityBalancing Data and Processes to Achieve Organizational Maturity
Balancing Data and Processes to Achieve Organizational Maturity
DATAVERSITY
 
RWDG Slides: Three Ways to Manage Your Data Stewards
RWDG Slides: Three Ways to Manage Your Data StewardsRWDG Slides: Three Ways to Manage Your Data Stewards
RWDG Slides: Three Ways to Manage Your Data Stewards
DATAVERSITY
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
DATAVERSITY
 
Governing Quality Analytics
Governing Quality AnalyticsGoverning Quality Analytics
Governing Quality Analytics
DATAVERSITY
 
IT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights TogetherIT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights Together
DATAVERSITY
 
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data GovernanceRWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
DATAVERSITY
 
Big Data Strategies – Organizational Structure and Technology
Big Data Strategies – Organizational Structure and TechnologyBig Data Strategies – Organizational Structure and Technology
Big Data Strategies – Organizational Structure and Technology
DATAVERSITY
 
Metadata Strategies - Data Squared
Metadata Strategies - Data SquaredMetadata Strategies - Data Squared
Metadata Strategies - Data Squared
DATAVERSITY
 
RWDG Slides: Building a Data Governance Roadmap
RWDG Slides: Building a Data Governance RoadmapRWDG Slides: Building a Data Governance Roadmap
RWDG Slides: Building a Data Governance Roadmap
DATAVERSITY
 
How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?
DATAVERSITY
 
Trends in Data Analytics - From Database to Analyst
Trends in Data Analytics - From Database to AnalystTrends in Data Analytics - From Database to Analyst
Trends in Data Analytics - From Database to Analyst
DATAVERSITY
 
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
DATAVERSITY
 

What's hot (20)

Business Value Metrics for Data Governance
Business Value Metrics for Data GovernanceBusiness Value Metrics for Data Governance
Business Value Metrics for Data Governance
 
Real-World Data Governance Webinar: Using Data Governance to Achieve Data Qua...
Real-World Data Governance Webinar: Using Data Governance to Achieve Data Qua...Real-World Data Governance Webinar: Using Data Governance to Achieve Data Qua...
Real-World Data Governance Webinar: Using Data Governance to Achieve Data Qua...
 
Advanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and StewardshipAdvanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and Stewardship
 
A Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKANA Broader Data Management Strategy with DKAN
A Broader Data Management Strategy with DKAN
 
Data Quality Strategies
Data Quality StrategiesData Quality Strategies
Data Quality Strategies
 
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
 
Do-It-Yourself Metadata Framework
Do-It-Yourself Metadata FrameworkDo-It-Yourself Metadata Framework
Do-It-Yourself Metadata Framework
 
RWDG Slides: Data Governance Roles and Responsibilities
RWDG Slides: Data Governance Roles and ResponsibilitiesRWDG Slides: Data Governance Roles and Responsibilities
RWDG Slides: Data Governance Roles and Responsibilities
 
Balancing Data and Processes to Achieve Organizational Maturity
Balancing Data and Processes to Achieve Organizational MaturityBalancing Data and Processes to Achieve Organizational Maturity
Balancing Data and Processes to Achieve Organizational Maturity
 
RWDG Slides: Three Ways to Manage Your Data Stewards
RWDG Slides: Three Ways to Manage Your Data StewardsRWDG Slides: Three Ways to Manage Your Data Stewards
RWDG Slides: Three Ways to Manage Your Data Stewards
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
 
Governing Quality Analytics
Governing Quality AnalyticsGoverning Quality Analytics
Governing Quality Analytics
 
IT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights TogetherIT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights Together
 
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data GovernanceRWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
 
Big Data Strategies – Organizational Structure and Technology
Big Data Strategies – Organizational Structure and TechnologyBig Data Strategies – Organizational Structure and Technology
Big Data Strategies – Organizational Structure and Technology
 
Metadata Strategies - Data Squared
Metadata Strategies - Data SquaredMetadata Strategies - Data Squared
Metadata Strategies - Data Squared
 
RWDG Slides: Building a Data Governance Roadmap
RWDG Slides: Building a Data Governance RoadmapRWDG Slides: Building a Data Governance Roadmap
RWDG Slides: Building a Data Governance Roadmap
 
How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?
 
Trends in Data Analytics - From Database to Analyst
Trends in Data Analytics - From Database to AnalystTrends in Data Analytics - From Database to Analyst
Trends in Data Analytics - From Database to Analyst
 
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
 

Similar to The Missed Promise of Hadoop and New and Emerging Technologies

Big and fast data strategy 2017 jr
Big and fast data strategy 2017 jrBig and fast data strategy 2017 jr
Big and fast data strategy 2017 jr
Jonathan Raspaud
 
View on big data technologies
View on big data technologiesView on big data technologies
View on big data technologies
Krisshhna Daasaarii
 
Supply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 TrendsSupply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 Trends
Retigence Technologies
 
The Top 8 Trends for Big Data in 2016
The Top 8 Trends for Big Data in 2016The Top 8 Trends for Big Data in 2016
The Top 8 Trends for Big Data in 2016
Tableau Software
 
BIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaBIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social Media
Skillspeed
 
Big data tim
Big data timBig data tim
Big data tim
T Weir
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
Stephen Alex
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
Stephen Alex
 
Future of big data nick kabra speaker compendium march 2013
Future of big data nick kabra speaker compendium march 2013Future of big data nick kabra speaker compendium march 2013
Future of big data nick kabra speaker compendium march 2013
nkabra
 
Data Lake Architecture
Data Lake ArchitectureData Lake Architecture
Data Lake Architecture
DATAVERSITY
 
Big Data on Public Cloud
Big Data on Public CloudBig Data on Public Cloud
Big Data on Public Cloud
IMC Institute
 
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus ModelUnstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus Model
Editor IJCATR
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - Jaspersoft
Hortonworks
 
Gartner peer forum sept 2011 orbitz
Gartner peer forum sept 2011   orbitzGartner peer forum sept 2011   orbitz
Gartner peer forum sept 2011 orbitz
Raghu Kashyap
 
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Jonathan Seidman
 
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
Revolution Analytics
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Hortonworks
 
The Big Picture on Big Data and Cognos
The Big Picture on Big Data and CognosThe Big Picture on Big Data and Cognos
The Big Picture on Big Data and Cognos
Senturus
 
SAP HANA SQL Data Warehousing (Sefan Linders)
SAP HANA SQL Data Warehousing (Sefan Linders)SAP HANA SQL Data Warehousing (Sefan Linders)
SAP HANA SQL Data Warehousing (Sefan Linders)
Twan van den Broek
 
Big Data in Action – Real-World Solution Showcase
 Big Data in Action – Real-World Solution Showcase Big Data in Action – Real-World Solution Showcase
Big Data in Action – Real-World Solution Showcase
Inside Analysis
 

Similar to The Missed Promise of Hadoop and New and Emerging Technologies (20)

Big and fast data strategy 2017 jr
Big and fast data strategy 2017 jrBig and fast data strategy 2017 jr
Big and fast data strategy 2017 jr
 
View on big data technologies
View on big data technologiesView on big data technologies
View on big data technologies
 
Supply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 TrendsSupply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 Trends
 
The Top 8 Trends for Big Data in 2016
The Top 8 Trends for Big Data in 2016The Top 8 Trends for Big Data in 2016
The Top 8 Trends for Big Data in 2016
 
BIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaBIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social Media
 
Big data tim
Big data timBig data tim
Big data tim
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
 
Future of big data nick kabra speaker compendium march 2013
Future of big data nick kabra speaker compendium march 2013Future of big data nick kabra speaker compendium march 2013
Future of big data nick kabra speaker compendium march 2013
 
Data Lake Architecture
Data Lake ArchitectureData Lake Architecture
Data Lake Architecture
 
Big Data on Public Cloud
Big Data on Public CloudBig Data on Public Cloud
Big Data on Public Cloud
 
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus ModelUnstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus Model
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - Jaspersoft
 
Gartner peer forum sept 2011 orbitz
Gartner peer forum sept 2011   orbitzGartner peer forum sept 2011   orbitz
Gartner peer forum sept 2011 orbitz
 
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
Architecting for Big Data - Gartner Innovation Peer Forum Sept 2011
 
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 
The Big Picture on Big Data and Cognos
The Big Picture on Big Data and CognosThe Big Picture on Big Data and Cognos
The Big Picture on Big Data and Cognos
 
SAP HANA SQL Data Warehousing (Sefan Linders)
SAP HANA SQL Data Warehousing (Sefan Linders)SAP HANA SQL Data Warehousing (Sefan Linders)
SAP HANA SQL Data Warehousing (Sefan Linders)
 
Big Data in Action – Real-World Solution Showcase
 Big Data in Action – Real-World Solution Showcase Big Data in Action – Real-World Solution Showcase
Big Data in Action – Real-World Solution Showcase
 

More from DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
DATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
DATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
DATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
DATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
DATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
DATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
DATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 

More from DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 



The Missed Promise of Hadoop and New and Emerging Technologies

  • 1. The First Step in Information Management looker.com Produced by: MONTHLY SERIES In partnership with: The Missed Promise of Hadoop and New and Emerging Technologies August 2, 2018
  • 2. Welcome to Today’s Discussion
      - Evolution of Hadoop
      - Current state of Hadoop: pros and cons for big data and analytics
      - Role of Hadoop in enterprise architectures
      - Successful use cases and lessons learned
      - Hadoop alternatives
      - Best practices and key takeaways
      - Q&A
      © 2018 First San Francisco Partners www.firstsanfranciscopartners.com
  • 3. Evolution of Hadoop
      - 15 years – Hadoop is firmly entrenched in the landscape
      - Hadoop in the cloud means it’s transparent
      - Commoditization
      - The use of Hadoop seems to be as the landing zone
      - Not mission-critical but still relevant – and still maintains some of its long-term issues
      [Diagram labels: MANAGED CLOUD SERVICES, SERVERLESS, MICROSERVICES, VENDOR SOLUTIONS, PART OF ORDINARY ENTERPRISE ARCHITECTURE FABRIC]
  • 4. Scope of Hadoop for Analytics
      - Data at rest
      - Permanent fixture in enterprise architectures
      - Will relational ever go away for BI? – No
      - Likewise, Hadoop won’t go away for departmental analytics of data at rest
  • 5. Analyst Viewpoints on Hadoop
      - Well-entrenched
      - Spark, while a replacement, is still entering the trough of disillusionment
      - Other large volume options (e.g., in-memory DBMS) are mature
      - It is part of the landscape and other tools take over its limitations
      - AI may eventually overtake Hadoop (let us explain)
  • 6. Current State of Hadoop – Not a Focal Point, but Important
      - Good to know…
          - Proven data lake “at rest” technology
          - The “only” Big Data store and file management (HDFS)
          - Analyzing large data sets easily
          - Good availability of supporting “wrapping” technology (Podium, Cloudera, etc.)
          - HaaS becoming more common
      - Need to be aware…
          - Trend to real-time and low-latency affects relevance
          - Being supplanted by Spark and Storm
          - Open source but still needs expertise
          - Lack of governance and security
          - Extended implementation and time-to-market
  • 7. Role of Hadoop – Integral to Reference Architectures
      [Diagram: Data Insight Architecture. Labels: Data Life Cycles; Data Management; Data Usage; Vintage Area; Contemporary Area; Business Strategy; Legacy BI and Reporting; Data Warehouse, ODS, Mart; Low latency (Spark); Graph; Advanced Analytics; RDBMS, SQL, In-Memory Appliance; Metadata; Lineage; Reference Data; Alignment; Data Monetization; Visualization; ETL, EAI, Replication; Mobile; Logical DW; Unstructured Data; Hadoop, HDFS (for lakes); ERP, Legacy Applications]
  • 8. Scenarios for Hadoop as a Main Player and a Focal Point
      - Use Case 1 – Large Organization
          - Very large organization with lots of data and skilled technologists
          - Owned data centers and offered as a service
          - Located where there is a subpar internet backbone (e.g., Hawaii)
      - Use Case 2 – Direct Control
          - Where economics are less important than direct control
          - The safest connection is no connection (e.g., research facilities, atomic/nuclear, military)
  • 9. Lessons Learned from Hadoop (and Other Technologies)
      [Graphic: “Pizza Tastes Awful”. Labels: Square slices (Culture management); Lost recipe (DG); Poor preparation (DM); Poor ingredients (DQ)]
      *Graphic source: Albert Barron “Pizza as a Service,” published on LinkedIn July 30, 2014
  • 10. Lessons Learned from Hadoop (and Other Technologies)
      - Delivery and storage architecture are irrelevant if you don’t control the raw ingredients and process
      - You are still responsible for above-the-line capabilities
      - Regardless of where Hadoop lives, there are still significant shortcomings across all industries that affect its usefulness, but aren’t the fault of Hadoop
  • 11. Alternatives to Hadoop (not Exhaustive)
      - Apache Spark
      - Apache Storm
      - Google BigQuery
      - DataTorrent RTS
      - Hydra
      - Amazon S3
      - Snowflake
      - Amazon Redshift
      - Podium
      - Hortonworks
      - Cloudera
      - Cassandra
      - MongoDB
      - Apache HBase
  • 12. Non-standard Uses of Hadoop
      - AI and Machine Learning for creating analytic data sets
          - Use algorithms to match and identify related data
          - Create at-rest data sets (in Hadoop) and use new tools for access
          - Data scientists vet results and confirm usefulness
          - Use as a Lake to store, access and distribute results
      [Diagram labels: Vintage Area; Data Life Cycles; Data Management; Data Usage; Legacy BI and Reporting; Data Warehouse, ODS, Mart; ETL, EAI, Msg, Copy; Data Lake; Hadoop; RDBMS, SQL, In-Memory Appliance; Unstructured Data; Structured Data]
  • 13. Non-standard Uses of Hadoop
      - AI and Machine Learning for metadata
          - Use AI and machine learning to extract metadata
          - Provide insight to data stewards and custodians
          - Use at-rest Hadoop as “system of record” for metadata
          - Use Graph database for metadata access
      [Diagram labels: Vintage Area; Data Life Cycles; Data Management; Data Usage; Legacy BI and Reporting; Data Warehouse, ODS, Mart; ETL, EAI, Msg, Copy; Data Lake; Hadoop; Advanced Analytics; RDBMS, SQL, In-Memory Appliance; Metadata; Logical DW; Data Sources; Graph; BI; Visualization]
  • 14. The Future of Hadoop
      - Staple/mainstay of reliable technology…
      *Image source: The Globe and Mail “The Maytag Man through the years” published June 19, 2017
  • 15. Best Practices
      - Stick with Hadoop and a product that addresses shortcomings for data at rest
      - Design a data architecture, whether or not you are using cloud
      - Consider if your organization has unique characteristics that may place Hadoop center stage
  • 16. Key Takeaways
      KEEP IN MIND…
      - If you are streaming data, then you need to get serious about other types of tech, or just place it in the cloud
      - There is no avoiding data governance and data management, regardless of your approach
      - Hadoop will remain a part of data supply chains and analytics ecosystems for data at rest and lakes
      Hadoop is technology – which ALWAYS commoditizes or disappears – it is not a decision of IF, but HOW
  • 17. Please Share Your Questions and Comments MONTHLY SERIES
  • 18. Thank you for joining us today! Our Thursday, September 6 #DIAnaltyics webinar is: Advanced Databases and Knowledge Management.
      - John Ladley, @jladley, john@firstsanfranciscopartners.com
      - Kelle O’Neal, @kellezoneal, kelle@firstsanfranciscopartners.com