Thursday, June 21
11:30 – 12:10 PM
Meeting Room 230C
MIGRATING LEGACY ERP
DATA INTO HADOOP
TECHNICAL PRESENTATION FOR DATAWORKS
2018
Data Processing and Warehousing
Jordan Martz
Director, Technology Solutions
ATTUNITY
David Freriks
Technology Evangelist, Office Strategy
Mgmt
Qlik
Today’s Speakers
WORKING TOGETHER:
REFERENCE ARCHITECTURE FOR
MIGRATING LEGACY ERP DATA INTO
© 2017 Attunity
• Qlik Sense: a highly flexible and scalable analytics platform for BI
• Microsoft Azure: a set of cloud services that help organizations meet their business challenges
• Attunity Replicate: software that accelerates data replication, ingest and streaming across a
wide range of heterogeneous databases, data warehouses and data platforms
• SAP: data management platforms that handle both transactions and analytics in memory on a
single data copy
• Hortonworks DataFlow (HDF): an end-to-end platform that collects, curates, analyzes and acts
on data in real time with a drag-and-drop visual interface
SUMMARY OF “SMASH” USE CASES & PRODUCT
SCENARIOS
MODERN DATA INGEST
METADATA
HIVE
OPTIMIZED
STREAM
OPTIMIZED
CHANGE DATA CAPTURE
CLOUD ON PREM
WAREHOUSE MAINFRAME RDBMS SAP
CDC (log-based) for
high performance,
low latency and low
impact
Single platform for
all key enterprise
systems
Hive-optimized for
HDP and Stream-
optimized for HDF
Point-and-Click with
NO coding and NO
agents
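The log-based CDC idea on this slide can be illustrated with a minimal sketch, assuming a simplified change record of the form (sequence number, operation, key, row). The record layout and the `apply_changes` helper are hypothetical and are not Attunity Replicate's format or API; the sketch only shows how replaying an ordered change log keeps a target copy in sync without re-reading the source tables.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical change record: (log sequence number, operation, key, row).
Change = Tuple[int, str, int, Optional[dict]]


def apply_changes(target: Dict[int, dict], changes: List[Change]) -> Dict[int, dict]:
    """Replay change records in log order onto an in-memory target copy."""
    for _seq, op, key, row in sorted(changes, key=lambda c: c[0]):
        if op in ("insert", "update"):
            target[key] = dict(row)  # upsert the latest row image
        elif op == "delete":
            target.pop(key, None)  # tolerate deletes for keys never loaded
        else:
            raise ValueError(f"unknown operation: {op}")
    return target


# Target state after an initial bulk load (illustrative data).
target = {1: {"material": "M-100", "qty": 5}}

# Changes read from a simulated transaction log, deliberately out of order.
log = [
    (3, "delete", 1, None),
    (1, "update", 1, {"material": "M-100", "qty": 7}),
    (2, "insert", 2, {"material": "M-200", "qty": 1}),
]

apply_changes(target, log)
print(target)
```

Sorting by sequence number matters here: applying the delete before the update would leave a stale row for key 1 in the target.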
SAP DATA INGEST
METADATA
HIVE
OPTIMIZED
STREAM
OPTIMIZED
CHANGE DATA CAPTURE
SAP
NATIVE AGENT
Unlock and decode SAP
application data
Real-time and
continuous ingest with
CDC
Native agent, SAP
certified
All core and industry-
specific SAP ECC
modules
All the standard SAP ECC modules
(FI, CO, MM, PP, SD, PM, HR, …)
All industry-specific solutions
(e.g. IS-Utilities, IS-OIL, …)
SAP
SRM
SAP
ERP
SAP
BW
SAP
HR
SAP
GTS
SAP
CRM
SAP
EWM
SAP
TM
SAP
SCM
ANY INDUSTRY
SOLUTION
SAP
EM
ATTUNITY’S DATA INGEST ACROSS HADOOP
LANDSCAPE
Automate data ingest, flows and refresh
• Use Attunity Replicate to connect across
all data sources
• Keep EDW, HDP & HDF refreshed with
Change Data Capture (CDC)
• Bulk-Load and CDC across many sources,
with Meta-Data
Data
Marts
Business
Analytics
Visualization
& Dashboards
HDP HDF
Hot / Cold Data
Enterprise Data
Warehouse
Hot
Clickstream Web & Social Geolocation Sensor
& Machine
Server
Logs
Unstructured
Batch Ingest & CDC for Data & Meta-Data
Systems of Record
RDBMS
ERP
CRM
EDW
Legacy
DATA INTEGRATION MATURITY MODEL
Levels (from Manual to Automated):
• Level 1: Sandbox
• Level 2: Opportunistic
• Level 3: Workgroup
• Level 4: Enterprise
• Level 5: Transformative
Style:
• Bulk data transfer
• Manual change data capture
• Non-invasive CDC via change logs
• Automatically generate target schemas, process DML, and respond to source DDL changes
• Hybrid deployments; publish to multiple streams; Microservices API
Capabilities:
• Programmatic, resource intensive
• System resource intensive; inflexible and brittle; people-intensive change management
• Non-invasive, agentless, automated movement, flexible
• Real-time analytic availability; Lambda architecture; fully automated
• Resilient; high availability; single console management for global deployments
Product Examples:
• Sqoop
• Sqoop with database time stamps, triggers and ChangeTables; or query-based CDC
• Attunity Replicate
• Attunity Enterprise Manager
• Attunity Visibility
• Attunity Compose for Hive
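The manual, query-based style of change capture named in the model can be sketched as follows. This is an illustration under stated assumptions, not Sqoop's implementation: the `orders` table, its `updated_at` column and the `poll_changes` helper are hypothetical, and an in-memory SQLite database stands in for the source system. It shows why timestamp polling is considered brittle: updates and inserts are found, but a deleted row leaves nothing behind to query.

```python
import sqlite3

# In-memory stand-in for a source database (hypothetical schema).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT, updated_at INTEGER)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "open", 100), (2, "open", 100), (3, "open", 100)],
)


def poll_changes(conn, watermark):
    """Return rows modified after the watermark, plus the new watermark."""
    rows = conn.execute(
        "SELECT id, status, updated_at FROM orders "
        "WHERE updated_at > ? ORDER BY updated_at, id",
        (watermark,),
    ).fetchall()
    new_watermark = max((r[2] for r in rows), default=watermark)
    return rows, new_watermark


# Initial extract: every row is newer than watermark 0.
initial, watermark = poll_changes(conn, 0)

# Source activity between polls: one update, one insert, one delete.
conn.execute("UPDATE orders SET status = 'shipped', updated_at = 200 WHERE id = 1")
conn.execute("INSERT INTO orders VALUES (4, 'open', 210)")
conn.execute("DELETE FROM orders WHERE id = 3")

delta, watermark = poll_changes(conn, watermark)
print(delta)      # the update and the insert are detected
print(watermark)  # the delete of id 3 is invisible to this query
```

Each poll also runs a query against the source tables, which is the system resource cost the model attributes to this level; log-based CDC avoids that by reading the change log instead.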
ATTUNITY REPLICATE
Accelerates data replication, ingest and streaming across a wide range
of heterogeneous databases, data warehouses and data platforms.
ONE USE CASE: REAL-TIME REPLICATION FOR SAP
Native SAP integration
Simplified mapping of complex SAP data model
Decode the proprietary source structures
All core and industry-specific SAP modules
Integrate real-time with all major targets
Deliver to Data Lakes, Cloud, et al
SOFTWARE
• S4 (on HANA)
• ERP / ERP Core
Components*
• CRM
• SRM
• Global Trade System
• Master Data
Governance
ATTUNITY REPLICATE SUPPORT FOR SAP
ENVIRONMENTS
SAP Versions Databases Applications
* All modules supported except HR
• Primarily SAP ECC 6.0
+ all EhP levels
• Also ECC 5.0, 4.7
Enterprise and 4.6C
Replicate for SAP
Transform
Filter
Batch
CDC Incremental
In-Memory
File Channel
Batch
ARCHITECTURE
Persistent Store
Extract relationships for Pool and Cluster Tables
RDBMS
(Oracle, DB2, etc.)
Redo/ Archive
logs
or
Journal
File
----------------
Transparent
Tables
On Premises
Kafka
Cloud
Navigate, select SAP objects
within ECC/ERP
Automated ABAP Mapping,
CDC for Pool/Cluster tables
RFC Calls
Attunity Replicate
SAP ECC
(Enterprise Central
Component)
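The Filter and Transform stages in this architecture can be pictured with a small sketch. It assumes rows arrive as dictionaries and that one filter predicate and one transform function sit between source and target; the `replicate`, `only_client_100` and `friendly_names` names are hypothetical and do not reflect Attunity Replicate's configuration or API. The column names MANDT, MATNR and MTART are used only as SAP-style examples, and the row values are made up.

```python
from typing import Callable, Dict, Iterable, Iterator

Row = Dict[str, object]


def replicate(
    rows: Iterable[Row],
    keep: Callable[[Row], bool],
    transform: Callable[[Row], Row],
) -> Iterator[Row]:
    """Yield transformed rows that pass the filter, one at a time."""
    for row in rows:
        if keep(row):
            yield transform(row)


# Illustrative source rows with SAP-style technical column names.
source_rows = [
    {"MANDT": "100", "MATNR": "000000000000000042", "MTART": "FERT"},
    {"MANDT": "200", "MATNR": "000000000000000043", "MTART": "ROH"},
    {"MANDT": "100", "MATNR": "000000000000000044", "MTART": "ROH"},
]


def only_client_100(row: Row) -> bool:
    """Filter stage: keep rows that belong to client 100."""
    return row["MANDT"] == "100"


def friendly_names(row: Row) -> Row:
    """Transform stage: rename columns and strip zero padding."""
    return {
        "client": row["MANDT"],
        "material": str(row["MATNR"]).lstrip("0"),
        "material_type": row["MTART"],
    }


delivered = list(replicate(source_rows, only_client_100, friendly_names))
print(delivered)
```

Because `replicate` is a generator, the same two stages could be applied to a one-off batch of rows or to a continuous stream of change records without holding the whole data set in memory.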
SAP MODULES
Modules
• FICO: Finance & Controlling
• SD: Sales & Distribution
• MM: Materials Management
• PP: Production Planning
• SM: Service Management
• QM: Quality Management
• WM: Warehouse Management
• TM: Transportation Management
• HR: Human Resources
• PS: Project Systems
Specialized Modules
• CRM: Customer Relationship Management
• SRM: Supplier Relationship Management
• APO: Advanced Planner and Optimizer
• PLM: Product Lifecycle Management
• SCM: Supply Chain Management
• E-Procurement
• FSCM: Financial Supply Chain Management
• EHS: Environment Health and Safety
• SEM: Strategic Enterprise Management
• BI: Business Intelligence
• PI: Process Integration
• EWM: Extended Warehouse Management
See the whole story
that lives within your SAP data
Qlik Connector for SAP
APIs
Qlik SAP Integration with Attunity
Qlik App
Qlik Platform
Portal integration
Advanced Self Service
Live Access WebApps
The Qlik platform – for all users
Most Big Data Users are not Data Scientists
Deep drilling
Mostly drilling, some exploration
Mostly exploration,
some drilling
Data Experts
Data Scientists
Breadth of Coverage
Depth of Coverage
Data Explorers
Descriptive, diagnostic and predictive analytics
(“What happened?”, “Why did it happen?” and “What is likely to happen?”)
Qlik + Attunity + Hortonworks Replicate Benefits
• Supports all SAP modules with
corresponding pre-built Qlik Sense or
QlikView applications
• Real-time access to your SAP data
• Changes in SAP are replicated in
real-time into Hortonworks
• Leverage the power of Hortonworks to
transform your SAP data
• Leverage the power of Qlik to unlock
the insights in your SAP data
Qlik – Sales & Distribution App
Tables Transforms
Data Model
Qlik – Material Management App
Data Model
Tables Transforms
Thank you
attunity.com

More Related Content

What's hot

What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3
DataWorks Summit
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
DataWorks Summit
 
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
DataWorks Summit
 
The rise of big data governance: insight on this emerging trend from active o...
The rise of big data governance: insight on this emerging trend from active o...The rise of big data governance: insight on this emerging trend from active o...
The rise of big data governance: insight on this emerging trend from active o...
DataWorks Summit
 
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
avanttic Consultoría Tecnológica
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...
Daniel Martin
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
DataWorks Summit
 
Key trends in Big Data and new reference architecture from Hewlett Packard En...
Key trends in Big Data and new reference architecture from Hewlett Packard En...Key trends in Big Data and new reference architecture from Hewlett Packard En...
Key trends in Big Data and new reference architecture from Hewlett Packard En...
Ontico
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
DataWorks Summit
 
Benefits of Hadoop as Platform as a Service
Benefits of Hadoop as Platform as a ServiceBenefits of Hadoop as Platform as a Service
Benefits of Hadoop as Platform as a Service
DataWorks Summit/Hadoop Summit
 
Log I am your father
Log I am your fatherLog I am your father
Log I am your father
DataWorks Summit/Hadoop Summit
 
Breakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreBreakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data Store
Cloudera, Inc.
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
DataWorks Summit
 
Securing your Big Data Environments in the Cloud
Securing your Big Data Environments in the CloudSecuring your Big Data Environments in the Cloud
Securing your Big Data Environments in the Cloud
DataWorks Summit
 
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
Logical Data Warehouse: How to Build a Virtualized Data Services LayerLogical Data Warehouse: How to Build a Virtualized Data Services Layer
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
DataWorks Summit
 
LinkedIn2
LinkedIn2LinkedIn2
Empowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningEmpowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine Learning
DataWorks Summit
 
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAXHow Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
BMC Software
 
Sharing metadata across the data lake and streams
Sharing metadata across the data lake and streamsSharing metadata across the data lake and streams
Sharing metadata across the data lake and streams
DataWorks Summit
 
Data processing at the speed of 100 Gbps@Apache Crail (Incubating)
Data processing at the speed of 100 Gbps@Apache Crail (Incubating)Data processing at the speed of 100 Gbps@Apache Crail (Incubating)
Data processing at the speed of 100 Gbps@Apache Crail (Incubating)
DataWorks Summit
 

What's hot (20)

What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
 
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
 
The rise of big data governance: insight on this emerging trend from active o...
The rise of big data governance: insight on this emerging trend from active o...The rise of big data governance: insight on this emerging trend from active o...
The rise of big data governance: insight on this emerging trend from active o...
 
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Key trends in Big Data and new reference architecture from Hewlett Packard En...
Key trends in Big Data and new reference architecture from Hewlett Packard En...Key trends in Big Data and new reference architecture from Hewlett Packard En...
Key trends in Big Data and new reference architecture from Hewlett Packard En...
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
 
Benefits of Hadoop as Platform as a Service
Benefits of Hadoop as Platform as a ServiceBenefits of Hadoop as Platform as a Service
Benefits of Hadoop as Platform as a Service
 
Log I am your father
Log I am your fatherLog I am your father
Log I am your father
 
Breakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreBreakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data Store
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
 
Securing your Big Data Environments in the Cloud
Securing your Big Data Environments in the CloudSecuring your Big Data Environments in the Cloud
Securing your Big Data Environments in the Cloud
 
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
Logical Data Warehouse: How to Build a Virtualized Data Services LayerLogical Data Warehouse: How to Build a Virtualized Data Services Layer
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
 
LinkedIn2
LinkedIn2LinkedIn2
LinkedIn2
 
Empowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningEmpowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine Learning
 
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAXHow Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
 
Sharing metadata across the data lake and streams
Sharing metadata across the data lake and streamsSharing metadata across the data lake and streams
Sharing metadata across the data lake and streams
 
Data processing at the speed of 100 Gbps@Apache Crail (Incubating)
Data processing at the speed of 100 Gbps@Apache Crail (Incubating)Data processing at the speed of 100 Gbps@Apache Crail (Incubating)
Data processing at the speed of 100 Gbps@Apache Crail (Incubating)
 

Similar to Migrating legacy ERP data into Hadoop

Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow Presentation
Denodo
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & Bénéfices
Denodo
 
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
Hortonworks
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?
James Serra
 
Data Driven Advanced Analytics using Denodo Platform on AWS
Data Driven Advanced Analytics using Denodo Platform on AWSData Driven Advanced Analytics using Denodo Platform on AWS
Data Driven Advanced Analytics using Denodo Platform on AWS
Denodo
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
StampedeCon
 
The Most Trusted In-Memory database in the world- Altibase
The Most Trusted In-Memory database in the world- AltibaseThe Most Trusted In-Memory database in the world- Altibase
The Most Trusted In-Memory database in the world- Altibase
Altibase
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
Denodo
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Denodo
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data Analytics
Attunity
 
Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...
Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...
Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...
Matt Stubbs
 
Bringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceBringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to Salesforce
Salesforce Developers
 
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudBring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
DataWorks Summit
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
James Serra
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
Denodo
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
DataWorks Summit/Hadoop Summit
 
Extreme SSAS- SQL 2011
Extreme SSAS- SQL 2011Extreme SSAS- SQL 2011
Extreme SSAS- SQL 2011
Itay Braun
 
Trivadis Azure Data Lake
Trivadis Azure Data LakeTrivadis Azure Data Lake
Trivadis Azure Data Lake
Trivadis
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
Databricks
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Denodo
 

Similar to Migrating legacy ERP data into Hadoop (20)

Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow Presentation
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & Bénéfices
 
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?
 
Data Driven Advanced Analytics using Denodo Platform on AWS
Data Driven Advanced Analytics using Denodo Platform on AWSData Driven Advanced Analytics using Denodo Platform on AWS
Data Driven Advanced Analytics using Denodo Platform on AWS
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
The Most Trusted In-Memory database in the world- Altibase
The Most Trusted In-Memory database in the world- AltibaseThe Most Trusted In-Memory database in the world- Altibase
The Most Trusted In-Memory database in the world- Altibase
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data Analytics
 
Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...
Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...
Big Data LDN 2018: FORTUNE 100 LESSONS ON ARCHITECTING DATA LAKES FOR REAL-TI...
 
Bringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceBringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to Salesforce
 
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudBring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
 
Extreme SSAS- SQL 2011
Extreme SSAS- SQL 2011Extreme SSAS- SQL 2011
Extreme SSAS- SQL 2011
 
Trivadis Azure Data Lake
Trivadis Azure Data LakeTrivadis Azure Data Lake
Trivadis Azure Data Lake
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 

More from DataWorks Summit

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
DataWorks Summit
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
DataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
DataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
DataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
 

Migrating legacy ERP data into Hadoop

  • 1. Thursday, June 21 11:30 – 12:10 PM Meeting Room 230C MIGRATING LEGACY ERP DATA INTO HADOOP TECHNICAL PRESENTATION FOR DATAWORKS 2018 Data Processing and Warehousing
  • 2. 2 Jordan Martz Director, Technology Solutions ATTUNITY David Freriks Technology Evangelist, Office Strategy Mgmt Qlik Today’s Speakers
  • 3. WORKING TOGETHER: REFERENCE ARCHITECTURE FOR MIGRATING LEGACY ERP DATA INTO
  • 4. 4© 2017 Attunity • Qlik Sense: a highly flexible and scalable analytics platform for BI • Microsoft Azure: set of cloud services to help organizations meet their business challenges. • Attunity Replicate: software that accelerates data replication, ingest and streaming across a wide range of heterogeneous databases, data warehouses and data platforms • SAP: data management platforms to handle both transactions and analytics in memory on a single data copy • Hortonworks Data Flow (HDF): end-to-end platform that collects, curates, analyzes and acts on data in real-time with a drag-and-drop visual interface SUMMARY OF “SMASH” USE CASES & PRODUCT SCENARIOS
  • 5. 5© 2017 Attunity MODERN DATA INGEST METADATA HIVE OPTIMIZED STREAM OPTIMIZED CHANGE DATA CAPTURE CLOUD ON PREM WAREHOUSE MAINFRAME RDBMS SAP CDC (log-based) for high performance, low latency and low impact Single platform for all key enterprise systems Hive-optimized for HDP and Stream- optimized for HDF Point-and-Click with NO coding and NO agents
  • 6. 6© 2017 Attunity SAP DATA INGEST METADATA HIVE OPTIMIZED STREAM OPTIMIZED CHANGE DATA CAPTURE SAP NATIVE AGENT Unlock and decode SAP application data Real-time and continuous ingest with CDC Native agent, SAP certified All core and industry- specific SAP ECC modules All the standard SAP ECC modules (FI, CO, MM, PM, SD, PM, HR, …) All industry specific solutions (i.e. IS-Utilities, IS-OIL, …) SAP SRM SAP ERP SAP BW SAP HR SAP GTS SAP CRM SAP EWM SAP TM SAP SCM ANY INDUSTRY SOLUTION SAP EM
  • 7. 10© 2017 Attunity ATTUNITY’S DATA INGEST ACROSS HADOOP LANDSCAPE Automate data ingest, flows and refresh • Use Attunity Replicate to connect across all data sources • Keep EDW, HDP & HDF refreshed with Change Data Capture (CDC) • Bulk-Load and CDC cross many sources, with Meta-Data Data Marts Business Analytics Visualization & Dashboards HDP HDF Hot / Cold DataEnterprise Data Warehouse Hot Clickstream Web & Social Geolocation Sensor & Machine Server Logs Unstructured Batch Ingest & CDC for Data & Meta-Data Systems of Record RDBMS ERP CRM EDW Legacy
  • 8. 11© 2017 Attunity DATA INTEGRATION MATURITY MODEL Level 1 Sandbox Level 2 Opportunistic Level 3 Workgroup Level 5 Transformative Level 4 Enterprise Bulk data transfer Manual change data capture Non-invasive CDC via change logs Automatically generate target schemas, process DML, and respond to source DDL changes Hybrid deployments; publish to multiple streams; Microservices API; Programmatic, resource intensive System resource intensive; inflexible and brittle; people intensive change management Non-invasive, agentless, automated movement, flexible Real-time analytic availability; Lambda architecture; fully automated Resilient; high- availability; single console management for global deployments Style Capabilities Product Examples Sqoop Sqoop with database time stamps, triggers and ChangeTables; or Query-based CDC Attunity Replicate Attunity Enterprise Manager Attunity Visibility Attunity Compose for Hive Manual Automated
  • 9. ATTUNITY REPLICATE Accelerates data replication, ingest and streaming across a wide range of heterogeneous databases, data warehouses and data platforms.
  • 10. 13© 2017 Attunity ONE USE CASE: REAL-TIME REPLICATION FOR SAP Native SAP integration Simplified mapping of complex SAP data model Decode the proprietary source structures All core and industry-specific SAP modules Integrate real-time with all major targets Deliver to Data Lakes, Cloud, et al SOFTWARE
  • 11. 14© 2017 Attunity • S4 (on HANA) • ERP / ERP Core Components* • CRM • SRM • Global Trade System • Master Data Governance ATTUNITY REPLICATE SUPPORT FOR SAP ENVIRONMENTS SAP Versions Databases Applications * All modules supported but HR • Primarily SAP ECC 6.0 + all EhP levels • Also ECC 5.0, 4.7 Enterprise and 4.6C
  • 12. 15© 2017 Attunity Replicate for SAP Transform Filter Batch CDC Incremental In-Memory File Channel Batch ARCHITECTURE Persistent Store Extract relationships for Pool and Cluster Tables RDBMS (Oracle, DB2, etc.) Redo/ Archive logs or Journal File ---------------- Transparent Tables On Premises Kafka Cloud Navigate, select SAP objects within ECC/ERP Automated ABAP Mapping, CDC for Pool/Cluster tables RFC Calls Attunity Replicate SAP ECC (Enterprise Central Component)
  • 13. 16© 2017 Attunity SAP MODULES Module Module Description Specialized Modules Modules FICO Finance & Controlling CRM Customer Relationship Management SD Sales & Distribution SRM Supplier Relationship Management MM Materials Management APO Advanced Planner and Optimizer PP Production Planning PLM Product Lifecycle Management SM Service Management SCM Supply Chain Management QM Quality Management E-Procurement WM Warehouse Management FSCM Financial Supply Chain Management TM Transportation Management EHS Environment Health and Safety HR Human Resources SEM Strategic Enterprise Management PS Project Systems BI Business Intelligence PI Process Integration EWM Extended Warehouse Management
  • 14. See the whole story that lives within your SAP data Qlik Connector for SAP
  • 15. APIs Qlik SAP Integration with Attunity Qlik App Qlik Platform Portal integration Advanced Self Service Live Access WebApps
  • 16. The Qlik platform – for all users Most Big Data Users are not Data Scientists Deep drilling Mostly drilling, some exploration Mostly exploration, some drilling Data Experts Data Scientists Breadth of Coverage Depth of Coverage Data Explorers Descriptive, diagnostic and predictive analytics (“What happened?”, “Why did it happen?” and “What is likely to happen?”)
  • 17. Qlik + Attunity + Hortonworks Replicate Benefits • Supports all SAP modules with corresponding pre-built Qlik Sense or QlikView applications • Real-time access to your SAP data • Changes in SAP are replicated in real-time into Hortonworks • Leverage the power of Hortonworks to transform your SAP data • Leverage the power of Qlik to unlock the insights in your SAP data
  • 18. Qlik – Sales & Distribution App Tables Transforms Data Model
  • 19. Qlik – Material Management App Data Model Tables Transforms
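
The maturity model on slide 8 lists query-based CDC with database time stamps as the Level 2 style, ahead of log-based CDC. As a rough illustration of what that style involves, here is a minimal sketch using Python's built-in sqlite3 module. The `orders` table, its `updated_at` column, and the `fetch_changes` helper are hypothetical names chosen for this example; they are not taken from the presentation, Sqoop, or Attunity Replicate.

```python
import sqlite3

def fetch_changes(conn, last_seen):
    """Return rows modified after last_seen, plus the new high-water mark."""
    rows = conn.execute(
        "SELECT id, amount, updated_at FROM orders "
        "WHERE updated_at > ? ORDER BY updated_at",
        (last_seen,),
    ).fetchall()
    # Advance the mark only if something changed.
    new_mark = rows[-1][2] if rows else last_seen
    return rows, new_mark

# Hypothetical source table with a last-modified time stamp column.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL, updated_at INTEGER)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)", [(1, 10.0, 100), (2, 20.0, 105)]
)

# First poll picks up both rows (the initial load).
batch1, mark = fetch_changes(conn, 0)

# The source then updates one row and deletes another.
conn.execute("UPDATE orders SET amount = 25.0, updated_at = 110 WHERE id = 2")
conn.execute("DELETE FROM orders WHERE id = 1")

# Second poll sees the update, but the delete leaves no row to query.
batch2, mark2 = fetch_changes(conn, mark)
```

In this sketch the second poll returns only the updated row. The deleted row is never reported, and every poll puts a query on the source system, which is in line with the "system resource intensive; inflexible and brittle" description the slide gives for this level.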

Editor's Notes

  1. Hello, and welcome to “Migrating Legacy ERP Data into Hadoop”. This is a technical presentation for DataWorks 2018 in San Jose, CA.
  2. Today’s speakers are Jordan Martz, Director of Technology Solutions at Attunity, and David Freriks, Technology Evangelist at Qlik.
  3. ERP data can be hard to interact with at the database level, and translating your business logic out of the tables can be even harder. Legacy ERP architectures offer a system that is great for ERP, but a real challenge to get business insights from. As technology partners, Qlik, Attunity and Hortonworks offer a solution that helps you get large-scale ERP data onto a platform where you can perform analytics. This joint solution makes ERP data available to business users who want to extract value from it. This session explains and demonstrates how bottlenecks within the ERP infrastructure are removed when businesses use Attunity Replicate to ingest data onto the Hortonworks Data Platform, where Qlik software is used to begin building applications for analytics.
  4. Today, we’re going to discuss a few use cases and product scenarios that make up a solution that we call “SMASH”. To keep this from turning into alphabet soup, let’s explain a few things.
     • Qlik Sense: a highly flexible and scalable analytics platform for BI
     • Microsoft Azure: set of cloud services to help organizations meet their business challenges
     • Attunity Replicate: software that accelerates data replication, ingest and streaming across a wide range of heterogeneous databases, data warehouses and data platforms
     • SAP: data management platforms to handle both transactions and analytics in memory on a single data copy
     • Hortonworks Data Flow (HDF): end-to-end platform that collects, curates, analyzes and acts on data in real-time with a drag-and-drop visual interface
  5. To set the stage, and because we’re presenting at the Hortonworks DataWorks conference, let’s start by talking about the Hortonworks Data Platform (HDP) and Hortonworks Data Flow (HDF). Together, they form the Connected Data Platform that works with Data in Motion (connected, real-time, tracked) and Data at Rest (massive scale analysis, retention, security). Modern Data Applications are built on the Connected Data Platform.
  6. SAP is a legacy ERP application and it’s the one that we’ll focus on for today’s presentation.
  7. The Raw content is the direct result of the Replicate tasks. Attunity delivers a semantic layer of source objects from the SAP application so you can choose individual transactions for replication. It is not necessary to know or understand the underlying SAP data model.
  8. Compressed Data Models are a flattening of the SAP data model. In the SAP application, data is spread out over dozens or hundreds of tables, views, aggregate or indexed tables. Attunity can compress those structures into a handful of objects/tables where all of the source data is available, but reduced to fewer objects. These compressed objects are still organized by individual SAP object or document, for example: Sales Documents, Delivery Documents, Billing Documents, Finance Documents, Customer Masters, Material Masters.
  9. SAP Object Logical Views: Logical Views are views created to define and display a particular function of the business. For example, the Sales Order (Order to Cash) process is made up of various SAP documents (customer and material master data, orders, deliveries, and billing documents). The Logical View takes the most relevant aspects of those processes and presents them in a form that can be queried and reported on. One of the unique aspects of the Logical View is that check table relationships are included, so metadata can be linked to the text (e.g. Company Code 1000 can be reported as its text value “North America – US”). Examples of preconfigured Logical Views are listed below. This list can easily be extended through a Professional Services deliverable: Controlling, Project Systems, General Ledger, Accounts Payable, Accounts Receivable, HR, Purchasing (Procurement), Production Planning, Sales/Logistics.
  10. SAP Object Logical Views: Logical Views are views created to define and display a particular function of the business. For example, the Sales Order (Order to Cash) process is made up of various SAP documents (customer and material master data, orders, deliveries, and billing documents). The Logical View takes the most relevant aspects of those processes and presents them in a form that can be queried and reported on. One of the unique aspects of the Logical View is that check table relationships are included, so metadata can be linked to the text (e.g. Company Code 1000 can be reported as its text value “North America – US”). Examples of preconfigured Logical Views are listed below. This list can easily be extended through a Professional Services deliverable: Controlling, Project Systems, General Ledger, Accounts Payable, Accounts Receivable, HR, Purchasing (Procurement), Production Planning, Sales/Logistics.
  11. The Landscape: There are three very important BI constituencies in the Big Data space. Data Scientists are one group. While very few exist in most organizations, they are highly specialized in data mining and advanced analytics. Data Knowledge Workers, like Actuaries, Financial Planners and Statisticians, are a larger group, but still number in the dozens in a large organization. By far the largest group is the Business Analysts, who can number in the hundreds, or even thousands, in large organizations. These users are the daily BI users who analyze departmental and corporate data to understand and act upon metrics and discoveries that impact their business. They can be managers, analysts, specialists, executives or SMEs for a departmental group. But they all know their business area well and consume a lot of BI already. The Problem: Most BI tools claiming to meet the needs of Big Data concentrate only on the Data Scientists’ needs. While these are important needs, they typically serve very few people in an organization, and leave most of the people with Big Data needs out in the cold. How is Qlik Different? Qlik’s platform is an in-memory-first model, with the ability to reach out to databases directly for deep drilling. This means that the bulk of all Big Data analytics (wide, exploratory BI) remains in memory at sub-second response times, and Qlik reaches out to databases for the more expensive, process-intensive queries only when deep drilling is needed. This mix matches the usage patterns that organizations need. Will you opt for a BI tool that specializes in the needs of the few? Or opt for a platform that satisfies all needs, with a comprehensive Big Data approach?
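
The check table relationship described in the Logical Views notes (a code such as Company Code 1000 reported as its text value) can be sketched as a simple join exposed through a view. This is only an illustration using Python's built-in sqlite3 module. The table names `sales_orders` and `company_codes`, the view `sales_order_logical`, and the sample order are hypothetical; they are not the actual SAP tables or the views that Attunity generates.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Hypothetical document table holding only the coded value.
    CREATE TABLE sales_orders (order_id TEXT, company_code TEXT, net_value REAL);
    -- Hypothetical check table mapping each code to its text.
    CREATE TABLE company_codes (company_code TEXT PRIMARY KEY, description TEXT);

    INSERT INTO sales_orders VALUES ('0000004711', '1000', 250.0);
    INSERT INTO company_codes VALUES ('1000', 'North America – US');

    -- The logical view resolves the code through the check table,
    -- so report queries never need to know about the lookup.
    CREATE VIEW sales_order_logical AS
    SELECT o.order_id,
           o.company_code,
           c.description AS company_name,
           o.net_value
    FROM sales_orders AS o
    LEFT JOIN company_codes AS c ON c.company_code = o.company_code;
    """
)

row = conn.execute(
    "SELECT order_id, company_name FROM sales_order_logical"
).fetchone()
```

A LEFT JOIN is used here so that an order with an unmapped code still appears in the view with an empty text value, rather than silently dropping out of reports.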