尊敬的 微信汇率:1円 ≈ 0.046166 元 支付宝汇率:1円 ≈ 0.046257元 [退出登录]
SlideShare a Scribd company logo
Big Data Technologies
Presented By:
Pulkit Narwal
What are Big Data Technologies?
• Actually, Big Data Technologies is the utilized software that
incorporates data mining, data storage, data sharing, and data
visualization, the comprehensive term embraces data, data
framework including tools and techniques used to investigate and
transform data.
• Big Data Technology can be defined as a Software-Utility that is
designed to Analyse, Process and Extract the information from an
extremely complex and large data sets which the Traditional Data
Processing Software could never deal with.
Types of Big Data Technologies:
• Big Data Technology is mainly classified into two types:
1. Operational Big Data Technologies
2. Analytical Big Data Technologies
Operational Big Data Technologies
• The Operational Big Data is all about the normal day to day data that we
generate. This could be the Online Transactions, Social Media, or the data
from a Particular Organisation etc. You can even consider this to be a kind
of Raw Data which is used to feed the Analytical Big Data Technologies.
• A few examples of Operational Big Data Technologies are as follows:
Online ticket bookings, which includes your Rail tickets, Flight tickets, movie tickets
etc.
Online shopping which is your Amazon, Flipkart, Walmart, Snap deal and many more.
Data from social media sites like Facebook, Instagram, what’s app and a lot more.
The employee details of any Multinational Company.
Analytical Big Data Technologies
• Analytical Big Data is like the advanced version of Big Data
Technologies. It is a little complex than the Operational Big Data. In
short, Analytical big data is where the actual performance part comes
into the picture and the crucial real-time business decisions are made
by analyzing the Operational Big Data.
• Few examples of Analytical Big Data Technologies are as follows:
Stock marketing
Carrying out the Space missions where every single bit of information is
crucial.
Weather forecast information.
Medical fields where a particular patients health status can be monitored.
Types of Big Data Technologies
• Top big data technologies are divided into 4 fields which are classified
as follows:
1. Data Storage
2. Data Mining
3. Data Analytics
4. Data Visualization
Source: javatpoint
Data Storage
1. Hadoop Framework was designed to store and process data in
a Distributed Data Processing Environment with commodity
hardware with a simple programming model. It can Store and
Analyse the data present in different machines with High Speeds
and Low Costs.
• Developed by: Apache Software Foundation in the year 2011 10th of Dec.
• Written in: JAVA
• Current stable version: Hadoop 3.11
Data Storage
2. MongoDB: The NoSQL Document Databases like MongoDB, offer a
direct alternative to the rigid schema used in Relational Databases.
This allows MongoDB to offer Flexibility while handling a wide
variety of Datatypes at large volumes and across Distributed
Architectures.
• Developed by: MongoDB in the year 2009 11th of Feb
• Written in: C++, Go, JavaScript, Python
• Current stable version: MongoDB 4.0.10
Data Storage
3. RainStor is a software company that developed a Database
Management System of the same name designed to Manage and
Analyse Big Data for large enterprises. It uses Deduplication
Techniques to organize the process of storing large amounts of data
for reference.
• Developed by: RainStor Software company in the year 2004.
• Works like: SQL
• Current stable version: RainStor 5.5
Data Storage
4. Hunk lets you access data in remote Hadoop Clusters through
virtual indexes and lets you use the Splunk Search Processing
Language to analyse your data. With Hunk, you can Report and
Visualize large amounts from your Hadoop and NoSQL data sources.
• Developed by: Splunk INC in the year 2013.
• Written in: JAVA
• Current stable version: Splunk Hunk 6.2
Data Storage
5. Cassandra: Cassandra forms a top choice among the list of popular
NoSQL databases which is a free and an open-source database,
which is distributed and has a wide columnar storage and can
efficiently handle data on large commodity clusters i.e. it is used to
provide high availability along with no single failure point.
• Among the list of main features includes the ones like distributed nature,
scalability, fault-tolerant mechanism, MapReduce support, tunable
consistency, query language property, supports multi data center replication
and eventual consistency.
Data Mining
1. Presto is an open source Distributed SQL Query Engine for
running Interactive Analytic Queries against data sources of all
sizes ranging from Gigabytes to Petabytes. Presto allows querying
data in Hive, Cassandra, Relational Databases and Proprietary Data
Stores.
• Developed by: Apache Foundation in the year 2013.
• Written in: JAVA
• Current stable version: Presto 0.22
Data Mining
2. RapidMiner is a Centralized solution that features a very powerful
and robust Graphical User Interface that enables users to Create,
Deliver, and maintain Predictive Analytics. It allows creating very
Advanced Workflows, Scripting support in several languages.
• Developed by: RapidMiner in the year 2001
• Written in: JAVA
• Current stable version: RapidMiner 9.2
Data Mining
3. Elasticsearch is a Search Engine based on the Lucene Library. It
provides a Distributed, MultiTenant-capable, Full-Text Search Engine
with an HTTP Web Interface and Schema-free JSON documents.
• Developed by: Elastic NV in the year 2012.
• Written in: JAVA
• Current stable version: ElasticSearch 7.1
Data Analytics
1. Apache Kafka is a Distributed Streaming platform. A streaming
platform has Three Key Capabilities that are as follows:
i. Publisher
ii. Subscriber
iii. Consumer
This is similar to a Message Queue or an Enterprise Messaging System.
• Developed by: Apache Software Foundation in the year 2011
• Written in: Scala, JAVA
• Current stable version: Apache Kafka 2.2.0
Data Analytics
2. Splunk captures, Indexes, and correlates Real-time data in a
Searchable Repository from which it can generate Graphs, Reports,
Alerts, Dashboards, and Data Visualizations. It is also used for
Application Management, Security and Compliance, as well as
Business and Web Analytics.
• Developed by: Splunk INC in the year 2014 6th May
• Written in: AJAX, C++, Python, XML
• Current stable version: Splunk 7.3
Data Analytics
3. KNIME allows users to visually create Data Flows, Selectively
execute some or All Analysis steps, and Inspect the Results, Models,
and Interactive views. KNIME is written in Java and based on Eclipse
and makes use of its Extension mechanism to add Plugins providing
Additional Functionality.
• Developed by: KNIME in the year 2008
• Written in: JAVA
• Current stable version: KNIME 3.7.2
Data Analytics
4. Spark provides In-Memory Computing capabilities to deliver Speed,
a Generalized Execution Model to support a wide variety of
applications, and Java, Scala, and Python APIs for ease of
development.
• Developed by: Apache Software Foundation
• Written in: Java, Scala, Python, R
• Current stable version: Apache Spark 2.4.3
Data Analytics
5. R is a Programming Language and free software environment
for Statistical Computing and Graphics. The R language is widely
used among Statisticians and Data Miners for developing Statistical
Software and majorly in Data Analysis.
• Developed by: R-Foundation in the year 2000 29th Feb
• Written in: Fortran
• Current stable version: R-3.6.0
Data Analytics
6. BlockChain is used in essential functions such as payment, escrow,
and title can also reduce fraud, increase financial privacy, speed up
transactions, and internationalize markets. BlockChain can be used
for achieving the following in a Business Network Environment:
 Shared Ledger: Here we can append the Distributed System of records across a Business
network.
 Smart Contract: Business terms are embedded in the transaction Database and Executed
with transactions.
 Privacy: Ensuring appropriate Visibility, Transactions are Secure, Authenticated and
Verifiable
 Consensus: All parties in a Business network agree to network verified transactions.
•Developed by: Bitcoin
•Written in: JavaScript, C++, Python
•Current stable version: Blockchain
4.0
Data Visualization
1. Tableau is a Powerful and Fastest growing Data Visualization tool
used in the Business Intelligence Industry. Data analysis is very fast
with Tableau and the Visualizations created are in the form of
Dashboards and Worksheets.
• Developed by: TableAU 2013 May 17th
• Written in: JAVA, C++, Python, C
• Current stable version: TableAU 8.2
Data Visualization
2. Plotly: Mainly used to make creating Graphs faster and more
efficient. API libraries for Python, R, MATLAB, Node.js,
Julia, and Arduino and a REST API. Plotly can also be used to style
Interactive Graphs with Jupyter notebook.
• Developed by: Plotly in the year 2012
• Written in: JavaScript
• Current stable version: Plotly 1.47.4

More Related Content

What's hot

Introduction to elasticsearch
Introduction to elasticsearchIntroduction to elasticsearch
Introduction to elasticsearch
pmanvi
 
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI InitiativesDatabricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data Warehouse
Snowflake Computing
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation Hadoop
Varun Narang
 
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
AWS Chicago
 
Microsoft Purview
Microsoft PurviewMicrosoft Purview
Microsoft Purview
Mohammed Chaaraoui
 
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Cathrine Wilhelmsen
 
Data Lake - Multitenancy Best Practices
Data Lake - Multitenancy Best PracticesData Lake - Multitenancy Best Practices
Data Lake - Multitenancy Best Practices
CitiusTech
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
Dalibor Wijas
 
Data warehouse design
Data warehouse designData warehouse design
Data warehouse design
ines beltaief
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
Snowflake Computing
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
PolarSeven Pty Ltd
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on Snowflake
Kent Graziano
 
Azure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar PresentationAzure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar Presentation
Matthew W. Bowers
 
Presentation of Apache Cassandra
Presentation of Apache Cassandra Presentation of Apache Cassandra
Presentation of Apache Cassandra
Nikiforos Botis
 
Azure data factory
Azure data factoryAzure data factory
Azure data factory
David Giard
 
Building End-to-End Delta Pipelines on GCP
Building End-to-End Delta Pipelines on GCPBuilding End-to-End Delta Pipelines on GCP
Building End-to-End Delta Pipelines on GCP
Databricks
 
Building Data Lakehouse.pdf
Building Data Lakehouse.pdfBuilding Data Lakehouse.pdf
Building Data Lakehouse.pdf
Luis Jimenez
 
Integrating Big Data Technologies
Integrating Big Data TechnologiesIntegrating Big Data Technologies
Integrating Big Data Technologies
DATAVERSITY
 

What's hot (20)

Introduction to elasticsearch
Introduction to elasticsearchIntroduction to elasticsearch
Introduction to elasticsearch
 
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI InitiativesDatabricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI Initiatives
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data Warehouse
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation Hadoop
 
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
 
Microsoft Purview
Microsoft PurviewMicrosoft Purview
Microsoft Purview
 
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
 
Data Lake - Multitenancy Best Practices
Data Lake - Multitenancy Best PracticesData Lake - Multitenancy Best Practices
Data Lake - Multitenancy Best Practices
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
 
Data warehouse design
Data warehouse designData warehouse design
Data warehouse design
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on Snowflake
 
Azure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar PresentationAzure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar Presentation
 
Presentation of Apache Cassandra
Presentation of Apache Cassandra Presentation of Apache Cassandra
Presentation of Apache Cassandra
 
Azure data factory
Azure data factoryAzure data factory
Azure data factory
 
Building End-to-End Delta Pipelines on GCP
Building End-to-End Delta Pipelines on GCPBuilding End-to-End Delta Pipelines on GCP
Building End-to-End Delta Pipelines on GCP
 
Building Data Lakehouse.pdf
Building Data Lakehouse.pdfBuilding Data Lakehouse.pdf
Building Data Lakehouse.pdf
 
Integrating Big Data Technologies
Integrating Big Data TechnologiesIntegrating Big Data Technologies
Integrating Big Data Technologies
 

Similar to Big Data Technologies.pdf

Big Data Open Source Technologies
Big Data Open Source TechnologiesBig Data Open Source Technologies
Big Data Open Source Technologies
neeraj rathore
 
ODSC and iRODS
ODSC and iRODSODSC and iRODS
ODSC and iRODS
Raminder Singh
 
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitAnalysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Slim Baltagi
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
DataWorks Summit/Hadoop Summit
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
DataWorks Summit/Hadoop Summit
 
Top 10 renowned big data companies
Top 10 renowned big data companiesTop 10 renowned big data companies
Top 10 renowned big data companies
Robert Smith
 
BD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdfBD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdf
eramfatima43
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
FredReynolds2
 
25 Best Data Mining Tools in 2022
25 Best Data Mining Tools in 202225 Best Data Mining Tools in 2022
25 Best Data Mining Tools in 2022
Kavika Roy
 
Top 10 Data analytics tools to look for in 2021
Top 10 Data analytics tools to look for in 2021Top 10 Data analytics tools to look for in 2021
Top 10 Data analytics tools to look for in 2021
Mobcoder
 
Memory Management in BigData: A Perpective View
Memory Management in BigData: A Perpective ViewMemory Management in BigData: A Perpective View
Memory Management in BigData: A Perpective View
ijtsrd
 
Big Data
Big DataBig Data
Big Data
Kirubaburi R
 
Big Data
Big DataBig Data
Big Data
Neha Mehta
 
data-mesh-101.pptx
data-mesh-101.pptxdata-mesh-101.pptx
data-mesh-101.pptx
TarekHamdi8
 
Big Data , Big Problem?
Big Data , Big Problem?Big Data , Big Problem?
Big Data , Big Problem?
Mohammadhasan Farazmand
 
Big data Question bank.pdf
Big data Question bank.pdfBig data Question bank.pdf
Big data Question bank.pdf
Sitamarhi Institute of Technology
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
Nitesh Ghosh
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Sreedhar Chowdam
 
Ss eb29
Ss eb29Ss eb29
Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache Software
Bob Marcus
 

Similar to Big Data Technologies.pdf (20)

Big Data Open Source Technologies
Big Data Open Source TechnologiesBig Data Open Source Technologies
Big Data Open Source Technologies
 
ODSC and iRODS
ODSC and iRODSODSC and iRODS
ODSC and iRODS
 
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitAnalysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Top 10 renowned big data companies
Top 10 renowned big data companiesTop 10 renowned big data companies
Top 10 renowned big data companies
 
BD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdfBD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdf
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
25 Best Data Mining Tools in 2022
25 Best Data Mining Tools in 202225 Best Data Mining Tools in 2022
25 Best Data Mining Tools in 2022
 
Top 10 Data analytics tools to look for in 2021
Top 10 Data analytics tools to look for in 2021Top 10 Data analytics tools to look for in 2021
Top 10 Data analytics tools to look for in 2021
 
Memory Management in BigData: A Perpective View
Memory Management in BigData: A Perpective ViewMemory Management in BigData: A Perpective View
Memory Management in BigData: A Perpective View
 
Big Data
Big DataBig Data
Big Data
 
Big Data
Big DataBig Data
Big Data
 
data-mesh-101.pptx
data-mesh-101.pptxdata-mesh-101.pptx
data-mesh-101.pptx
 
Big Data , Big Problem?
Big Data , Big Problem?Big Data , Big Problem?
Big Data , Big Problem?
 
Big data Question bank.pdf
Big data Question bank.pdfBig data Question bank.pdf
Big data Question bank.pdf
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Ss eb29
Ss eb29Ss eb29
Ss eb29
 
Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache Software
 

Recently uploaded

Sri Guru Hargobind Ji - Bandi Chor Guru.pdf
Sri Guru Hargobind Ji - Bandi Chor Guru.pdfSri Guru Hargobind Ji - Bandi Chor Guru.pdf
Sri Guru Hargobind Ji - Bandi Chor Guru.pdf
Balvir Singh
 
CSP_Study - Notes (Paul McNeill) 2017.pdf
CSP_Study - Notes (Paul McNeill) 2017.pdfCSP_Study - Notes (Paul McNeill) 2017.pdf
CSP_Study - Notes (Paul McNeill) 2017.pdf
Ismail Sultan
 
Butterfly Valves Manufacturer (LBF Series).pdf
Butterfly Valves Manufacturer (LBF Series).pdfButterfly Valves Manufacturer (LBF Series).pdf
Butterfly Valves Manufacturer (LBF Series).pdf
Lubi Valves
 
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort ServiceCuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
yakranividhrini
 
Cricket management system ptoject report.pdf
Cricket management system ptoject report.pdfCricket management system ptoject report.pdf
Cricket management system ptoject report.pdf
Kamal Acharya
 
Update 40 models( Solar Cell ) in SPICE PARK(JUL2024)
Update 40 models( Solar Cell ) in SPICE PARK(JUL2024)Update 40 models( Solar Cell ) in SPICE PARK(JUL2024)
Update 40 models( Solar Cell ) in SPICE PARK(JUL2024)
Tsuyoshi Horigome
 
❣Unsatisfied Bhabhi Call Girls Surat 💯Call Us 🔝 7014168258 🔝💃Independent Sura...
❣Unsatisfied Bhabhi Call Girls Surat 💯Call Us 🔝 7014168258 🔝💃Independent Sura...❣Unsatisfied Bhabhi Call Girls Surat 💯Call Us 🔝 7014168258 🔝💃Independent Sura...
❣Unsatisfied Bhabhi Call Girls Surat 💯Call Us 🔝 7014168258 🔝💃Independent Sura...
hotchicksescort
 
Better Builder Magazine, Issue 49 / Spring 2024
Better Builder Magazine, Issue 49 / Spring 2024Better Builder Magazine, Issue 49 / Spring 2024
Better Builder Magazine, Issue 49 / Spring 2024
Better Builder Magazine
 
Microsoft Azure AD architecture and features
Microsoft Azure AD architecture and featuresMicrosoft Azure AD architecture and features
Microsoft Azure AD architecture and features
ssuser381403
 
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
simrangupta87541
 
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASICINTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
GOKULKANNANMMECLECTC
 
An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...
An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...
An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...
DharmaBanothu
 
Call Girls Madurai 8824825030 Escort In Madurai service 24X7
Call Girls Madurai 8824825030 Escort In Madurai service 24X7Call Girls Madurai 8824825030 Escort In Madurai service 24X7
Call Girls Madurai 8824825030 Escort In Madurai service 24X7
Poonam Singh
 
SELENIUM CONF -PALLAVI SHARMA - 2024.pdf
SELENIUM CONF -PALLAVI SHARMA - 2024.pdfSELENIUM CONF -PALLAVI SHARMA - 2024.pdf
SELENIUM CONF -PALLAVI SHARMA - 2024.pdf
Pallavi Sharma
 
Covid Management System Project Report.pdf
Covid Management System Project Report.pdfCovid Management System Project Report.pdf
Covid Management System Project Report.pdf
Kamal Acharya
 
Data Communication and Computer Networks Management System Project Report.pdf
Data Communication and Computer Networks Management System Project Report.pdfData Communication and Computer Networks Management System Project Report.pdf
Data Communication and Computer Networks Management System Project Report.pdf
Kamal Acharya
 
Call Girls Chandigarh 🔥 7014168258 🔥 Real Fun With Sexual Girl Available 24/7...
Call Girls Chandigarh 🔥 7014168258 🔥 Real Fun With Sexual Girl Available 24/7...Call Girls Chandigarh 🔥 7014168258 🔥 Real Fun With Sexual Girl Available 24/7...
Call Girls Chandigarh 🔥 7014168258 🔥 Real Fun With Sexual Girl Available 24/7...
shourabjaat424
 
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
nainakaoornoida
 
FUNDAMENTALS OF MECHANICAL ENGINEERING.pdf
FUNDAMENTALS OF MECHANICAL ENGINEERING.pdfFUNDAMENTALS OF MECHANICAL ENGINEERING.pdf
FUNDAMENTALS OF MECHANICAL ENGINEERING.pdf
EMERSON EDUARDO RODRIGUES
 
College Call Girls Kolkata 🔥 7014168258 🔥 Real Fun With Sexual Girl Available...
College Call Girls Kolkata 🔥 7014168258 🔥 Real Fun With Sexual Girl Available...College Call Girls Kolkata 🔥 7014168258 🔥 Real Fun With Sexual Girl Available...
College Call Girls Kolkata 🔥 7014168258 🔥 Real Fun With Sexual Girl Available...
Ak47
 

Recently uploaded (20)

Sri Guru Hargobind Ji - Bandi Chor Guru.pdf
Sri Guru Hargobind Ji - Bandi Chor Guru.pdfSri Guru Hargobind Ji - Bandi Chor Guru.pdf
Sri Guru Hargobind Ji - Bandi Chor Guru.pdf
 
CSP_Study - Notes (Paul McNeill) 2017.pdf
CSP_Study - Notes (Paul McNeill) 2017.pdfCSP_Study - Notes (Paul McNeill) 2017.pdf
CSP_Study - Notes (Paul McNeill) 2017.pdf
 
Butterfly Valves Manufacturer (LBF Series).pdf
Butterfly Valves Manufacturer (LBF Series).pdfButterfly Valves Manufacturer (LBF Series).pdf
Butterfly Valves Manufacturer (LBF Series).pdf
 
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort ServiceCuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
Cuttack Call Girls 💯Call Us 🔝 7374876321 🔝 💃 Independent Female Escort Service
 
Cricket management system ptoject report.pdf
Cricket management system ptoject report.pdfCricket management system ptoject report.pdf
Cricket management system ptoject report.pdf
 
Update 40 models( Solar Cell ) in SPICE PARK(JUL2024)
Update 40 models( Solar Cell ) in SPICE PARK(JUL2024)Update 40 models( Solar Cell ) in SPICE PARK(JUL2024)
Update 40 models( Solar Cell ) in SPICE PARK(JUL2024)
 
❣Unsatisfied Bhabhi Call Girls Surat 💯Call Us 🔝 7014168258 🔝💃Independent Sura...
❣Unsatisfied Bhabhi Call Girls Surat 💯Call Us 🔝 7014168258 🔝💃Independent Sura...❣Unsatisfied Bhabhi Call Girls Surat 💯Call Us 🔝 7014168258 🔝💃Independent Sura...
❣Unsatisfied Bhabhi Call Girls Surat 💯Call Us 🔝 7014168258 🔝💃Independent Sura...
 
Better Builder Magazine, Issue 49 / Spring 2024
Better Builder Magazine, Issue 49 / Spring 2024Better Builder Magazine, Issue 49 / Spring 2024
Better Builder Magazine, Issue 49 / Spring 2024
 
Microsoft Azure AD architecture and features
Microsoft Azure AD architecture and featuresMicrosoft Azure AD architecture and features
Microsoft Azure AD architecture and features
 
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
Mahipalpur Call Girls Delhi 🔥 9711199012 ❄- Pick Your Dream Call Girls with 1...
 
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASICINTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
INTRODUCTION TO ARTIFICIAL INTELLIGENCE BASIC
 
An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...
An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...
An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...
 
Call Girls Madurai 8824825030 Escort In Madurai service 24X7
Call Girls Madurai 8824825030 Escort In Madurai service 24X7Call Girls Madurai 8824825030 Escort In Madurai service 24X7
Call Girls Madurai 8824825030 Escort In Madurai service 24X7
 
SELENIUM CONF -PALLAVI SHARMA - 2024.pdf
SELENIUM CONF -PALLAVI SHARMA - 2024.pdfSELENIUM CONF -PALLAVI SHARMA - 2024.pdf
SELENIUM CONF -PALLAVI SHARMA - 2024.pdf
 
Covid Management System Project Report.pdf
Covid Management System Project Report.pdfCovid Management System Project Report.pdf
Covid Management System Project Report.pdf
 
Data Communication and Computer Networks Management System Project Report.pdf
Data Communication and Computer Networks Management System Project Report.pdfData Communication and Computer Networks Management System Project Report.pdf
Data Communication and Computer Networks Management System Project Report.pdf
 
Call Girls Chandigarh 🔥 7014168258 🔥 Real Fun With Sexual Girl Available 24/7...
Call Girls Chandigarh 🔥 7014168258 🔥 Real Fun With Sexual Girl Available 24/7...Call Girls Chandigarh 🔥 7014168258 🔥 Real Fun With Sexual Girl Available 24/7...
Call Girls Chandigarh 🔥 7014168258 🔥 Real Fun With Sexual Girl Available 24/7...
 
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
❣Independent Call Girls Chennai 💯Call Us 🔝 7737669865 🔝💃Independent Chennai E...
 
FUNDAMENTALS OF MECHANICAL ENGINEERING.pdf
FUNDAMENTALS OF MECHANICAL ENGINEERING.pdfFUNDAMENTALS OF MECHANICAL ENGINEERING.pdf
FUNDAMENTALS OF MECHANICAL ENGINEERING.pdf
 
College Call Girls Kolkata 🔥 7014168258 🔥 Real Fun With Sexual Girl Available...
College Call Girls Kolkata 🔥 7014168258 🔥 Real Fun With Sexual Girl Available...College Call Girls Kolkata 🔥 7014168258 🔥 Real Fun With Sexual Girl Available...
College Call Girls Kolkata 🔥 7014168258 🔥 Real Fun With Sexual Girl Available...
 

Big Data Technologies.pdf

  • 2. What are Big Data Technologies? • Actually, Big Data Technologies is the utilized software that incorporates data mining, data storage, data sharing, and data visualization, the comprehensive term embraces data, data framework including tools and techniques used to investigate and transform data. • Big Data Technology can be defined as a Software-Utility that is designed to Analyse, Process and Extract the information from an extremely complex and large data sets which the Traditional Data Processing Software could never deal with.
  • 3. Types of Big Data Technologies: • Big Data Technology is mainly classified into two types: 1. Operational Big Data Technologies 2. Analytical Big Data Technologies
  • 4. Operational Big Data Technologies • The Operational Big Data is all about the normal day to day data that we generate. This could be the Online Transactions, Social Media, or the data from a Particular Organisation etc. You can even consider this to be a kind of Raw Data which is used to feed the Analytical Big Data Technologies. • A few examples of Operational Big Data Technologies are as follows: Online ticket bookings, which includes your Rail tickets, Flight tickets, movie tickets etc. Online shopping which is your Amazon, Flipkart, Walmart, Snap deal and many more. Data from social media sites like Facebook, Instagram, what’s app and a lot more. The employee details of any Multinational Company.
  • 5. Analytical Big Data Technologies • Analytical Big Data is like the advanced version of Big Data Technologies. It is a little complex than the Operational Big Data. In short, Analytical big data is where the actual performance part comes into the picture and the crucial real-time business decisions are made by analyzing the Operational Big Data. • Few examples of Analytical Big Data Technologies are as follows: Stock marketing Carrying out the Space missions where every single bit of information is crucial. Weather forecast information. Medical fields where a particular patients health status can be monitored.
  • 6. Types of Big Data Technologies • Top big data technologies are divided into 4 fields which are classified as follows: 1. Data Storage 2. Data Mining 3. Data Analytics 4. Data Visualization Source: javatpoint
  • 7. Data Storage 1. Hadoop Framework was designed to store and process data in a Distributed Data Processing Environment with commodity hardware with a simple programming model. It can Store and Analyse the data present in different machines with High Speeds and Low Costs. • Developed by: Apache Software Foundation in the year 2011 10th of Dec. • Written in: JAVA • Current stable version: Hadoop 3.11
  • 8. Data Storage 2. MongoDB: The NoSQL Document Databases like MongoDB, offer a direct alternative to the rigid schema used in Relational Databases. This allows MongoDB to offer Flexibility while handling a wide variety of Datatypes at large volumes and across Distributed Architectures. • Developed by: MongoDB in the year 2009 11th of Feb • Written in: C++, Go, JavaScript, Python • Current stable version: MongoDB 4.0.10
  • 9. Data Storage 3. RainStor is a software company that developed a Database Management System of the same name designed to Manage and Analyse Big Data for large enterprises. It uses Deduplication Techniques to organize the process of storing large amounts of data for reference. • Developed by: RainStor Software company in the year 2004. • Works like: SQL • Current stable version: RainStor 5.5
  • 10. Data Storage 4. Hunk lets you access data in remote Hadoop Clusters through virtual indexes and lets you use the Splunk Search Processing Language to analyse your data. With Hunk, you can Report and Visualize large amounts from your Hadoop and NoSQL data sources. • Developed by: Splunk INC in the year 2013. • Written in: JAVA • Current stable version: Splunk Hunk 6.2
  • 11. Data Storage 5. Cassandra: Cassandra forms a top choice among the list of popular NoSQL databases which is a free and an open-source database, which is distributed and has a wide columnar storage and can efficiently handle data on large commodity clusters i.e. it is used to provide high availability along with no single failure point. • Among the list of main features includes the ones like distributed nature, scalability, fault-tolerant mechanism, MapReduce support, tunable consistency, query language property, supports multi data center replication and eventual consistency.
  • 12. Data Mining 1. Presto is an open source Distributed SQL Query Engine for running Interactive Analytic Queries against data sources of all sizes ranging from Gigabytes to Petabytes. Presto allows querying data in Hive, Cassandra, Relational Databases and Proprietary Data Stores. • Developed by: Apache Foundation in the year 2013. • Written in: JAVA • Current stable version: Presto 0.22
  • 13. Data Mining 2. RapidMiner is a Centralized solution that features a very powerful and robust Graphical User Interface that enables users to Create, Deliver, and maintain Predictive Analytics. It allows creating very Advanced Workflows, Scripting support in several languages. • Developed by: RapidMiner in the year 2001 • Written in: JAVA • Current stable version: RapidMiner 9.2
  • 14. Data Mining 3. Elasticsearch is a Search Engine based on the Lucene Library. It provides a Distributed, MultiTenant-capable, Full-Text Search Engine with an HTTP Web Interface and Schema-free JSON documents. • Developed by: Elastic NV in the year 2012. • Written in: JAVA • Current stable version: ElasticSearch 7.1
  • 15. Data Analytics 1. Apache Kafka is a Distributed Streaming platform. A streaming platform has Three Key Capabilities that are as follows: i. Publisher ii. Subscriber iii. Consumer This is similar to a Message Queue or an Enterprise Messaging System. • Developed by: Apache Software Foundation in the year 2011 • Written in: Scala, JAVA • Current stable version: Apache Kafka 2.2.0
  • 16. Data Analytics 2. Splunk captures, Indexes, and correlates Real-time data in a Searchable Repository from which it can generate Graphs, Reports, Alerts, Dashboards, and Data Visualizations. It is also used for Application Management, Security and Compliance, as well as Business and Web Analytics. • Developed by: Splunk INC in the year 2014 6th May • Written in: AJAX, C++, Python, XML • Current stable version: Splunk 7.3
  • 17. Data Analytics 3. KNIME allows users to visually create Data Flows, Selectively execute some or All Analysis steps, and Inspect the Results, Models, and Interactive views. KNIME is written in Java and based on Eclipse and makes use of its Extension mechanism to add Plugins providing Additional Functionality. • Developed by: KNIME in the year 2008 • Written in: JAVA • Current stable version: KNIME 3.7.2
  • 18. Data Analytics 4. Spark provides In-Memory Computing capabilities to deliver Speed, a Generalized Execution Model to support a wide variety of applications, and Java, Scala, and Python APIs for ease of development. • Developed by: Apache Software Foundation • Written in: Java, Scala, Python, R • Current stable version: Apache Spark 2.4.3
  • 19. Data Analytics 5. R is a Programming Language and free software environment for Statistical Computing and Graphics. The R language is widely used among Statisticians and Data Miners for developing Statistical Software and majorly in Data Analysis. • Developed by: R-Foundation in the year 2000 29th Feb • Written in: Fortran • Current stable version: R-3.6.0
  • 20. Data Analytics 6. BlockChain is used in essential functions such as payment, escrow, and title can also reduce fraud, increase financial privacy, speed up transactions, and internationalize markets. BlockChain can be used for achieving the following in a Business Network Environment:  Shared Ledger: Here we can append the Distributed System of records across a Business network.  Smart Contract: Business terms are embedded in the transaction Database and Executed with transactions.  Privacy: Ensuring appropriate Visibility, Transactions are Secure, Authenticated and Verifiable  Consensus: All parties in a Business network agree to network verified transactions. •Developed by: Bitcoin •Written in: JavaScript, C++, Python •Current stable version: Blockchain 4.0
  • 21. Data Visualization 1. Tableau is a Powerful and Fastest growing Data Visualization tool used in the Business Intelligence Industry. Data analysis is very fast with Tableau and the Visualizations created are in the form of Dashboards and Worksheets. • Developed by: TableAU 2013 May 17th • Written in: JAVA, C++, Python, C • Current stable version: TableAU 8.2
  • 22. Data Visualization 2. Plotly: Mainly used to make creating Graphs faster and more efficient. API libraries for Python, R, MATLAB, Node.js, Julia, and Arduino and a REST API. Plotly can also be used to style Interactive Graphs with Jupyter notebook. • Developed by: Plotly in the year 2012 • Written in: JavaScript • Current stable version: Plotly 1.47.4
  翻译: