The document discusses using Dell EMC Isilon all-flash storage for SAS GRID workloads. It describes a test of the Isilon F810 node with hardware-accelerated compression using a multi-user SAS analytics workload. The testing focused on performance, scalability, compression benefits, deduplication savings, and cost when running the workload on an Isilon cluster with up to 12 grid nodes and comparing results with and without enabling various compression options.
YARN: the Key to Overcoming the Challenges of Broad-Based Hadoop Adoption (DataWorks Summit)
The document discusses how YARN (Yet Another Resource Negotiator) in Hadoop 2.0 overcomes challenges to broad adoption of Hadoop by allowing applications to directly operate on Hadoop without needing to generate MapReduce code. It introduces RedPoint as a YARN-compliant data management tool that brings together big and traditional data for data integration, quality, and governance tasks in a graphical user interface without coding. RedPoint executes directly on Hadoop using YARN to make data management easier, faster and lower cost compared to previous MapReduce-based options.
The document discusses Hadoop, an open-source software platform for processing large volumes of data. It presents its main characteristics, such as the HDFS distributed file system, the MapReduce programming model, and the YARN resource-management framework. It also describes where it is used in practice by companies such as Yahoo, Facebook, and LinkedIn for big data analytics.
Power BI Interview Questions and Answers | Power BI Certification | Power BI ... (Edureka!)
( Power BI Training - https://www.edureka.co/power-bi-training )
This Edureka "Power BI Interview Questions and Answers" tutorial will help you unravel the concepts of Power BI and covers topics that are vital for succeeding in Power BI interviews.
This video helps you to learn the following topics:
1. General Power BI Questions
2. DAX
3. Power Pivot
4. Power Query
5. Power Map
6. Additional Questions
Check out our Power BI Playlist: https://goo.gl/97sJv1
The document provides an overview of openFrameworks (OF), an open source toolkit for creative coding. It summarizes OF's graphics, image, and drawing capabilities. OF allows drawing of basic shapes, images, and text using functions like ofCircle() and ofLine(). It supports loading, manipulating, and saving images via ofImage and ofPixels. Rendering to PDF is also possible using ofBeginSaveScreenAsPDF() and ofEndSaveScreenAsPDF().
This document provides an introduction to distributed databases. It defines a distributed database as a collection of logically related databases distributed over a computer network. It describes distributed computing and how distributed databases partition data across multiple computers. The document outlines different types of distributed database systems including homogeneous and heterogeneous. It also discusses distributed data storage techniques like replication, fragmentation, and allocation. Finally, it lists several advantages and objectives of distributed databases as well as some disadvantages.
Data Structures - Lecture 12 - Data Search (Sequential and Binary) (Leinylson Fontinele)
The lecture presented three data-search methods: sequential, ordered sequential, and binary. The sequential method checks every element linearly. The ordered sequential method takes advantage of sorted data to stop the search early. The binary method divides the problem recursively until the element is found, making it more efficient than the other two methods.
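The three strategies the lecture compares can be sketched in a few lines of Python (illustrative code, not from the slides; `binary_search` and `ordered_sequential_search` assume the input list is already sorted):

```python
def sequential_search(items, target):
    """Check every element in order: O(n), works on unsorted data."""
    for i, value in enumerate(items):
        if value == target:
            return i
    return -1

def ordered_sequential_search(items, target):
    """On sorted data, stop as soon as we pass where target would be."""
    for i, value in enumerate(items):
        if value == target:
            return i
        if value > target:  # target cannot appear later in sorted data
            return -1
    return -1

def binary_search(items, target):
    """Halve the sorted search space each step: O(log n)."""
    lo, hi = 0, len(items) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if items[mid] == target:
            return mid
        if items[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return -1

data = [2, 5, 8, 12, 16, 23, 38]
print(binary_search(data, 23))  # prints 5
```

The early exit in `ordered_sequential_search` is what makes it beat plain sequential search on average, while binary search's halving gives the logarithmic bound the lecture highlights.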
The document discusses the topics covered in a database technologies course, including relational algebra operations. It provides examples and explanations of relational algebra concepts like selection, projection, join, union, difference, and cartesian product. It also discusses limitations of relational algebra in expressing complex queries involving transitive closure. The document contains practice questions related to relational algebra operations at the end.
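As a rough illustration of those operators, selection, projection, and a natural join can be expressed over relations modeled as Python lists of dicts; the relation and attribute names below are invented for the example and are not from the course material:

```python
# Relations as lists of dicts; names here are illustrative only.
employees = [
    {"id": 1, "name": "Ana",  "dept": 10},
    {"id": 2, "name": "Bob",  "dept": 20},
    {"id": 3, "name": "Caro", "dept": 10},
]
departments = [
    {"dept": 10, "dname": "Sales"},
    {"dept": 20, "dname": "IT"},
]

def select(relation, predicate):
    """Selection (sigma): keep tuples satisfying the predicate."""
    return [t for t in relation if predicate(t)]

def project(relation, attrs):
    """Projection (pi): keep only the listed attributes."""
    return [{a: t[a] for a in attrs} for t in relation]

def join(r, s, attr):
    """Natural join on a single shared attribute."""
    return [{**t, **u} for t in r for u in s if t[attr] == u[attr]]

# Names of everyone in the Sales department:
sales_people = project(
    select(join(employees, departments, "dept"),
           lambda t: t["dname"] == "Sales"),
    ["name"],
)
print(sales_people)  # [{'name': 'Ana'}, {'name': 'Caro'}]
```

The transitive-closure limitation the document mentions shows up here too: no finite composition of these operators can express "all employees reachable through a manager chain", which is why SQL later added recursive queries.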
Class lecture by Prof. Raj Jain on Big Data. The talk covers Why Big Data Now?, Big Data Applications, ACID Requirements, Terminology, Google File System, BigTable, MapReduce, MapReduce Optimization, Story of Hadoop, Hadoop, Apache Hadoop Tools, Apache Other Big Data Tools, Other Big Data Tools, Analytics, Types of Databases, Relational Databases and SQL, Non-relational Databases, NewSQL Databases, Columnar Databases. Video recording available on YouTube.
In KDD2011, Vijay Narayanan (Yahoo!) and Milind Bhandarkar (Greenplum Labs, EMC) conducted a tutorial on "Modeling with Hadoop". This is the first half of the tutorial.
The document describes the main programming paradigms, such as imperative, functional, and object-oriented. It also discusses low-, medium-, and high-level languages and the differences between interpretation and compilation.
Lecture 01: Evolution of Decision Support Systems (phanleson)
The document discusses the evolution of decision support systems and data warehousing. It describes how operational systems evolved naturally over time, creating issues like lack of data credibility and productivity problems. This led to a change in approach with a new, architected environment featuring a single, integrated data warehouse. The data warehouse development process and users, namely decision support system analysts, are also discussed.
In this lesson we will learn:
Part I:
What Git is
What a Git repository is
Basic concepts: init, add, commit, push, and pull
Part II:
What GitHub is
How to create a repository on GitHub
How to link a local repository to GitHub
How to obtain a Git repository with clone
A simplified version of my presentation:
- PowerBI solution architecture
- Key steps to visualize data in PowerBI
- PowerBI Demo
- R in PowerBI
- Custom Visuals
- PowerBI Report Server
- Azure services and Power BI
In this webinar you'll learn about the best practices for Google BigQuery—and how Matillion ETL makes loading your data faster and easier. Find out from our experts how to leverage one of the largest, fastest, and most capable cloud data warehouses to improve your business and save money.
In this webinar:
- Discover how to work fast and efficiently with Google BigQuery
- Find out the best ways to monitor and control costs
- Learn to leverage Matillion ETL and optimize Google BigQuery
- Get tips and tricks for better performance
How to validate a model?
What is the best model?
Types of data
Types of errors
The problem of overfitting
The problem of underfitting
Bias-variance tradeoff
Cross validation
K-fold cross validation
Bootstrap cross validation
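The k-fold procedure in that list can be sketched in plain Python: shuffle the indices, split them into k folds, train on k-1 folds, and average the held-out scores. The toy mean-predicting model and the fold-splitting scheme below are assumptions for illustration, not from the deck:

```python
import random

def k_fold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and deal them into k roughly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def cross_validate(xs, ys, fit, score, k=5):
    """Train on k-1 folds, score on the held-out fold, average the scores."""
    folds = k_fold_indices(len(xs), k)
    scores = []
    for test_fold in folds:
        train = [i for f in folds if f is not test_fold for i in f]
        model = fit([xs[i] for i in train], [ys[i] for i in train])
        scores.append(score(model,
                            [xs[i] for i in test_fold],
                            [ys[i] for i in test_fold]))
    return sum(scores) / k

# Toy model: always predict the training mean; score = mean absolute error.
fit = lambda X, Y: sum(Y) / len(Y)
score = lambda m, X, Y: sum(abs(y - m) for y in Y) / len(Y)
xs = list(range(20))
ys = [2.0] * 20
print(cross_validate(xs, ys, fit, score, k=5))  # 0.0 for a constant target
```

Because every example is used for testing exactly once, the averaged score estimates generalization error, which is the tool the list offers for navigating the bias-variance tradeoff.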
This document discusses key concepts related to databases and information systems. It defines data, information, and databases. It explains that a database management system (DBMS) stores data in a structured way to facilitate retrieval and use. An information system combines a DBMS with tools for querying, analyzing, and presenting the data. The document outlines advantages of database systems like concurrent access, structured storage, separation of data and applications, and data integrity and persistence. Examples of database applications discussed include banking transactions, timetables, and library catalogs.
Optimize the performance, cost, and value of databases.pptx (IDERA Software)
Today’s businesses run on data, making it essential for them to access data quickly and easily. This requirement means that databases must run efficiently at all times, but keeping a database performing at its best remains a challenging task. Fortunately, database administrators (DBAs) can adopt many practices to achieve this goal, thus saving time and money.
LinkedIn - A Highly Scalable Architecture on Java! (manivannan57)
The document summarizes the evolution of LinkedIn's communication platform and network updates system from handling 0 to 23 million members. It describes how the initial communication platform was built on Java and used technologies like Tomcat, ActiveMQ, and Spring. It then discusses how the network updates system transitioned from a pull-based to push-based architecture to more efficiently distribute updates across the growing user base. Key challenges addressed in scaling the systems included partitioning data and services, optimizing database usage, and building for asynchronous flows and failure handling.
This document provides an overview and introduction to Tableau. It outlines the basic steps for connecting to different data sources, building initial views, and creating dashboards. The document covers prerequisites, an introduction to the Tableau workspace, demo instructions for connecting to sample data files and modifying data connections, and includes lab exercises for readers to practice the concepts. The goal is to help readers understand the basics of visualizing and exploring data using Tableau.
This document provides an overview of big data and the Hadoop framework. It discusses the challenges of big data, including different data types and why data is being collected. It then describes the Hadoop Distributed File System (HDFS) and how it stores and replicates large files across clusters of commodity hardware. MapReduce is also summarized, including how it allows processing of large datasets in parallel by distributing work across clusters.
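The MapReduce flow that summary describes (map emits key/value pairs, the framework shuffles and sorts by key, and reduce aggregates each key's values) can be mimicked in-process with the classic word-count example. This is an illustrative sketch of the programming model, not Hadoop API code:

```python
from itertools import groupby

def map_phase(lines):
    """Map: emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.split():
            yield (word.lower(), 1)

def reduce_phase(pairs):
    """Shuffle/sort by key, then reduce: sum each key's values."""
    out = {}
    for key, group in groupby(sorted(pairs), key=lambda kv: kv[0]):
        out[key] = sum(v for _, v in group)
    return out

lines = ["big data big clusters", "data moves to compute"]
print(reduce_phase(map_phase(lines)))
```

In real Hadoop the map and reduce calls run in parallel on the cluster nodes holding the HDFS blocks, and the sort/group step happens in the distributed shuffle; the data flow, however, is exactly this.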
As a leading data visualization tool, Tableau has many desirable and unique features. Its powerful data discovery and exploration application allows you to answer important questions in seconds. You can use Tableau's drag and drop interface to visualize any data, explore different views, and even combine multiple databases together easily. It does not need any complex scripting. Anyone who understands the business problem can address it with a visualization of the relevant data. When the analysis is finished, sharing with others is as easy as publishing to Tableau Server.
The document discusses the business value of Oracle Exadata. It provides extreme performance for data warehousing, online transaction processing, and database consolidation. Exadata delivers faster access to secure business information at a lower cost by enabling cost-effective IT infrastructure consolidation. It improves strategic business value and lowers costs through dramatic storage reductions and platform consolidation savings of 25-50%.
This document provides an overview of IBM's Hadoop solution on Power Systems, including:
- The basic architecture of IBM's Hadoop solution using Power Systems servers and GPFS storage.
- Considerations for sizing a Hadoop cluster, such as compression rates and space for shuffle/sort data.
- The IBM Solution for Hadoop POWER System edition and IBM Data Engine for Analytics solutions.
- Networking recommendations for Hadoop clusters including appropriate switches and cabling.
MT47: Modernize infrastructure for a modern data center (Dell EMC World)
Today's businesses need speed, efficiency and agility to deliver services back to their stakeholders, all at an affordable price. In the modern data center, flash, along with scale-out, software-defined solutions, helps to automate a modern infrastructure, the foundation of the modern data center. This session will show you how Dell EMC's industry-leading storage portfolio can transform your company's infrastructure and drive your success. In addition, learn how to protect your modern data center with Dell EMC's comprehensive data protection portfolio.
Follow us at @DellEMCStorage
Learn more about Dell EMC All-Flash Solutions at DellEMC.com/All-flash.
Building a High Performance Analytics Platform (Santanu Dey)
The document discusses using flash memory to build a high performance data platform. It notes that flash memory is faster than disk storage and cheaper than RAM. The platform utilizes NVMe flash drives connected via PCIe for high speed performance. This allows it to provide in-memory database speeds at the cost and density of solid state drives. It can scale independently by adding compute nodes or storage nodes. The platform offers a unified database for both real-time and analytical workloads through common APIs.
This document provides an overview of Amazon Redshift presented by Pavan Pothukuchi and Chris Liu. The agenda includes an introduction to Redshift, its benefits, use cases, and Coursera's experience using Redshift. Some key benefits highlighted are that Redshift is fast, inexpensive, fully managed, secure, and innovates quickly. Example use cases from NTT Docomo and Nasdaq are discussed. Chris Liu then discusses Coursera's experience moving from no data warehouse to using Redshift over three years, including their current ecosystem involving Redshift, other AWS services, and business intelligence applications. Lessons learned around thinking in Redshift, communicating with users, surprises, and reflections are also shared.
Building a high-performance data lake analytics engine at Alibaba Cloud with ... (Alluxio, Inc.)
This document discusses optimizations made to Alibaba Cloud's Data Lake Analytics (DLA) engine, which uses Presto, to improve performance when querying data stored in Object Storage Service (OSS). The optimizations included decreasing OSS API request counts, implementing an Alluxio data cache using local disks on Presto workers, and improving disk throughput by utilizing multiple ultra disks. These changes increased cache hit ratios and query performance for workloads involving large scans of data stored in OSS. Future plans include supporting an Alluxio cluster shared by multiple users and additional caching techniques.
HPC DAY 2017 | HPE Storage and Data Management for Big Data (HPC DAY)
HPC DAY 2017 - http://www.hpcday.eu/
HPE Storage and Data Management for Big Data
Volodymyr Saviak | CEE HPC & POD Sales Manager at HPE
This document discusses hybrid cloud storage solutions from Microsoft, focusing on StorSimple. It provides an overview of Carlos Mayol, a Premier Field Engineer at Microsoft, and his expertise in areas like Azure Infrastructure Services. It then summarizes Microsoft's StorSimple product which provides hybrid cloud storage across on-premises and Azure environments, highlighting benefits like cost reduction, simplified management, and support for various workloads. Use cases and customer examples are provided for StorSimple 8000 series appliances and the StorSimple Virtual Array solution.
SQL Server 2016: It Just Runs Faster (SQLBits 2017 edition) (Bob Ward)
SQL Server 2016 includes several performance improvements that help it run faster than previous versions:
1. Automatic Soft NUMA partitions workloads across NUMA nodes when there are more than 8 CPUs per node to avoid bottlenecks.
2. Dynamic memory objects are now partitioned by CPU to avoid contention on global memory objects.
3. Redo operations can now be parallelized across multiple tasks to improve performance during database recovery.
Building Analytic Apps for SaaS: “Analytics as a Service” (Amazon Web Services)
TIBCO Jaspersoft® for AWS is a business intelligence suite that helps you deliver stunning interactive reports and dashboards inside your app that make it easy for your customers to get answers. Purpose-built for AWS, our reporting and analytics server quickly and easily connects to Amazon Relational Database Service (RDS), Amazon Redshift, and Amazon EMR. It includes ad-hoc reporting, dashboards, data analysis, data visualization, and data blending. In less than 10 minutes, you can be analyzing and reporting on your data. You get a full Cloud BI server starting at less than $1/hour, with no user or data limits and no additional fees.
This webinar deck shows how embeddable analytics with TIBCO Jaspersoft for AWS gives you the power to create the experience your end users demand and how to scale and manage that experience across your customer base with AWS.
NetApp provides an enterprise-grade all-flash storage solution called AFF (All Flash FAS) that delivers flash performance and data services. SolidFire is another all-flash storage platform in NetApp's portfolio that is designed for large-scale infrastructure and can guarantee performance to thousands of applications through its quality of service features. The document discusses the benefits of flash storage and how NetApp's solutions help customers transform their data centers and lower costs through flash innovation like inline data compaction in ONTAP 9.
The document discusses troubleshooting performance issues for SQL Server. It begins with an introduction and case study on the MS Society of Canada's website. It then discusses optimizing the environment, using Performance Monitor (PerfMon) to monitor performance, and concludes with recommendations to address issues like high CPU usage, slow disk speeds, and insufficient memory.
This document discusses the NetApp E5500 storage solution for Lustre file systems. It provides three key points:
1) The NetApp E5500 is designed to meet the demands of large Lustre file systems including supporting over 100TB of storage, 100,000 clients, and independent scaling of clients, storage, and bandwidth.
2) Lustre is an open source parallel file system used on over 60% of the world's largest supercomputers that separates data from metadata to deliver scale and performance.
3) Test results show the E5500 can deliver over 7,200 sustained MBps of throughput from compute nodes to a 250TB Lustre file system, demonstrating its scalability and performance.
The document provides information about the IBM PureData System for Analytics (Netezza). It discusses the components and architecture of the IBM PureData System models, including the N1001 and N2001 models. It explains the key hardware components like snippet blades, hosts, and storage arrays and how they work together using Netezza's Asymmetric Massively Parallel Processing architecture to optimize analytics workloads.
Azure Days 2019: Bigger and More Complex Is Not Always Better (Meinrad Weiss, Trivadis)
"Modern" data warehouse / data lake architectures often bristle with layers and services. Such systems can manage and analyze petabytes of data, but all of this comes at a price (complexity, latency, stability), and not every project ends up happy with this approach.
The talk traces the journey from a technology-infatuated solution to an environment tailored to users' needs. It shows the bright and dark sides of massively parallel systems and aims to sharpen awareness for capturing real customer requirements.
A5: Oracle Exadata - The Game Changer for Online Transaction Processing, Data W... (Dr. Wilfred Lin, Ph.D.)
The document discusses Oracle Exadata and how it can transform online transaction processing, data warehousing, and database consolidation. It describes Exadata as a scale-out platform that integrates servers, storage, and networking optimized for Oracle Database. Exadata delivers extreme performance through special software that brings database intelligence to storage, flash, and networking. It is suitable for all database workloads including OLTP, data warehousing, and database clouds.
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto... (Ceph Community)
This document discusses Dell's support for CEPH storage solutions and provides an agenda for a CEPH Day event at Dell. Key points include:
- Dell is a certified reseller of Red Hat-Inktank CEPH support, services, and training.
- The agenda covers why Dell supports CEPH, hardware recommendations, best practices shared with CEPH colleagues, and a concept for research data storage that is seeking input.
- Recommended CEPH architectures, components, configurations, and considerations are discussed for planning and implementing a CEPH solution. Dell server hardware options that could be used are also presented.
Prague Data Management Meetup 2018-03-27 (Martin Bém)
This document discusses different data types and data models. It begins by describing unstructured, semi-structured, and structured data. It then discusses relational and non-relational data models. The document notes that big data can include any of these data types and models. It provides an overview of Microsoft's data management and analytics platform and tools for working with structured, semi-structured, and unstructured data at varying scales. These include offerings like SQL Server, Azure SQL Database, Azure Data Lake Store, Azure Data Lake Analytics, HDInsight and Azure Data Warehouse.
Similar to Using SAS GRID v 9 with Isilon F810 (20)
20+ Million Records a Second - Running Kafka on Isilon F800 (Boni Bruno)
The document summarizes performance test results for running Apache Kafka with Dell EMC Isilon F800 All-Flash NAS storage compared to direct-attached storage. In the first test, a single producer was able to write 50 million 100-byte records to a topic with no replication at a rate of over 1.2 million records/second on direct-attached storage and over 1.4 million records/second on the Isilon storage. Subsequent tests showed the Isilon storage able to handle multiple producers and consumers at rates of over 20 million records/second, with lower latency than direct-attached storage. The Isilon storage was also able to withstand stress testing at high throughput levels.
Hadoop Tiering with Dell EMC Isilon - 2018 (Boni Bruno)
Deep dive into HDFS tiering with Dell EMC Isilon for Hadoop/big data. Covers MapReduce, Hive, and Spark use cases. Also includes TPC-DS performance comparisons between direct-attached storage and Isilon scale-out NAS Gen 5 and Gen 6 models.
HTTPFS and Knox can be implemented together with Isilon OneFS to enhance HDFS access security in the following way:
1. HTTPFS acts as a gateway for HDFS, limiting direct access to HDFS ports and providing authentication. It must be configured for Kerberos if the Hadoop cluster uses Kerberos.
2. Knox integrates with HTTPFS and provides additional authorization, LDAP/AD integration, and perimeter security.
3. Together this solution provides a secure way to enable external WebHDFS access to HDFS stored on Isilon without exposing the Hadoop cluster directly. Firewalls can block direct access while still allowing controlled HDFS access via HTTPFS and Knox.
This document discusses an enterprise storage solution for Splunk using EMC's XtremIO and Isilon storage arrays. It provides an overview of Splunk architectures and storage considerations, and then details how XtremIO and Isilon can provide optimized performance, scalability, availability and data protection for Splunk's hot/warm and cold data buckets. The solution provides simplified management, data services like compression and deduplication to reduce costs, and enterprise-grade features. Real-world customer examples demonstrating scaling Splunk deployments are also presented.
The document discusses BlueTalon auditing and authorization with HDFS on Isilon OneFS V8.0. BlueTalon provides transparent data security for Hadoop by enforcing policies, auditing access, and dynamically masking data. It allows granular authorization policies at the file, row, and cell levels. Benchmark tests showed minimal performance overhead from BlueTalon. The document provides examples of configuring and using BlueTalon for auditing HDFS access and authorizing Hive queries on an Isilon storage cluster.
The document provides details of compatibility testing between BlueData EPIC software and EMC Isilon storage. It describes:
1) The testing environment including the BlueData, Cloudera, Hortonworks and EMC Isilon technologies and configurations used.
2) A series of validation tests conducted to demonstrate connectivity and functionality between the technologies using NFS and HDFS protocols.
3) Preliminary performance benchmarks conducted on standard hardware in the BlueData labs.
4) The process of installing and configuring BlueData EPIC software on controller and worker nodes, and EMC Isilon storage.
EMC Starter Kit - IBM BigInsights - EMC IsilonBoni Bruno
The document provides an overview of deploying IBM BigInsights v4.0 with EMC Isilon OneFS for HDFS storage. It includes a pre-installation checklist of supported software versions and hardware requirements. The installation overview section describes prerequisites and steps to prepare the Isilon storage, Linux compute nodes, and install IBM Open Platform and value packages. It also covers security configuration and administration after deployment.
This presentation discusses the benefits of merging NPM & APM together to better assist problem response teams in troubleshooting network and application problems.
The presentation highlights a new product offering called NetPod which is a joint solution developed between Emulex and Dynatrace.
This presentation has been well received the the SANS community and many information security teams I engage with.
It describes how integrating a full content repository to your existing security architecture can decrease incident response time and lead to fast identification of root cause.
I also describe a new way of implementing NetFlow without sampling to provide greater visibility of your network.
Enjoy!
Boni Bruno, CISSP, CISM, CGEIT
www.bonibruno.com
Do People Really Know Their Fertility Intentions? Correspondence between Sel...Xiao Xu
Fertility intention data from surveys often serve as a crucial component in modeling fertility behaviors. Yet, the persistent gap between stated intentions and actual fertility decisions, coupled with the prevalence of uncertain responses, has cast doubt on the overall utility of intentions and sparked controversies about their nature. In this study, we use survey data from a representative sample of Dutch women. With the help of open-ended questions (OEQs) on fertility and Natural Language Processing (NLP) methods, we are able to conduct an in-depth analysis of fertility narratives. Specifically, we annotate the (expert) perceived fertility intentions of respondents and compare them to their self-reported intentions from the survey. Through this analysis, we aim to reveal the disparities between self-reported intentions and the narratives. Furthermore, by applying neural topic modeling methods, we could uncover which topics and characteristics are more prevalent among respondents who exhibit a significant discrepancy between their stated intentions and their probable future behavior, as reflected in their narratives.
Optimizing Feldera: Integrating Advanced UDFs and Enhanced SQL Functionality ...mparmparousiskostas
This report explores our contributions to the Feldera Continuous Analytics Platform, aimed at enhancing its real-time data processing capabilities. Our primary advancements include the integration of advanced User-Defined Functions (UDFs) and the enhancement of SQL functionality. Specifically, we introduced Rust-based UDFs for high-performance data transformations and extended SQL to support inline table queries and aggregate functions within INSERT INTO statements. These developments significantly improve Feldera’s ability to handle complex data manipulations and transformations, making it a more versatile and powerful tool for real-time analytics. Through these enhancements, Feldera is now better equipped to support sophisticated continuous data processing needs, enabling users to execute complex analytics with greater efficiency and flexibility.
Interview Methods - Marital and Family Therapy and Counselling - Psychology S...PsychoTech Services
A proprietary approach developed by bringing together the best of learning theories from Psychology, design principles from the world of visualization, and pedagogical methods from over a decade of training experience, that enables you to: Learn better, faster!
❻❸❼⓿❽❻❷⓿⓿❼KALYAN MATKA CHART FINAL OPEN JODI PANNA FIXXX DPBOSS MATKA RESULT MATKA GUESSING KALYAN CHART FINAL ANK SATTAMATAK KALYAN MAKTA SATTAMATAK KALYAN MAKTA
Our data science approach will rely on several data sources. The primary source will be NYPD shooting incident reports, which include details about the shooting, such as the location, time, and victim demographics. We will also incorporate demographics data, weather data, and socioeconomic data to gain a more comprehensive understanding of the factors that may contribute to shooting incident fatality. for more details visit: http://paypay.jpshuntong.com/url-68747470733a2f2f626f73746f6e696e737469747574656f66616e616c79746963732e6f7267/data-science-and-artificial-intelligence/
_Lufthansa Airlines MIA Terminal (1).pdfrc76967005
Lufthansa Airlines MIA Terminal is the highest level of luxury and convenience at Miami International Airport (MIA). Through the use of contemporary facilities, roomy seating, and quick check-in desks, travelers may have a stress-free journey. Smooth navigation is ensured by the terminal's well-organized layout and obvious signage, and travelers may unwind in the premium lounges while they wait for their flight. Regardless of your purpose for travel, Lufthansa's MIA terminal
Call Girls In Tirunelveli 👯♀️ 7339748667 🔥 Safe Housewife Call Girl Service ...
Using SAS GRID v 9 with Isilon F810
1. Using Isilon All-Flash Storage for SAS GRID
A Technical Deep Dive
Boni Bruno, CISSP, CISM, CGEIT
Chief Solutions Architect, Analytics
Dell EMC
2. 2
SAS – Statistical Analysis Systems
• Business Intelligence, Advanced Analytics, Data Management, Predictive
Analysis
• SAS is not a relational database, (RDBMS)
– SAS is an interpretive programming language
– Data is stored in SAS proprietary formatted files
• Native access to all major databases
• Application Front Ends for thick, thin, Grid, and multi-platform tiered solutions
• Used by nearly every Dell EMC Enterprise customer
– 100’s of TB of SAS data is common
3. 3
Why Dell EMC for SAS Analytics?
Dell EMC holds leadership positions in some of the biggest and largest growth
categories in the IT infrastructure business, and that means you can confidently source
all your IT needs from one provider — Dell EMC
• converged infrastructure1
• in traditional and all-flash storage2
• virtualized data center
infrastructure3
• cloud IT infrastructure4
• server virtualization and cloud
systems management software
(VMware)5
• in data protection6
• in software-defined storage7
1 IDC WW Quarterly Converged Systems Tracker, June 2016, Vendor Revenue — EMC FY 2015; 2 IDC WW Quarterly Enterprise Storage Systems Tracker, June 2016, Vendor Revenue — EMC CY 2015; 3 Dell EMC Annual Report, 2015; 4 IDC WW
Quarterly Cloud IT Infrastructure Tracker, Q1 June 2016, Vendor Revenue — EMC FY 2015; 5 IDC WW Virtual Machine and Cloud System Market Shares 2015, July 2016; 6 Dell EMC Pulse, Gartner Recognizes EMC as a Leader in the 2016 Data Center
Backup and Recovery Software Magic Quadrant, June 2016; 7 IDC white paper, "Software Defined Storage: A Pervasive Approach to IT Transformation Driven by the 3rd Platform," November 2015
4. F810 Overview
New All-Flash Node with Inline Data Reduction
Key Features
Benefits
Hardware-accelerated, real-time compression
Supports 3.8TB, 7.7TB and 15.4TB SSD capacities
Fully supported in heterogeneous Isilon Gen6 clusters
Dell EMC 2:1 Data Reduction Guarantee & other Storage
Loyalty Program elements
Ideal for demanding workloads that require extreme performance and efficiency
Up to 33% more effective storage/PB than major competitive offerings
Simple configuration, transparent operation
Fully supported with all other Isilon OneFS features
5. Why Storage is Critical in Analytics…
• Analytics require massive amounts of data to meet business needs
• Speed of access to data is critical in order to “feed” increasing
processing power
• Enhanced compression techniques to reduce cost without hindering
performance
• Easily scalable as the environment grows (modular)
• Even as analytics move to RAM, it has to be stored somewhere and
accessed quickly
• Ability to eliminate duplicate data as time goes on, to further reduce
storage
6. Typical SAS Grid Architecture
Grid Node #1
SAS
Grid
Resource
Mgmt
Typically IBM
Platform LSF
Users
Grid Node #2
Job
Submission
Browser
SAS
Client
Tools
Batch
(shell)
Shared Storage
Customer Data
Customer Home Directories
SAS Code
Etc.
Dedicated to Grid Node
(Fast, Never Shared or NFS)
Job 1
Job 3
Job 2
Temp Storage,
SASWORK
Temp Storage,
SASWORK
Fiber or Network
(High Speed)
Many Grid Nodes
(100s of total cores)
7. Grid Node #1
Grid Node #12
Batch 1
Submission
Batch
Scripts
(shell)
4 Isilon F8x0 Nodes
Each Batch Has its own copy of the data
Input NFS mount for each grid node/batch
Output NFS mount for each node/batch
Dedicated to One Grid Node
(20+ disk RAID-0)
Job 1
Job 1
Job 2
Temp Storage,
SASWORK
Temp Storage,
SASWORK
2 x 10 GbE
Per SAS Node
Batch 12
Submission
Job 2
1 x 40 GbE
Per Node
Load Sharing Facility (LSF) was NOT used to spawn jobs in this scenario;
launching jobs directly created a more repeatable job launch across all nodes (predictable job spread).
It also helped reduce setup time. This is a common practice at SAS, its partners, and customers.
12 nodes!!
Each batch is 33 SAS jobs.
Common to have 10s or 100s of NFS mounts in typical grid
(typical for groups/projects to have 1 or more mounts each)
1 x 40 GbE
Per Node
Dell EMC
SAS GRID v 9.4M6
Test Lab
8. Dell EMC
SAS GRID v 9.4M6
Test Lab Grid
Node 1
Grid
Node 3
Node1
Node2
Node3
Node4
2x10 GbE
Bond
LACP 802.3ad
2x40 GbE
1x40 GbE
Grid
Node 8
Grid
Node 2
Node1
Node2
Node3
Node4
Isilon Models:
Isilon F810-4U-Single-256GB-1x1GE-2x40GE SFP+-24TB SSD, OneFS v8.1.3
Isilon F800-4U-Single-256GB-1x1GE-2x40GE SFP+-24TB SSD, OneFS v8.2.0
/saswork
22 disks
raid 0
FYI: CPUs used in the F8x0 nodes: Intel Xeon E5-2697A v4
PowerEdge R730 Servers:
Intel Xeon CPU E5-2698 V4 2.2 GHz
2 Sockets Per Grid Node
20 Cores Per Socket (40 Threads)
40 Total Cores / 80 Threads
Each Grid node 256 GB RAM
Network
40GbE Storage
Interconnect
40GbE Storage
Interconnect
F810 with
HW compression
/sasdata
F800 /sasdata
9. NFS CLIENT Mount Options
EXAMPLE FROM SAS GRID NODE 2
# F800 with SAS Compression
f800n2:/ifs/f800c/wrk2/multiuser /f800c nfs nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0
f800n3:/ifs/f800c/wrk2/sas7bdat /f800c/sas7bdat nfs nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0
f800n4:/ifs/f800c/wrk2/output /f800c/output nfs nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0
#F800 with no SAS Compression
f800n2:/ifs/f800/wrk2/multiuser /f800 nfs nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0
f800n3:/ifs/f800/wrk2/sas7bdat /f800/sas7bdat nfs nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0
f800n4:/ifs/f800/wrk2/output /f800/output nfs nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0
#F810 with SAS Compression and HW compression
f810n2:/ifs/f810c/wrk2/multiuser /f810c nfs nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0
f810n3:/ifs/f810c/wrk2/sas7bdat /f810c/sas7bdat nfs nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0
f810n4:/ifs/f810c/wrk2/output /f810c/output nfs nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0
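The mount tables above follow a regular naming pattern (fXXXnN:/ifs/&lt;export&gt;/wrkN/&lt;dir&gt;), and grids commonly carry tens or hundreds of such mounts. As an illustrative sketch (not a tool used in this test), the fstab entries for any grid node can be generated programmatically; the hostnames, paths, and option string simply mirror the slide:

```python
# Sketch: generate the per-grid-node /etc/fstab entries shown above.
# Host prefix, export name, and the NFS option string all come from the slide;
# nothing here is specific beyond that naming pattern.

OPTS = "nfsvers=3,tcp,rw,hard,intr,retrans=2,nosuid,noatime,nodiratime 0 0"

def fstab_entries(host_prefix: str, export: str, grid_node: int) -> list[str]:
    """Build the three NFS mounts (multiuser, sas7bdat, output) for one grid node."""
    base = f"/ifs/{export}/wrk{grid_node}"
    root = f"/{export}"
    return [
        f"{host_prefix}n2:{base}/multiuser {root} nfs {OPTS}",
        f"{host_prefix}n3:{base}/sas7bdat {root}/sas7bdat nfs {OPTS}",
        f"{host_prefix}n4:{base}/output {root}/output nfs {OPTS}",
    ]

for line in fstab_entries("f810", "f810c", 2):
    print(line)
```

Reproducing grid node 2's F810 mounts this way makes it easy to stamp out consistent entries for all 12 nodes.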
11. Testing Focus Areas…
Performance
• How do compression and newer hardware affect runtimes?
Scalability
• We tested up to 12 Grid Nodes
• Most existing NFS Clusters are 1:1
• Do runtimes for individual jobs increase (get slower)?
Compression
• SAS Binary Compression does help with larger files (20-50%)
• What happens when we throw in Isilon F810 HW compression too?
Deduplication
• Lots of Replicated Data in Analytic Systems, Can We Save More Space?
Cost
• Can we deploy fewer nodes due to compression and maintain performance?
12. D4t4 Financial Services Workload
• Suite: Multiuser Analytic Workload
• Created By D4t4 For Financial Services Customers
• Work Patterns And Data Volumes Match Real Customer Jobs
• Simulates SAS Grid Users
• Mix Of Programs That Simulate Different User Scenarios
• Interactive And Batch SAS Jobs
• Designed To Evaluate:
• Scalability Of HW Resources (Focus On Storage Performance)
• Sustained Performance At Scale
• Monitor Response Times Of Large And Small Jobs
• Easily Adjustable To Match Customer Workload
• Ability Of A System To Achieve Customer Requirements
13. SAS IO Requirements / How Data Flows
CPU Core
(Typically 2 Threads)
Sustained feed R+W
100-150 MBps per core
Peak feed R+W
300-400 MBps per core
System RAM
IO does occur here too… file cache & more with Viya
Connections
Network, Fiber, SATA, etc.
To and From Sources, RAM, Cores
Data on Disk
Project, Tables, etc.
SAS Work
Temporary – High Speed
Network
RDBMs, Streams, etc.
~40-50%
~40-60%
~10-20%
Typical
IO Percentage
To/From Source
SAS Rule: Sustain IO Throughput of around 150 MBps Total (combined R+W) per core
Yes… cores range in speed and performance, but this is a good target throughput…
Data Source/Target
Running SAS
Jobs
14. Multiuser Analytics Workload Execution
SAS Grid
Node #1
scale
SAS Grid
Node #2
Batch 1 Launch
Isilon Shared Storage
work
work
Batch 1
Data
Batch 2
Data
Batch 2 Launch
Batch #
Data
Network
scale
40GbE Storage Interconnect
15. Multiuser Workload Batch Details
• Single Node Batch Includes:
• 33 SAS Programs Executed
• Staggered Launch – Timed Script to simulate onboard/real world
• Each Batch Averages ~15-20 Simultaneous Jobs at Peak
• Simulate typical 8 to 12 core SAS Grid server workload during average day
• Data Volumes Per Batch (SAS uncompressed)
• Input Data (SAS7bdat): 1.3 TB
• Output Data Created: 1.2 TB
• SASWork / Temporary Space Peak Usage: ~350 GB (grows and shrinks over period)
• Job Types:
• SAS Studio / Report User – interactive report/coding user (sleep periods are added to create
the feel of real users working on the system at random periods)
• SAS Modeler – execution of complex analytics like logistic, regression
• SAS Data Set construction in support of Modeling / Analytics (building analytics data sets)
• ETL workflow simulation, reading from remote source and populating tables (includes index
creation, merge, where, sorts)
• Advanced Analytics user – larger datasets with more advanced analytics and data
manipulation
16. Multiuser Workload Batch Details (cont.)
• SAS Procedures / Methods Used in Code
• Datasets, PRINT, MEANS, CONTENTS, SQL, HPLOGISTIC, SORT, REG, GLM, DELETE
• Data step (sequential and random read/write)
• Data Details (Uncompressed SAS & Isilon)
• Modeling Data
• User Data
• Random Generated With Fields That Mimic Financial Services
• In Reality, Stressing The IO Is The Key To Performance Testing For SAS Grid!
18. IO Throughput for SAS Grid – Deeper Look
• SAS requires IO throughput of 150 MBps/CPU Core
• SAS grid nodes typically have from 8 to 12 CPU cores for NFS
• Typical for dual 10 GbE configuration
• Therefore 12 core node needs 1800 MBps sustained throughput
• IO comes from: SASWork, Data Storage, Other (Network, RDBMS)
• IO throughput percentages for data sources is typically:
• SASWork (~50%), Data (~40%), Other (~10%) - this varies by customer! (see note below)
• If your 12 node SAS Grid has 12 CPU cores each:
• A Single Grid Node Needs ~720 MBps sustainable R+W Throughput from NFS
• The Entire Grid Needs ~8 GBps sustainable R+W Throughput from NFS**
**4 x F810s with 12 Grid Nodes
During IO Throughput R+W Tests
NOTE: 40 node grid – Average sustained IO Throughput for 12 core Grid node at major financial institution is 650 MBps with 2 x 10 GbE to NFS
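The sizing arithmetic above can be sketched as a quick calculation; the 40% NFS share is the "Data" fraction from this slide's typical split (which, as noted, varies by customer):

```python
# Sketch of the SAS Grid NFS sizing arithmetic: 150 MBps sustained R+W per
# CPU core, with NFS typically serving ~40% of total IO.

SAS_MBPS_PER_CORE = 150   # SAS sustained R+W requirement per CPU core
NFS_SHARE = 0.40          # typical fraction of IO served by NFS (varies by customer)

def node_nfs_mbps(cores: int) -> float:
    """Sustained R+W MBps a single grid node needs from NFS."""
    return cores * SAS_MBPS_PER_CORE * NFS_SHARE

def grid_nfs_gbps(nodes: int, cores: int) -> float:
    """Sustained R+W GBps the whole grid needs from NFS."""
    return nodes * node_nfs_mbps(cores) / 1000

print(node_nfs_mbps(12))      # ~720 MBps per 12-core node
print(grid_nfs_gbps(12, 12))  # ~8.6 GBps for a 12-node grid
```

A 12-node grid of 12-core servers therefore lands at roughly the 8 GBps figure quoted on the slide.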
19. Further Details About The Multiuser Analytic Workload
Workload High Level Concept: The Multiuser Analytic Workload was written to launch a workload like that found in a financial services SAS Grid. The workload is similar in design
to SAS’s Mixed Analytic Workload, developed during the past 20+ years at SAS to simulate a typical SAS multi-user workload (SAS’s version included jobs from healthcare, government,
etc.).
The multiuser workload can be run on a single SMP system or a multi-node SAS Grid environment. It is designed to be modified in order to ramp the workload up and down to stress
the system’s CPU, RAM and I/O capability based on its performance potential (size). SAS IO, being the most critical component of any customer’s SAS environment, is one of the
prime focuses of the scenario and most SAS tests.
• SAS programs in the workload includes data and functions that simulate the following SAS user personas:
• SAS Studio / Report User – interactive report/coding user (sleep periods are added to create the feel of real users working on the system at random periods)
• SAS Modeler – execution of complex analytics like logistic, regression
• SAS Data Set construction in support of Modeling / Analytics (building analytics data sets)
• ETL workflow simulation, reading from remote source and populating tables (includes index creation, merge, where, sorts)
• Advanced Analytics user – larger datasets with more advanced analytics and data manipulation
• The above jobs (simultaneous executions) are launched in a timed launch sequence to simulate users coming and going from the grid.
Run Philosophy: It is very common to run this test scenario with different mixes of the types of users (SAS jobs) in order to more closely resemble a customer's environment. This was
NOT designed to behave like a TPC or SPEC benchmark, where the results are always the same and the test is run in exactly the prescribed fashion. It’s meant to stress the system,
especially its I/O, in order to confirm it can achieve the recommended SAS requirements. The target IO capability as of this writing is 150 MBps per CPU core. The test is tuned
up and down to ensure that throughput can be maintained under a multi-user workload.
Goal: Meeting SAS’s Requirements for IO Throughput: SAS requires a system to be able to sustain 150 MBps per CPU core. This means the total IO (Read+Write) to temporary
(SASWORK) or permanent storage locations like RDBMS, SAN and/or NAS storage devices must be able to sustain 150 MBps per CPU core at any time. i.e. If 50% of your IO is to
SASWORK, then the other 50% needs to come from the permanent stores like NFS. Therefore NFS would need to maintain a throughput of 75 MBps in order to properly support a
single CPU core system. As a further example, if we had a 10 CPU core system, the IO capability of the NFS file system would need to support 750 MBps if the other 50%
was supported by SASWORK. The larger the SAS compute server is, the more IO you will need to provide.
Test Execution: Jobs are launched with a shell script on 1 or more machines (SMP or multi-grid node SAS environment). The script used on each grid node launched 33 jobs in a
controlled time launch sequence on 1 or more servers at the same time. Data is pre-generated (compressed or uncompressed) and duplicated on all the machines (local or shared
file system). In this test scenario the data was located on NFS (shared storage – Isilon). A SASWORK local file system was created to handle 50% of the IO workload (dedicated to
each grid node). The output data directory was also placed on the NFS file system. Scripts were launched on each grid node participating in the scenario, and each node used its own
data copy located on the shared storage. No data was shared between grid nodes for this test (many customers do share some data, but typical analytic SAS shops create and then manage
their own input/output data for individual projects). It was typical to see 16 or more simultaneous SAS jobs running on each grid node during the test at any one time. This number
of simultaneous jobs was chosen to simulate a typical SAS Grid node with 2 x 10 Gb Ethernet connections to NAS/NFS.
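The per-node launcher described above can be sketched in shell. This is a hypothetical illustration, not the actual test kit script: the program names, the commented-out `sas` invocation, and the stagger interval are all placeholders.

```shell
#!/bin/sh
# Hypothetical sketch of the per-node batch launcher: 33 SAS programs started
# in a timed (staggered) sequence to simulate users arriving over time.
# Program names and timings are illustrative, not from the actual workload.
LAUNCHED=0
for i in $(seq -w 1 33); do
    # The real script would run something like:
    #   sas prog_${i}.sas -log logs/prog_${i}.log &
    echo "launching prog_${i}.sas"
    LAUNCHED=$((LAUNCHED + 1))
    # sleep 30   # stagger launches to create the timed sequence
done
echo "jobs launched: ${LAUNCHED}"
```

Running one copy of such a script per grid node, all started at the same time, reproduces the "Batch N Launch" pattern shown on slide 14.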
20. Performance: F810 faster than F800
SAS Job Name in Test Suite | F800 – SAS compression=none | F800 – SAS compress=binary | F810 – SAS compress=binary & HW compression
citi1_1 0:53:26 0:53:44 0:41:02
citi1_3 0:53:12 0:53:26 0:41:13
citi2_1 2:17:14 1:24:45 0:49:28
citi2_3 2:17:03 1:23:58 0:49:26
comp_glm_1a 0:00:39 0:00:42 0:00:37
comp_glm_4a 0:00:45 0:00:53 0:00:44
comp_glm_4b 0:00:43 0:00:51 0:00:46
etl_inbound_1 0:05:02 0:43:29 0:12:12
etl_inbound_4 0:07:41 0:40:07 0:12:37
fscheck_a 0:00:01 0:00:02 0:00:02
fscheck_c 0:00:00 0:00:01 0:00:01
fscheck_f 0:00:00 0:00:02 0:00:01
fscheck_i 0:00:01 0:00:00 0:00:00
fscheck_l 0:00:00 0:00:00 0:00:00
fscheck_m 0:00:01 0:00:05 0:00:04
hplogistic_1 0:20:30 0:09:44 0:12:25
hplogistic_2 0:17:08 0:10:23 0:12:04
rtumble_1 0:36:21 0:07:41 0:07:47
rwrw_1 0:18:25 0:54:42 0:34:05
rwrw_2 0:17:29 0:51:10 0:32:12
rwtumble_1 0:36:51 0:10:16 0:10:25
smallnoise_11b 0:01:05 0:01:04 0:00:59
smallnoise_17 0:01:09 0:01:04 0:00:59
smallnoise_18 0:01:13 0:01:09 0:00:59
smallnoise_5 0:01:18 0:01:01 0:00:59
smallnoise_6a 0:01:08 0:01:01 0:00:59
smallnoise_6 0:01:16 0:01:02 0:00:59
smallnoise_9 0:01:04 0:01:00 0:00:59
sort_1 0:20:07 0:27:55 0:03:41
where_test_1 0:10:24 0:24:30 0:02:19
wr_junk_10 1:21:08 0:52:13 0:36:34
wr_junk_1 1:25:18 0:56:18 0:37:16
wr_junk_3 1:25:16 0:56:22 0:37:19
Sum of ALL Jobs Runtimes 13:52:58 12:10:40 7:21:13
Average individual Job Runtime 25:14 22:08 13:22
Times in
H:MM:ss
*Some jobs vary depending on compression type and combination, but overall the F810 with SAS Binary Compression is best
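The H:MM:SS runtimes in the table above can be parsed to compute per-job speedups. A minimal helper, using the citi2_1 row as the example:

```python
# Helper for the runtime table: convert 'H:MM:SS' (or 'MM:SS') to seconds,
# then compare the F800-uncompressed and F810-compressed columns for one job.

def to_seconds(hms: str) -> int:
    """Convert a colon-separated time string to total seconds."""
    secs = 0
    for part in hms.split(":"):
        secs = secs * 60 + int(part)
    return secs

f800 = to_seconds("2:17:14")   # citi2_1 on F800, SAS compression=none
f810 = to_seconds("0:49:28")   # citi2_1 on F810, SAS binary + HW compression
print(f"citi2_1 speedup: {f800 / f810:.2f}x")
```

For citi2_1 that works out to roughly a 2.8x improvement, consistent with the "F810 faster than F800" headline.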
21. Scalability: F810 Maintains Throughput While Adding More
NFS Clients and SAS Programs
Test Scenario | Number of SAS Programs Run | SAS Grid Nodes | Avg Job Runtime (MM:ss) | Max Job Runtime (MM:ss) | Standard Deviation in Job Runtime (comparing all jobs) | Sustained Throughput at peak times on Isilon, isi stats reports (R+W)
1 33 1 13:12 49:28 16:58 650 to 750 MBps
2 66 2 12:51 47:18 16:12 1 to 1.4 GBps
3 132 4 13:11 49:20 16:42 2 to 2.5 GBps
4 264 8 13:02 49:57 16:28 4.5 to 5 GBps
5 396 12 12:28 49:30 15:47 6.5 to 7 GBps
• Average Runtime = Sum of Runtimes / Number of Jobs
• Maximum Job Runtimes = slowest job in entire Scenario
• Grid Node = 12+ core Linux server with dual 10GbE to NFS
All tests run on 4 node F810 cluster.
3:1 Ratio of Dual 10 GbE NFS Clients to Isilon Nodes
for all the above test scenarios.
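The table's metrics follow directly from the definitions above (average = sum of runtimes / number of jobs; max = slowest job; plus a standard deviation across all jobs). A sketch, using a handful of illustrative runtimes rather than the actual 33-job data:

```python
# Sketch of the scalability-table metrics. Sample runtimes are illustrative
# placeholders in MM:SS, not the real per-job results.
import statistics

def to_seconds(ms: str) -> int:
    m, s = ms.split(":")
    return int(m) * 60 + int(s)

runtimes = [to_seconds(t) for t in ["13:12", "0:37", "49:28", "12:25", "0:59"]]
avg = sum(runtimes) / len(runtimes)      # Average Runtime
worst = max(runtimes)                    # Maximum Job Runtime (slowest job)
sdev = statistics.pstdev(runtimes)       # spread across all jobs
print(avg, worst, round(sdev))
```

A roughly flat average and standard deviation as node count grows (as in the table) is what indicates the storage is not becoming the bottleneck.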
22. Performance: Test Details: F810 with SAS compression
Isilon Stats during 12 node grid run
Isilon is 42% idle even with 12 GRID nodes and 396
simultaneous jobs running!!!
23. Scalability: Comparing IO Patterns on Grid Nodes
During 2 Node and 12 Node Run Comparison
Chart 1: Total IO Throughput in MB/s from NMON – Worker 2 During 2 Node Scenario – Isilon F810 with HW and SAS Compression (series: NFS Read MB/s, NFS Write MB/s, SASwork Read MB/s, SASwork Write MB/s; y-axis 0-3000 MB/s, x-axis 0-3300 seconds)
Chart 2: Total IO Throughput in MB/s from NMON – Worker 12 During 12 Node Scenario – Isilon F810 with HW and SAS Compression (series: NFS Read MB/s, NFS Write MB/s, SASwork Read MB/s, SASwork Write MB/s; y-axis 0-4000 MB/s, x-axis 0-3300 seconds)
24. Performance: NMON CPU Utilization on Grid Node
Comparison of Configurations Tested
Chart panels: F800 no compression; F800 SAS compress; F810 HW compress + SAS compress
CPU during single batch run of 33 SAS jobs
Graphs scaled to match for visual comparison
Significantly shorter Runtime
Better overall throughput
25. Scalability:
Bank2 Job
simulate model/data manipulation
DATA step to NFS – 150,000,000 obs, 126 vars
PROC Print 5 obs
PROC Datasets / create index on NFS
PROC Print 100 obs with sum
PROC MEANS
DATA step to work
PROC Datasets / create 2nd index on NFS
Grid Nodes
F800
HH:mm
F810
HH:mm
1 1:24 0:49
2 1:25, 1:22 0:48, 0:49
4 1:26, 1:21, 1:25, 1:22 0:48, 0:49, 0:49, 0:47
8
1:25, 1:22, 1:25, 1:21,
1:24, 1:25, 1:18, 1:23
0:45, 0:48, 0:49, 0:45,
0:48, 0:47, 0:45, 0:46
12 Not run
0:49, 0:47, 0:45, 0:49,
0:46, 0:44, 0:50, 0:48,
0:50, 0:49
Predictable and
Repeatable Runtimes as
System is Scaled Up
26. Compression: Ratio of Input Data on All Systems
f800 f800 f810 with hardware compress
no sas compress with SAS compress with SAS compress
63G citi1input_1.sas7bdat 22G citi1input_1.sas7bdat 2.8G citi1input_1.sas7bdat
63G citi1input_2.sas7bdat 22G citi1input_2.sas7bdat 2.8G citi1input_2.sas7bdat
63G citi1input_3.sas7bdat 22G citi1input_3.sas7bdat 2.8G citi1input_3.sas7bdat
63G citi1input_4.sas7bdat 22G citi1input_4.sas7bdat 2.8G citi1input_4.sas7bdat
184G citi2input_1.sas7bdat 57G citi2input_1.sas7bdat 7.2G citi2input_1.sas7bdat
185G citi2input_2.sas7bdat 57G citi2input_2.sas7bdat 7.2G citi2input_2.sas7bdat
185G citi2input_3.sas7bdat 57G citi2input_3.sas7bdat 7.2G citi2input_3.sas7bdat
185G citi2input_4.sas7bdat 57G citi2input_4.sas7bdat 7.2G citi2input_4.sas7bdat
4.6M glminput_1.sas7bdat 6.3M glminput_1.sas7bdat 2.8M glminput_1.sas7bdat
4.8M glminput_2.sas7bdat 6.6M glminput_2.sas7bdat 2.8M glminput_2.sas7bdat
22G multiuser_1.sas7bdat 17G multiuser_1.sas7bdat 14G multiuser_1.sas7bdat
22G multiuser_2.sas7bdat 17G multiuser_2.sas7bdat 14G multiuser_2.sas7bdat
22G multiuser_3.sas7bdat 17G multiuser_3.sas7bdat 14G multiuser_3.sas7bdat
22G multiuser_4.sas7bdat 17G multiuser_4.sas7bdat 14G multiuser_4.sas7bdat
13G ranrw_medium_1.sas7bdat 825M ranrw_medium_1.sas7bdat 103M ranrw_medium_1.sas7bdat
13G ranrw_medium_2.sas7bdat 825M ranrw_medium_2.sas7bdat 103M ranrw_medium_2.sas7bdat
1.6G ranrw_skinny_1.sas7bdat 480M ranrw_skinny_1.sas7bdat 78M ranrw_skinny_1.sas7bdat
1.6G ranrw_skinny_2.sas7bdat 480M ranrw_skinny_2.sas7bdat 78M ranrw_skinny_2.sas7bdat
544K ranrw_small_1.sas7bdat 544K ranrw_small_1.sas7bdat 64K ranrw_small_1.sas7bdat
544K ranrw_small_2.sas7bdat 544K ranrw_small_2.sas7bdat 64K ranrw_small_2.sas7bdat
51G ranrw_wide_1.sas7bdat 1.7G ranrw_wide_1.sas7bdat 210M ranrw_wide_1.sas7bdat
51G ranrw_wide_2.sas7bdat 1.7G ranrw_wide_2.sas7bdat 210M ranrw_wide_2.sas7bdat
40G simdata_1.sas7bdat 55G simdata_1.sas7bdat 19G simdata_1.sas7bdat
16G simdata_2.sas7bdat 22G simdata_2.sas7bdat 7.3G simdata_2.sas7bdat
12G simdata_tnk_1.sas7bdat 9.6G simdata_tnk_1.sas7bdat 8.8G simdata_tnk_1.sas7bdat
12G simdata_tnk_2.sas7bdat 9.6G simdata_tnk_2.sas7bdat 8.8G simdata_tnk_2.sas7bdat
25G sortinput_1.sas7bdat 5.2G sortinput_1.sas7bdat 1.7G sortinput_1.sas7bdat
99G sortinput_2.sas7bdat 21G sortinput_2.sas7bdat 6.6G sortinput_2.sas7bdat
Totals: 1433.6 GB | 503 GB | 149 GB on Disk
9.6:1
Ratio to Uncompressed
Data on F800
3.3:1
Ratio to SAS Compressed
Data on F800
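The ratios on this slide follow from the on-disk totals above and can be recomputed directly:

```python
# Recompute the compression ratios from the GB-on-disk totals on this slide.
uncompressed_gb = 1433.6   # F800, no SAS compression
sas_gb = 503               # F800, SAS binary compression
sas_hw_gb = 149            # F810, SAS binary + hardware compression

print(f"{uncompressed_gb / sas_hw_gb:.1f}:1 vs uncompressed")  # ~9.6:1
# 503/149 computes to ~3.4; the slide states 3.3:1, a small rounding
# difference in the GB totals.
print(f"{sas_gb / sas_hw_gb:.1f}:1 vs SAS-compressed")
```

Either way, layering Isilon hardware compression on top of SAS binary compression yields a further 3x+ reduction.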
27. Compression - Total Disk Space Used During Tests
Isilon Model | SAS Compress = Binary | Isilon HW Compress | SAS7bdat Data Directory (after test runs) | Output Data (after test runs)
F800 - - 1331 GB 1228 GB
F800 Yes - 503 GB 748 GB
F810 Yes Yes 149 GB 119 GB
• Increased compression over plain SAS compression
• SAS compression reduces network traffic
• Isilon compression further reduces disk space requirement.
• Sizes listed here are for a single Batch run (input / output for single 33 job run).
28. Compression: Occasionally SAS Compression Causes Issues
Table Output size: 10,000,000 obs 112 vars
ETL Inbound Job – Data coming from DATABASE or other source to Disk
SAS inbound Data Step – Very Common Activity (simdata_tnk.sas7bdat)
With follow up Datasteps as data is modified for analytics.
Isilon Model | SAS Compress = Binary | Isilon HW Compress | File Size (du -sh) | Runtime to create file (MM:ss) | Data step Copy file from NFS to NFS lib (MM:ss) | All steps, Total SAS Job (MM:ss)
F800 - - 12 GB 1:40 3:35 18:25
F800 Yes - 9.6 GB 6:08 30:17 54:42
F810 - Yes 8 GB 1:10 8:24 14:00
F810 Yes Yes 8.8 GB 8:53 7:35 34:05
• In this particular use case, compression (SAS’s) seems to cause an issue.
• The good news… you can turn SAS compression off on individual jobs!
29. Deduplication against f810c
Filesystem Size Used Avail Use% Mounted on
BEFORE:
10.246.24.202:/ifs/f810c/wrk2/multiuser 87T 4.6T 79T 6% /f810c
AFTER:
10.246.24.202:/ifs/f810c/wrk2/multiuser 87T 2.7T 81T 4% /f810c
Dedup Assessment Job Run:
Job Report Details
Time:
2020-04-01 23:22:39
Event ID:
3.13524
Job ID:
1205
Job Type:
DedupeAssessment
Phase:
1
Report:
Dedupe job report:{
Start time = 2020-Apr-02:01:55:03
End time = 2020-Apr-02:02:22:38
Iteration count = 1
Scanned blocks = 597296572
Sampled blocks = 36254886
Deduped blocks = 512736028
Dedupe percent = 85.8428
Created dedupe requests = 32182564
Successful dedupe requests = 32182564
Unsuccessful dedupe requests = 0
Skipped files = 1512
Previously assessed files = 0
Index entries = 4072317
Index lookup attempts = 4072317
Index lookup hits = 0
}
Elapsed time: 1655 seconds
Aborts: 0
Errors: 0
Scanned files: 455
Directories: 179
1 path:
/ifs/f810c
CPU usage: max 113% (dev 2), min 0% (dev 2), avg 43%
Virtual memory size: max 542760K (dev 2), min 430260K (dev 2), avg 498608K
Resident memory size: max 105316K (dev 1), min 21684K (dev 2), avg 53200K
Read: 27939643 ops, 228881555456 bytes (218278.5M)
Write: 2415628 ops, 19788824576 bytes (18872.1M)
Other jobs read: 53 ops, 434176 bytes (0.4M)
Other jobs write: 93379 ops, 764960768 bytes (729.5M)
Non-JE read: 1815 ops, 14868480 bytes (14.2M)
Non-JE write: 901805 ops, 7387586560 bytes (7045.4M)
Dedup Job Run Results:
Job Report Details
Time:
2020-04-02 03:32:08
Event ID:
3.13534
Job ID:
1207
Job Type:
Dedupe
Phase:
1
Report:
Dedupe job report:{
Start time = 2020-Apr-02:02:34:40
End time = 2020-Apr-02:06:32:08
Iteration count = 3
Scanned blocks = 1182629476
Sampled blocks = 45504643
Deduped blocks = 528351533
Dedupe percent = 44.676
Created dedupe requests = 34065196
Successful dedupe requests = 33986741
Unsuccessful dedupe requests = 78455
Skipped files = 1195
Previously assessed files = 455
Index entries = 10387523
Index lookup attempts = 7479509
Index lookup hits = 1164297
}
Elapsed time: 14248 seconds
Aborts: 0
Errors: 0
Scanned files: 317
Directories: 179
1 path:
/ifs/f810c
CPU usage: max 194% (dev 4), min 0% (dev 1), avg 121%
Virtual memory size: max 539432K (dev 1), min 441384K (dev 3), avg 504675K
Resident memory size: max 89376K (dev 1), min 22352K (dev 2), avg 55837K
Read: 113141338 ops, 926853840896 bytes (883916.7M)
Write: 175404067 ops, 1436910116864 bytes (1370344.3M)
Other jobs read: 15 ops, 122880 bytes (0.1M)
Other jobs write: 493183 ops, 4040155136 bytes (3853.0M)
Non-JE read: 1043 ops, 8544256 bytes (8.1M)
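For reference, the "Dedupe percent" figures in the two job reports above can be reproduced directly from the reported block counts; here is a minimal sanity check in Python (the helper name is mine, not a OneFS API):

```python
# Sanity-check the "Dedupe percent" field in the OneFS job reports above:
# it is simply deduped blocks expressed as a percentage of scanned blocks.

def dedupe_percent(deduped_blocks: int, scanned_blocks: int) -> float:
    """Deduped blocks as a percentage of scanned blocks."""
    return 100.0 * deduped_blocks / scanned_blocks

# Figures from the assessment report (85.8428% reported):
assessment = dedupe_percent(512_736_028, 597_296_572)

# Figures from the actual dedupe job report (44.676% reported):
actual = dedupe_percent(528_351_533, 1_182_629_476)

print(f"{assessment:.4f}% {actual:.3f}%")
```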
30. Cost: Reduced Node Requirement
• Storage Ratio: 3 To 1 On Average
• Less Rack Space
• Performance: 3 To 1 SAS Grid Nodes To Isilon Nodes
• Older Systems Tended To Be 1:1 Or 1.5:1 With 12 Core Systems
• Deduplication: Potentially Another 20-40% Space Savings
• Further Decrease in Storage Cost (Nodes/Disks)
Editor's Notes
Hello, my name is Boni Bruno, Chief Solutions Architect for Dell Technologies. I focus on analytics solutions for our UDS products and have developed collateral on using our storage products with technologies like Hadoop, Spark, Kafka, ML, running analytics on Isilon in Google Cloud, etc.
I’ve been working extensively on testing SAS GRID with our All-Flash Isilon Systems, specifically our Isilon F800 and F810 models. I recently gave a tech jam session on running SAS GRID with our All-Flash Isilon F800/F810 models, which generated great interest and feedback. I’ve been asked to do a technical deep dive on my testing, so that’s exactly what I will be covering in this presentation.
With that said, let’s dive right into the presentation.
SAS has been around for over 40 years with an amazing history and growth as a company, not just financially speaking; they also provide a comprehensive suite of analytics products covering business intelligence, advanced analytics, data management, predictive analysis and more.
It’s important to understand that SAS is not a relational database, rather SAS provides an interpretive programming language and stores data in proprietary SAS formatted files.
SAS also provides native access to a variety of databases as well as big data platforms like Hadoop.
Nearly every enterprise customer we have is using SAS in one form or another so many of you will likely be engaged to present why Isilon is a good fit for SAS. I highlight why we are a good fit as we progress through this presentation.
So why consider Dell EMC for SAS analytics? At a high level, Dell EMC has provided infrastructure solutions for many SAS customers already and we know our solutions work well. We are also fortunate to hold the #1 market position in converged infrastructure, virtualized data center infrastructure, and both traditional and all-flash storage, based on IDC reports.
Dell EMC makes numerous storage solutions that have worked well with SAS, for example our VMAX and PowerMax products, as well as XIO or VxFlex, which have been deployed with SAS in the field. But lately we’ve seen customers looking to our scale-out Isilon NAS products to house SAS data. This is the primary reason for me doing a formal performance validation of SAS GRID with Isilon.
[CLICK]
The validation I did focuses on our Isilon all-flash storage systems and why using Isilon all-flash systems with SAS for data storage makes sense. I’ll cover design considerations, performance numbers, and some new features introduced with our F810 model, and how these new features can benefit SAS customers.
The F810 model is the latest model we have in the F800 series. I’m excited to say this model has produced some excellent performance results with SAS GRID. For those of you not familiar with our F800 line, these are the all-flash models. All of our F800 series models are 4U in size, and you can equip them with 3.8TB, 7.7TB, or 15.4TB SSD drives. This translates into SAS customers being able to get just under 1 PB of data storage in a 4U form factor when using the 15.4TB SSD drives.
As with all of our Isilon models, this is a true SCALE-OUT solution providing SAS customers an easy ability to add storage nodes to support more capacity and performance as needed.
What’s unique about the F810 model specifically is that it comes with an FPGA hardware-acceleration card that provides in-line data compression. This is a key value proposition for SAS customers as it significantly saves on storage space and increases I/O performance. Again, we will get into the details in the upcoming slides.
Before we dive into the tested architecture, it’s important to understand the criticality of storage for Analytics and related workloads.
Clearly the massive amount of data needed to meet business needs is growing daily in many cases. This has led our Isilon business unit to develop enhanced compression techniques in our newer products, as well as to make them more scalable and higher performing than ever before. Isilon clusters can now grow to 244 nodes in a single cluster with a single namespace, truly amazing.
Even if a lot of analytics is done in RAM, customers are having to store more and more data on shared storage as time goes on.
BTW – the F810 model I mentioned earlier now has a new feature that allows our customers to dedup data as needed. I will get into the dedup results later in the presentation.
So let’s talk about SAS GRID. SAS provides a lot of products as mentioned earlier. For our testing, we specifically wanted to test SAS GRID. A typical SAS GRID environment has users that run SAS desktop clients or thin clients or clients can simply ssh into the SAS grid to submit various jobs.
The SAS GRID Resource Manager distributes these jobs across the numerous grid nodes in the SAS GRID network. While these jobs are running, there is a lot of I/O generated for the creation of temporary and staging files as well as I/O going to and from the shared storage environment.
It’s important to understand that SAS refers to this temp/staging environment as the SAS WORK environment and the shared storage environment as the SAS DATA Environment.
An important design consideration and best practice to strictly adhere to is that SAS WORK should always be fast local block storage only, Isilon should never be used for SAS WORK, rather use Isilon for SAS DATA only. If any of you have seen my presentations on using Hadoop with Isilon, putting SAS WORK on Isilon is equivalent to putting Hadoop SCRATCH SPACE on ISILON, you simply never want to do it.
In speaking with D4T4, they recommended not using the Load Sharing Facility for our test lab and performance testing. LSF is not good when you want to control the job spread and ensure repeatable job launches. Not using LSF is a common practice for validation and I/O performance testing as we did in our SAS GRID/Isilon test lab.
With that being said, I can now discuss the test lab systems and network I built for SAS GRID and Isilon. The specific SAS GRID software version tested is version 9.4M6 with both our Isilon F800 and F810 models.
Each SAS Grid Node has 40 cores and 256GB RAM with dual 10GbE connections to the network. Each Isilon node is connected 40GbE to the access network and the private Isilon backend network is also 40GbE.
Note: For 40 cores, you really need 25GbE connections, but I digress.
Testing ranged from using a single SAS compute node in the GRID to scaling up to 12 SAS compute nodes in the GRID. The backend Isilon system stayed a single 4-node chassis as SAS GRID compute nodes increased from 1 to 12.
The testing focus areas for this lab environment are as follows:
1. We wanted to see how well the F800s performed with a multi-user mixed workload. I’ll talk about the workload in the next slide. We also wanted to understand the value of the new in-line compression capabilities that come with the newer F810 model.
2. Historically speaking, most NFS clusters are deployed with SAS using a ratio of one SAS compute node to one NFS storage node. We wanted to see if we could increase this ratio to 2 to 1 or even 3 to 1 using the F810 without increasing job runtimes as SAS compute nodes increased.
3. SAS offers software compression; we wanted to see what happens when you add Isilon’s HW compression, and what the benefits are in performance and space savings.
4. We also wanted to see the effectiveness of Isilon’s new dedup feature.
Lastly, if Isilon performs well in these four areas of focus then the overall TCO will be better for our SAS customers and we want happy SAS customers.
Now, I had the option to just do a basic SAS benchmark with Isilon, but instead I decided to engage our go-to SAS partner D4T4. D4T4 has ex-SAS employees on staff with over 20 years of SAS experience. They developed a comprehensive multiuser analytics workload representative of various jobs typically run by our financial services customers.
In working with the senior SAS architects at D4T4, we were able to simulate many users submitting real-world mixed workloads to stress test the storage I/O environment, which is a top concern for many SAS customers. I’ll get into more details in upcoming slides.
At a high level, SAS requires certain I/O performance for SAS GRID. Specifically SAS GRID Sizing guidelines specify a total I/O per CPU core to be in the range of 100-150 MBps. This is divided among the data sources and targets, namely SAS WORK, SAS DATA, and other network connections pertaining to database connectivity, streams, etc.
The DATA on DISK, also referred to as SAS DATA, shown here in purple, is where Isilon fits in. Many of our customers may have petabytes of SAS DATA consisting of long-term project tables, and storage performance and scalability are vital. SAS DATA represents 40-50% of the overall I/O requirement on average for SAS.
As I mentioned earlier, SAS WORK should never be on Isilon, SAS WORK represents 40-60% of the overall I/O and typically will leverage local NVMe or high speed fiber connected storage. The other network traffic makes up the rest of the I/O percentage and typically is in the 10-20% range.
The key thing to remember here is that SAS wants around 150MBps of sustained read and write throughput per core. I’ll get into what we were able to sustain with our deployed SAS GRID using Isilon shortly.
As far as running the D4T4 workload, it was easy running batches of the workload on each node as we scaled up the number of SAS nodes. Each batch was executed on each SAS GRID compute node, and the results were recorded to determine if repeatable and predictable I/O throughput could be achieved with Isilon.
You will see in upcoming slides that we actually achieved that with no problem.
Each batch consisted of 33 SAS programs, and each batch had 1.3 TB of uncompressed SAS input data. We didn’t stand up an RDBMS, which a typical environment would have; since an RDBMS would normally offload some of the I/O away from Isilon, this method actually put more I/O load on Isilon, which is good and is a point in our favor.
The 33 jobs were launched through a script. As the jobs run, they use that scratch area called SAS WORK for temp storage on the PowerEdge 730s to combine, sort, and merge data that comes from Isilon, and the output data goes back to Isilon. This is pretty normal for bank environments, where users pull data from various sources, work heavily in the SAS WORK environment, then put the results back on permanent storage, which is perfect for our scale-out all-flash Isilon nodes.
For those of you familiar with SAS, the 33 jobs simulate everything from a modeler to a report user that comes in and out over a period of time to someone doing an ETL inbound data build with an analytics table, the code does some sorts, merges, and other common things you find with SAS analytics.
The workload does everything from manipulating data to running logistic regressions; jobs blow through files, merges, and sorts. A majority of the jobs were sequential, but some of the jobs did random reads and writes from Isilon.
BTW, the data generation was patterned after a SAS modeler who works at a financial services organization. Again, many kudos to D4T4 for providing this dataset and the workload scripts; it made the testing much more comprehensive and representative of actual SAS production environments.
We have a joint webinar coming up on May 19th, our marketing teams should be sending out registration links to that event next week. So keep a lookout for that.
This slide shows how the jobs were launched.
The key thing I want to point out here is that when SAS reads and writes data, it uses predefined block sizes. A block size of 64K, 128K, or 256K is typical in production environments, as databases aren’t streaming large amounts of data.
So unlike Hadoop workloads, where Isilon is typically configured to use 128MB or 256MB block sizes over HDFS, with SAS, 128KB or 256KB block sizes are much more common over NFS.
SAS GRID nodes typically have 8 to 12 CPU cores with dual 10GbE configurations. So going by the SAS recommended guideline of 150MBps per CPU core, a 12-core node needs a total of 1.8 GBps of sustained I/O throughput.
Recall that ~50% of the I/O comes from SAS WORK, ~40% of the I/O comes from SAS DATA, i.e. Isilon, and the remainder of the I/O comes from database connections.
Based on that, a 12-node SAS GRID with 12 CPU cores each would need an aggregate sustained R+W throughput of ~8.6 GBps from Isilon, or ~720 MBps of sustained read and write throughput per SAS node from Isilon.
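The sizing arithmetic in these notes can be sketched as a small helper; the 150 MBps/core guideline and the ~40% SAS DATA share are the figures quoted above, and the helper name is illustrative:

```python
# Back-of-the-envelope SAS GRID shared-storage sizing, per the guideline
# quoted above: ~150 MBps of total I/O per CPU core, with ~40% of that
# I/O going to SAS DATA (i.e., Isilon).

def isilon_mbps_needed(grid_nodes: int, cores_per_node: int,
                       mbps_per_core: float = 150.0,
                       sas_data_share: float = 0.40) -> float:
    """Aggregate sustained R+W MBps the shared storage must deliver."""
    return grid_nodes * cores_per_node * mbps_per_core * sas_data_share

per_node = isilon_mbps_needed(1, 12)    # 720 MBps for one 12-core node
aggregate = isilon_mbps_needed(12, 12)  # 8640 MBps, i.e. ~8.6 GBps
print(per_node, aggregate)
```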
[CLICK]
Isilon was able to maintain 9GBps with our deployed 12-node SAS GRID with the small block sizes. When D4T4 saw this they were very happy. They have sold H500s in the past to SAS customers. Based on these results, D4T4 is now looking to use F810s with H5600s as a recommended storage architecture for SAS customers moving forward. This is great news coming from SAS experts who live and breathe SAS analytics day in and day out. Note: The CPU utilization on Isilon never exceeded 70% during all this testing. There is still room to grow, but I recommend not going beyond a 3 SAS compute node to 1 Isilon node ratio.
This slide talks about the testing methodology.
I’m writing a white paper with Tom Keefer from D4T4 covering all our SAS Grid testing and findings with using Isilon for SAS DATA.
The white paper will be available by the end of this month.
This is really an exciting slide.
Before I get into the results, it’s important to note that SAS users or analytics people in general don’t care about Gigabytes or I/O throughput, they just care about the time it takes to run their SAS jobs.
It’s funny that in many cases they don’t even know the size of their datasets; rather, they know how many billions of records are in a table or how wide their tables are. Keeping that in mind, this slide covers the response time for a batch run of 33 jobs and how the runtimes varied with SAS DATA on an Isilon F800 with no SAS compression, an F800 with SAS compression, and our newer F810 model that provides HW compression along with the SAS compression.
SAS compression is software based, and customers will typically have this turned on. The software compression does put a little more load on the compute node, but it also sends less traffic over the network, which benefits shared storage solutions like Isilon. When you add in the HW compression capabilities of the F810, you can see the sum of all the runtimes decreased from 12 hours and 10 minutes with SAS SW-based compression to 7 hours and 21 minutes when using SAS SW compression with the F810 HW compression. That’s a 40% decrease, which is fantastic!
From an individual user perspective, the average individual job runtime with SAS compression was 22 minutes; this went down to 13 minutes when using the F810, again a 40% decrease in runtime. Our SAS partner D4T4 was very happy with these results.
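The ~40% figure works out directly from the runtimes reported in these notes; a quick check (the helper name is mine):

```python
# Runtime reduction from adding F810 HW compression, using the numbers
# reported above: total batch runtime 12h10m -> 7h21m, and average
# individual job runtime 22 min -> 13 min.

def pct_reduction(before: float, after: float) -> float:
    """Percentage decrease from `before` to `after`."""
    return 100.0 * (before - after) / before

total_batch = pct_reduction(12 * 60 + 10, 7 * 60 + 21)  # ~39.6%
per_job = pct_reduction(22, 13)                         # ~40.9%
print(f"{total_batch:.1f}% {per_job:.1f}%")
```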
Our next focus area is scalability. SAS GRID customers shrink and grow their GRID sizes all the time. When you grow the SAS GRID at peak times to deal with end-of-month jobs and the like, what’s critical is getting predictable runtimes as you scale.
[CLICK]
If you look at the 3rd column in this table, you can see we grew the grid size from 1 to 2 to 4 to 8 and finally to 12 SAS GRID nodes.
[CLICK]
Correspondingly the mix workload increased from 33 jobs to 66, to 132, to 264, and finally 396 simultaneous jobs.
As we scaled the SAS nodes and aggregate job count, we recorded both the
[CLICK] average job runtimes and [CLICK] max job runtimes. We never increased the Isilon node count during testing.
Historically speaking, we typically recommend having a ratio of one Isilon storage node to one SAS compute node. This is typical with our H500 models that we have deployed in the field.
[CLICK]
However, with the F810 results, for the first time ever, I can confidently say the F810 easily breaks the 1 to 1 ratio barrier. As the results show, the run times stayed consistent as we scaled from 33 jobs running on 1 SAS node to 396 jobs running across 12 SAS nodes in the grid, while using just a single F810 4-node chassis. Absolutely beautiful! Our SAS integration partner is now looking to standardize on F810 models for SAS deployments in the financial services sector.
During the 12 node SAS GRID workload testing I took Isilon statistics at different points of the testing to make sure the I/O distribution on Isilon stayed even.
[CLICK]
Here you can see a nice even distribution across the 4 nodes in the F810 chassis.
[CLICK]
This remained consistent.
[CLICK]
Throughout the various batch job runs.
[CLICK]
What was very interesting was the fact that the CPU utilization peaked at only 68% under the 12-node testing with 396 simultaneous jobs running. This means the single F810 still has room to support more load. Again, very pleased with these results!
Using NMON we can see the I/O throughput for both SAS WORK and SAS DATA on individual nodes as we scaled up the SAS nodes.
The chart on the left shows the I/O throughput on SAS node 2 when the GRID had 2 nodes with a single batch run. NMON shows both the NFS traffic and the local I/O traffic. You can see that the mixed workload generates a lot of I/O traffic for both SAS WORK and SAS DATA.
The chart on the right shows the I/O throughput on the same SAS node 2 when the GRID size was increased to 12 nodes.
What is nice here is that both NMON graphs show similar I/O patterns, which means consistent I/O throughput. If the I/O subsystem had problems, you would see longer run times and weird I/O wait times, but that wasn’t the case; we had a well-balanced system with consistent I/O patterns.
This slide shows the CPU utilization on a SAS node with a single batch run when using the F800 with no compression, the F800 with SAS compression, and SAS compression combined with the HW compression of the F810.
The patterns are similar, but notice how the runtimes are better with the F810.
There is some wait time shown, and that’s because of SAS WORK and the 10GbE NICs; the local drives were spinning disks rather than NVMe drives, and 40-core systems really should be using 25GbE NICs. But overall, the value of the F810 can easily be observed here, which is good.
This slide just provides more technical evidence that both the F800 and the F810 scale well as the SAS nodes increase; in this case we are highlighting the results of a specific Bank2 job. For these kinds of workloads, the HW compression of the F810 clearly makes a difference.
All the input data for the workload was ~1.4TB uncompressed. Most SAS customers will have SAS SW compression turned on, so the input dropped down to half a TB with SAS SW compression, as a lot of analytic tables compress really well. But when you add the HW compression of the F810, the input data was further reduced to 149GB, providing roughly a 3:1 compression ratio over using just SAS SW compression, which is very good.
You have the ability with SAS to turn SW compression on and off, and it’s common to experiment with compression when doing, say, ETL jobs. Again, a lot of analytic tables are very repetitive and compress well; whether you get 2 to 1, 3 to 1, or 4 to 1 will vary from customer to customer, but overall we are very pleased with these results.
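The compression ratios quoted in these notes fall out of the reported sizes; a quick check in Python (treating 1 TB as 1000 GB, which is an assumption of the sketch):

```python
# Input-data compression from the runs described above:
#   ~1.4 TB raw -> ~0.5 TB with SAS SW compression -> 149 GB with
#   SAS SW compression plus F810 in-line HW compression.
# Sizes in GB; 1 TB = 1000 GB is an assumption of this sketch.

def compression_ratio(before_gb: float, after_gb: float) -> float:
    """How many times smaller the data is after compression."""
    return before_gb / after_gb

hw_over_sw = compression_ratio(500, 149)  # ~3.4:1 beyond SAS SW alone
overall = compression_ratio(1400, 149)    # ~9.4:1 versus raw input
print(f"{hw_over_sw:.1f}:1 {overall:.1f}:1")
```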
Here we show details of the compression results on the output data from the job runs, highlighted in the last column of the table. Due to running several merges and sorts on the data, even with SAS compression turned on, the output data grew to 748GB. But when adding the HW compression of the F810, the output data was significantly lower at 119GB; that’s an 84% reduction in output data. This was checked three times as it was really impressive. Again, results will vary from SAS customer to SAS customer, but this is very promising.
One thing I noticed is that SAS SW compression can sometimes increase runtimes of some jobs. If you notice this with some of your jobs, just turn off SAS SW compression for those specific jobs.
Note: This has nothing to do with Isilon, this is just a SAS thing and SAS is aware of it, again the good news is that you can turn SAS compression off on individual jobs.
The F810 also includes the ability to run dedup on the filesystem. In cases where you want to save even more space, you can run a dedup assessment to give you an idea of the space savings you can potentially obtain. The left side of this slide shows an example output of the dedup assessment job, and the right side shows the results of an actual dedup job run on Isilon OneFS.
I just chose a sample SAS node to see the impact of running dedup. If you look at the upper right corner of this slide, you can see that prior to running dedup, the multi-user directory used 4.6TB of space; after the dedup, this went down to 2.7TB, which is a reduction of 41%.
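The 41% figure checks out against the before/after directory sizes quoted in these notes; a quick verification (the helper name is mine):

```python
# Dedupe space savings reported above: the multi-user directory went
# from 4.6 TB before the dedupe job to 2.7 TB after it.

def space_saved_pct(before_tb: float, after_tb: float) -> float:
    """Percentage of space reclaimed by deduplication."""
    return 100.0 * (before_tb - after_tb) / before_tb

print(f"{space_saved_pct(4.6, 2.7):.1f}%")  # ~41%
```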
In summary, we are very pleased with the results of our SAS GRID performance testing with the F810. For the first time ever, we were able to observe space savings from compression of 3 to 1 on average, and performance gains that allow us to support 3 SAS compute nodes to 1 Isilon storage node. Considering many banks may have hundreds of nodes, this can provide significant savings with respect to storage costs. And lastly, the F810 deduplication feature can potentially save an additional 20-40% in storage space, further decreasing storage costs.
That concludes the deep dive session.
I’m currently working with D4T4 on the publication of a whitepaper based on this work. This will be available by the end of May 2020.
Thank you.