This document outlines a 30-day plan to address common data struggles around loading, integrating, analyzing, and collaborating on data using Snowflake's data platform. It describes setting up a team, defining goals and scope, loading sample data, testing and deploying business logic transformations, creating warehouses for business intelligence tools, and connecting BI tools to the data. The goal is that after 30 days, teams will be collaborating more effectively, able to easily load and combine different data sources, have accurate business logic implemented, and gain more insights from their data.
Snowflake's Kent Graziano talks about what makes a data warehouse as a service and some of the key features of Snowflake's data warehouse as a service.
The document discusses elastic data warehousing using Snowflake's cloud-based data warehouse as a service. Traditional data warehousing and NoSQL solutions are costly and complex to manage. Snowflake provides a fully managed elastic cloud data warehouse that can scale instantly. It allows consolidating all data in one place and enables fast analytics on diverse data sources at massive scale, without the infrastructure complexity or management overhead of other solutions. Customers have realized significantly faster analytics, lower costs, and the ability to easily add new workloads compared to their previous data platforms.
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit... (Amazon Web Services)
Snowflake is a data warehouse built from the ground up for the cloud. It was founded in 2012 and has raised $1 billion in funding. Snowflake's architecture separates storage, compute, and metadata services, allowing it to offer unlimited scalability, multiple clusters that can access shared data with no downtime, and full transactional consistency across the system. Snowflake has over 2,000 customers, including large enterprises that use it for analytics, data science, and sharing large volumes of data securely.
Introducing Snowflake, an elastic data warehouse delivered as a service in the cloud. It aims to simplify data warehousing by removing the need for customers to manage infrastructure, scaling, and tuning. Snowflake uses a multi-cluster architecture to provide elastic scaling of storage, compute, and concurrency. It can bring together structured and semi-structured data for analysis without requiring data transformation. Customers have seen significant improvements in performance, cost savings, and the ability to add new workloads compared to traditional on-premises data warehousing solutions.
Organizations are struggling to make sense of their data within antiquated data platforms. Snowflake, the data warehouse built for the cloud, can help.
This document outlines an agenda for a 90-minute workshop on Snowflake. The agenda includes introductions, an overview of Snowflake and data warehousing, demonstrations of how users utilize Snowflake, hands-on exercises loading sample data and running queries, and discussions of Snowflake architecture and capabilities. Real-world customer examples are also presented, such as a pharmacy building new applications on Snowflake and an education company using it to unify their data sources and achieve a 16x performance improvement.
Snowflake: The Good, the Bad, and the Ugly (Tyler Wishnoff)
Learn how to solve the top 3 challenges Snowflake customers face, and what you can do to ensure high-performance, intelligent analytics at any scale. Ideal for those currently using Snowflake and those considering it. Learn more at: https://kyligence.io/
The document discusses Snowflake, a cloud data warehouse company. Snowflake addresses the problem of efficiently storing and accessing large amounts of user data. It provides an easy to use cloud platform as an alternative to expensive in-house servers. Snowflake's business model involves clients renting storage and computation power on a pay-per-usage basis. Though it has high costs, Snowflake has seen rapid growth and raised over $1.4 billion from investors. Its competitive advantages include an architecture built specifically for the cloud and a focus on speed, ease of use and cost effectiveness.
Introduction to the Snowflake data warehouse and its architecture for big data companies. Centralized data management. Snowpipe and the COPY INTO command for data loading. Stream loading and batch processing.
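As a rough illustration of the two loading paths mentioned above, the sketch below pairs a batch COPY INTO statement with a Snowpipe definition for continuous loading; the stage, table, and pipe names are hypothetical.

-- Batch path: load staged JSON files into a single-VARIANT-column table on demand (hypothetical names)
COPY INTO raw_events
  FROM @events_stage
  FILE_FORMAT = (TYPE = 'JSON');

-- Continuous path: a pipe that runs the same COPY whenever new files land in the stage
CREATE PIPE events_pipe
  AUTO_INGEST = TRUE
  AS
  COPY INTO raw_events
    FROM @events_stage
    FILE_FORMAT = (TYPE = 'JSON');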
Snowflake concepts and hands-on expertise to help you get started implementing data warehouses using Snowflake, plus the information and skills that will help you master Snowflake essentials.
Every day, businesses across a wide variety of industries share data to support insights that drive efficiency and new business opportunities. However, existing methods for sharing data demand great effort from data providers to share it, and from data customers to make use of it.
Existing approaches to data sharing (such as email, FTP, EDI, and APIs) carry significant overhead and friction. Legacy approaches such as email and FTP were never intended to support today's big data volumes, and other sharing methods involve enormous effort as well. All of these methods require not only that the data be extracted, copied, transformed, and loaded, but also that the related schemas and metadata be transported with it. This burdens data providers, who must deconstruct and stage data sets, and the burden is mirrored for data recipients, who must reconstruct the data.
As a result, companies are handicapped in their ability to fully realize the value in their data assets.
Snowflake Data Sharing allows companies to grant instant access to ready-to-use data to any number of partners or data customers without any data movement, copying, or complex pipelines.
Using Snowflake Data Sharing, companies can derive new insights and value from data much more quickly and with significantly less effort than current data sharing methods. As a result, companies now have a new approach and a powerful new tool to get the full value out of their data assets.
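For context, here is a minimal sketch of how a provider might expose a table through Snowflake Data Sharing; the database, schema, table, share, and account names are all hypothetical.

-- Provider side: create a share and grant read access to one table (hypothetical names)
CREATE SHARE sales_share;
GRANT USAGE ON DATABASE sales_db TO SHARE sales_share;
GRANT USAGE ON SCHEMA sales_db.public TO SHARE sales_share;
GRANT SELECT ON TABLE sales_db.public.orders TO SHARE sales_share;

-- Make the share visible to a consumer account; no data is moved or copied
ALTER SHARE sales_share ADD ACCOUNTS = partner_account;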
How to Take Advantage of an Enterprise Data Warehouse in the Cloud (Denodo)
Watch full webinar here: [https://buff.ly/2CIOtys]
As organizations collect increasing amounts of diverse data, integrating that data for analytics becomes more difficult. Technology that scales poorly and lacks support for semi-structured data cannot meet the ever-increasing demands of today’s enterprise. In short, companies everywhere struggle to consolidate their data into a single location for analytics.
In this Denodo DataFest 2018 session we’ll cover:
Bypassing the mandate of a single enterprise data warehouse
Modern data sharing to easily connect different data types located in multiple repositories for deeper analytics
How cloud data warehouses can scale both storage and compute, independently and elastically, to meet variable workloads
Presentation by Harsha Kapre, Snowflake
Master the Multi-Clustered Data Warehouse - Snowflake (Matillion)
Snowflake is one of the most powerful, efficient data warehouses on the market today—and we joined forces with the Snowflake team to show you how it works!
In this webinar:
- Learn how to optimize Snowflake
- Hear insider tips and tricks on how to improve performance
- Get expert insights from Craig Collier, Technical Architect from Snowflake, and Kalyan Arangam, Solution Architect from Matillion
- Find out how leading brands like Converse, Duo Security, and Pets at Home use Snowflake and Matillion ETL to make data-driven decisions
- Discover how Matillion ETL and Snowflake work together to modernize your data world
- Learn how to utilize the impressive scalability of Snowflake and Matillion
Bulk data loading in Snowflake involves the following steps, sketched in SQL after the list:
1. Creating file format objects to define file types and formats
2. Creating stage objects to hold the files to be loaded
3. Staging data files in the stages
4. Listing the staged files
5. Copying data from the stages into target tables
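A minimal end-to-end sketch of those five steps follows; the format, stage, file, and table names are hypothetical, and the PUT command assumes you are running from a local client such as SnowSQL.

-- 1. File format object describing the incoming files (hypothetical names throughout)
CREATE OR REPLACE FILE FORMAT csv_format
  TYPE = 'CSV' FIELD_OPTIONALLY_ENCLOSED_BY = '"' SKIP_HEADER = 1;

-- 2. Internal stage that will hold the files to be loaded
CREATE OR REPLACE STAGE csv_stage FILE_FORMAT = csv_format;

-- 3. Stage local data files (run from a client such as SnowSQL)
PUT file:///tmp/orders*.csv @csv_stage;

-- 4. List the staged files to verify the upload
LIST @csv_stage;

-- 5. Copy the staged data into the target table
COPY INTO orders FROM @csv_stage FILE_FORMAT = (FORMAT_NAME = 'csv_format');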
Snowflake is a cloud data warehouse that offers scalable storage, flexible compute capabilities, and a multi-cluster, shared-data architecture. Data is stored in micro-partitions in cloud object storage, independently of compute resources, which allows storage and compute to scale elastically. Snowflake also uses a virtual warehouse architecture in which queries are processed in parallel across nodes, enabling high performance on large datasets. Data can be loaded into Snowflake from external sources like Amazon S3, and queries can run across petabytes of data with ACID transactions and security at scale.
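As an illustration of that compute elasticity, the sketch below creates an independent virtual warehouse and resizes it on demand; the warehouse name and sizes are arbitrary examples.

-- Create an independent compute cluster that suspends itself when idle (hypothetical name)
CREATE WAREHOUSE analytics_wh
  WAREHOUSE_SIZE = 'XSMALL'
  AUTO_SUSPEND = 60
  AUTO_RESUME = TRUE;

-- Scale up instantly for a heavy workload, then back down
ALTER WAREHOUSE analytics_wh SET WAREHOUSE_SIZE = 'XLARGE';
ALTER WAREHOUSE analytics_wh SET WAREHOUSE_SIZE = 'XSMALL';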
Demystifying Data Warehousing as a Service - DFW (Kent Graziano)
This document provides an overview and introduction to Snowflake's cloud data warehousing capabilities. It begins with the speaker's background and credentials. It then discusses common data challenges organizations face today around data silos, inflexibility, and complexity. The document defines what a cloud data warehouse as a service (DWaaS) is and explains how it can help address these challenges. It provides an agenda for the topics to be covered, including features of Snowflake's cloud DWaaS and how it enables use cases like data mart consolidation and integrated data analytics. The document highlights key aspects of Snowflake's architecture and technology.
As cloud computing continues to gather speed, organizations with years’ worth of data stored on legacy on-premises technologies are facing issues with scale, speed, and complexity. Your customers and business partners are likely eager to get data from you, especially if you can make the process easy and secure.
Challenges with performance are not uncommon, and ongoing interventions are required just to “keep the lights on”.
Discover how Snowflake empowers you to meet your analytics needs by unlocking the potential of your data.
Agenda of Webinar :
~Understand Snowflake and its Architecture
~Quickly load data into Snowflake
~Leverage the latest in Snowflake’s unlimited performance and scale to make the data ready for analytics
~Deliver secure and governed access to all data – no more silos
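To make the "secure and governed access" item concrete, here is a minimal sketch using a role and a secure view; all object names are hypothetical.

-- A role that analysts will use (hypothetical names throughout)
CREATE ROLE analyst_role;

-- A secure view that exposes only non-sensitive columns
CREATE SECURE VIEW sales.public.orders_v AS
  SELECT order_id, order_date, amount
  FROM sales.public.orders;

-- Grant the role access to the view, not the underlying table
GRANT USAGE ON DATABASE sales TO ROLE analyst_role;
GRANT USAGE ON SCHEMA sales.public TO ROLE analyst_role;
GRANT SELECT ON VIEW sales.public.orders_v TO ROLE analyst_role;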
Presentation on Data Mesh: a paradigm shift toward a new type of ecosystem architecture, a modern distributed architecture that treats domain-specific data as “data-as-a-product” and enables each domain to own its data pipelines.
This document provides an introduction and overview of implementing Data Vault 2.0 on Snowflake. It begins with an agenda and the presenter's background. It then discusses why customers are asking for Data Vault and provides an overview of the Data Vault methodology including its core components of hubs, links, and satellites. The document applies Snowflake features like separation of workloads and agile warehouse scaling to support Data Vault implementations. It also addresses modeling semi-structured data and building virtual information marts using views.
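As a rough sketch of the hub/satellite modeling and view-based information marts described above (table and column names are illustrative, not from the deck):

-- Hub: unique list of business keys (illustrative names)
CREATE TABLE hub_customer (
  customer_hk  BINARY(20)    NOT NULL,  -- hash of the business key
  customer_id  VARCHAR       NOT NULL,  -- business key
  load_dts     TIMESTAMP_NTZ NOT NULL,
  record_src   VARCHAR       NOT NULL
);

-- Satellite: descriptive attributes tracked over time
CREATE TABLE sat_customer_details (
  customer_hk  BINARY(20)    NOT NULL,
  load_dts     TIMESTAMP_NTZ NOT NULL,
  name         VARCHAR,
  email        VARCHAR,
  record_src   VARCHAR       NOT NULL
);

-- Virtual information mart: a view joining hub and satellite, no data copied
CREATE VIEW dim_customer AS
  SELECT h.customer_id, s.name, s.email
  FROM hub_customer h
  JOIN sat_customer_details s ON s.customer_hk = h.customer_hk;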
Embarking on building a modern data warehouse in the cloud can be an overwhelming experience due to the sheer number of products that can be used, especially when the use cases for many products overlap others. In this talk I will cover the use cases of many of the Microsoft products that you can use when building a modern data warehouse, broken down into four areas: ingest, store, prep, and model & serve. It’s a complicated story that I will try to simplify, giving blunt opinions of when to use what products and the pros/cons of each.
Data Warehousing Trends, Best Practices, and Future Outlook (James Serra)
Over the last decade, the 3Vs of data - Volume, Velocity & Variety - have grown massively. The Big Data revolution has completely changed the way companies collect, analyze & store data. Advancements in cloud-based data warehousing technologies have empowered companies to fully leverage big data without heavy investments, both in terms of time and resources. But that doesn’t mean building and managing a cloud data warehouse isn’t accompanied by challenges. From deciding on a service provider to the design architecture, deploying a data warehouse tailored to your business needs is a strenuous undertaking. Looking to deploy a data warehouse to scale your company’s data infrastructure, or still on the fence? In this presentation you will gain insights into current data warehousing trends, best practices, and the future outlook, and learn how to build your data warehouse with the help of real-life use cases and discussion of commonly faced challenges. In this session you will learn:
- Choosing the best solution - Data Lake vs. Data Warehouse vs. Data Mart
- Choosing the best Data Warehouse design methodologies: Data Vault vs. Kimball vs. Inmon
- Step by step approach to building an effective data warehouse architecture
- Common reasons for the failure of data warehouse implementations and how to avoid them
Data driven organizations can be challenged to deliver new and growing business intelligence requirements from existing data warehouse platforms, constrained by lack of scalability and performance. The solution for customers is a data warehouse that scales for real-time demands and uses resources in a more optimized and cost-effective manner. Join Snowflake, AWS and Ask.com to learn how Ask.com enhanced BI service levels and decreased expenses while meeting demand to collect, store and analyze over a terabyte of data per day. Snowflake Computing delivers a fast and flexible elastic data warehouse solution that reduces complexity and overhead, built on top of the elasticity, flexibility, and resiliency of AWS.
Join us to learn:
• How Ask.com eliminates data redundancy, and simplifies and accelerates data load, unload, and administration
• How to support new and fluid data consumption patterns with consistently high performance
• Best practices for scaling high data volume on Amazon EC2 and Amazon S3
Who should attend: CIOs, CTOs, CDOs, Directors of IT, IT Administrators, IT Architects, Data Warehouse Developers, Database Administrators, Business Analysts and Data Architects
This document provides instructions for a hands-on lab guide to explore the Snowflake data warehouse platform using a free trial. The lab guide walks through loading and analyzing structured and semi-structured data in Snowflake. It introduces the key Snowflake concepts of databases, tables, warehouses, queries and roles. The lab is presented as a story where an analytics team loads and analyzes bike share rider transaction data and weather data to understand riders and improve services.
In this webinar you'll learn how to quickly and easily improve your business using Snowflake and Matillion ETL for Snowflake. Webinar presented by Solution Architects Craig Collier (Snowflake) and Kalyan Arangam (Matillion).
In this webinar:
- Learn to optimize Snowflake and leverage Matillion ETL for Snowflake
- Discover tips and tricks to improve performance
- Get invaluable insights from data warehousing pros
Snowflake is an analytic data warehouse provided as software-as-a-service (SaaS). It uses a unique architecture designed for the cloud that combines aspects of shared-disk and shared-nothing designs. Snowflake's architecture consists of three layers - the database storage layer, the query processing layer, and the cloud services layer - which are deployed and managed entirely on cloud platforms like AWS and Azure. Snowflake offers different editions, such as Standard, Premier, Enterprise, and Enterprise for Sensitive Data, that provide additional features, support, and security capabilities.
This document contains copyright information for Snowflake Computing and provides three versions (A, B, and C) of a three-layer design diagram, each protected by copyright for Snowflake Computing.
Business Intelligence is more than just pretty visuals (Vincent Woon)
Holistics is a cloud BI platform that powers data operations for businesses. We are self-funded, and our customers in the region range from young startups to large tech companies like Grab, Traveloka, Line Games, 99co, e27 and ShopBack.
We want to help people learn how to work with data, and make data work for them.
Companies ask questions of their data in the form of charts or numbers, on a regular or ad hoc basis. However, the process of preparing this data and these reports is repetitive and time consuming. Data is also stored across different online applications, which makes it difficult to have a single view for reporting.
Holistics automates the data pipeline process from source data to insights, reducing the time data teams spend preparing reports. Users can schedule email reports to be sent, or set up thresholds to notify them about changes in their business data.
There is a workspace for SQL analysts and data scientists to query, transform, and share datasets easily with each other. They can also troubleshoot slow-running queries on the fly without technical help.
Each Holistics dashboard can also be embedded in your in-house application, which reduces the time and effort for engineers to provide dashboards for their customers.
The document discusses tips and strategies for using SAP NetWeaver Business Intelligence 7.0 as an enterprise data warehouse (EDW). It covers differences between evolutionary warehouse architecture and top-down design, compares data mart and EDW approaches, explores real-time data warehousing with SAP, examines common EDW pitfalls, and reviews successes and failures of large-scale SAP BI-EDW implementations. The presentation also explores the SAP NetWeaver BI architecture and Corporate Information Factory framework.
View the companion webinar at: http://embt.co/1L8V6dI
Some claim that, in the age of Big Data, data modeling is less important or even not needed. However, with the increased complexity of the data landscape, it is actually more important to incorporate data modeling in order to understand the nature of the data and how they are interrelated. In order to do this effectively, the way that we do data modeling needs to adapt to this complex environment.
One of the key data modeling issues is how to foster collaboration between new groups, such as data scientists, and traditional data management groups. There are often different paradigms, and yet it is critical to have a common understanding of data and semantics between different parts of an organization. In this presentation, Len Silverston will discuss:
+ How Big Data has changed our landscape and affected data modeling
+ How to conduct data modeling in a more ‘agile’ way for Big Data environments
+ How we can collaborate effectively within an organization, even with differing perspectives
About the Presenter:
Len Silverston is a best-selling author, consultant, and a fun and top rated speaker in the field of data modeling, data governance, as well as human behavior in the data management industry, where he has pioneered new approaches to effectively tackle enterprise data management. He has helped many organizations world-wide to integrate their data, systems and even their people. He is well known for his work on "Universal Data Models", which are described in The Data Model Resource Book series (Volumes 1, 2, and 3).
10 Reasons Snowflake Is Great for Analytics (Senturus)
Learn why the Snowflake analytic data warehouse makes sense for BI, including data loading flexibility and scalability, consumption-based storage and compute costs, Time Travel and data sharing features, support across a range of BI tools like Power BI and Tableau, and the ability to allocate compute costs. View this on-demand webinar: https://senturus.com/resources/10-reasons-snowflake-is-great-for-analytics/.
Senturus offers a full spectrum of services in business intelligence and training on Cognos, Tableau and Power BI. Our resource library has hundreds of free live and recorded webinars, blog posts, demos and unbiased product reviews available on our website at: http://www.senturus.com/senturus-resources/.
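The Time Travel feature mentioned in the webinar description lets you query or restore past states of data with plain SQL; a minimal sketch follows (the table name and time references are hypothetical).

-- Query the table as it looked one hour ago (hypothetical table name)
SELECT * FROM orders AT(OFFSET => -60*60);

-- Query the table as of a specific timestamp
SELECT * FROM orders AT(TIMESTAMP => '2024-01-01 00:00:00'::TIMESTAMP_LTZ);

-- Restore a table that was dropped by mistake
UNDROP TABLE orders;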
IT + Line of Business - Driving Faster, Deeper Insights Together (DATAVERSITY)
Marketo helps customers master the science of digital marketing with the analytics it provides. Internally, Marketo found itself afflicted with “Excel mania” and suffering from the side effects that come with it, including slow time to insights and hours lost on mundane but critical data prep. This quickly changed when they bet their BI strategy on Alteryx, Amazon Web Services (AWS), and Tableau.
Join us and hear from Tim Chandler, head of BI and data solutions, and learn how:
the stack is enabling more efficient analytics processes, as well as providing governance and scalability
IT and line of business (LOB) are effectively working together to uncover more insights, faster – saving time and resources in the process
an enterprise-class data architecture is driving business engagement and dashboard adoption across the entire company
Register now to learn how you can improve your analytics processes - leading to faster, deeper insights.
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ... (Pentaho)
This document discusses approaches to implementing Hadoop, NoSQL, and analytical databases. It describes:
1) The current landscape of big data databases including Hadoop, NoSQL, and analytical databases that are often used together but come from different vendors with different interfaces.
2) Common uses of transactional databases, Hadoop, NoSQL databases, and analytical databases.
3) The complexity of current implementation approaches that involve multiple coding steps across various tools.
4) How Pentaho provides a unified platform and visual tools to reduce the time and effort needed for implementation by eliminating disjointed steps and enabling non-coders to develop workflows and analytics for big data.
Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics (Cedar Consulting)
The document discusses Oracle's analytics cloud strategy and Oracle Analytics Cloud (OAC) platform. It covers OAC's features such as self-service report creation, data visualization capabilities, and integration with other Oracle products. The document also summarizes how customers can migrate existing on-premise analytics solutions like OBIEE, BICS, and DVCS to OAC. Finally, it provides an overview of Oracle Analytic Cloud - Essbase for flexible analytic applications and management reporting in the cloud.
1. The document lists various projects and initiatives undertaken by the BI team including project planning, cross-functional collaboration, team roadmaps, and testing.
2. The projects provide benefits like enabling closer collaboration between teams, improving data governance, and ensuring data accuracy.
3. The initiatives help reduce overhead for other teams, provide solutions to shared data issues, and limit bugs.
Azure + DataStax Enterprise Powers Office 365 Per User Store (DataStax Academy)
We will present our O365 use case scenarios, explain why we chose Cassandra + Spark, and walk through the architecture we chose for running DataStax Enterprise on Azure.
Managing Large Amounts of Data with Salesforce (Sense Corp)
Critical "design skew" problems and solutions - Engaging Big Objects, MuleSoft, Snowflake and Tableau at the right time
Salesforce’s ability to handle large workloads and participate in high-consumption, mobile-application-powering technologies continues to evolve. Pub/sub models and investment in adjacent properties like Snowflake, Kafka, and MuleSoft have broadened the development scope of Salesforce. Solutions now range from internal, in-platform applications to fueling world-scale mobile applications and integrations. Unfortunately, guidance on the extended capabilities is not well understood or documented. Knowing when to move your solution to a higher order is an important architect skill.
In this webinar, Paul McCollum, UXMC and Technical Architect at Sense Corp, will present an overview of data and architecture considerations. You’ll learn to identify reasons and guidelines for updating your solutions to larger-scale, modern reference infrastructures, and when to introduce products like Big Objects, Kafka, MuleSoft, and Snowflake.
Transforming Data Management and Time to Insight with Anzo Smart Data Lake® (Cambridge Semantics)
The document discusses how Anzo Smart Data Lake can help government agencies transform data management and shorten time to insight. It provides an overview of Anzo and how it uses semantic knowledge graphs to link and harmonize diverse data sources for self-service data preparation, discovery, and analytics. Examples are given of how Anzo has helped organizations in intelligence and defense integrate data sources and gain better visibility into areas like contract performance. The presentation concludes by discussing how Anzo could help agencies drive business efficiency and enable more self-service for citizens using public data, and suggests next steps of a proof of concept or proposal.
Top 10 Tips for an Effective Postgres Deployment (EDB)
This presentation addresses these key questions during your Postgres deployment:
* What is this database going to be used for – a reporting server or data warehouse, or as an operational database supporting an application?
* Which resources should I spend the budget on to ensure optimal database performance – bigger servers, more CPUs/cores, disks, or more memory?
* What are my backup requirements? If I ever need to restore, how far back do I need to go and what will that mean to the business?
* How will I handle any hot fixes, such as security patches?
* What downtime can be afforded and what processes need to be in place to apply critical or maintenance updates?
* What are my replication and failover requirements and what should I do for my high availability configuration?
The answers to these questions will impact how well you prepare, configure, and tune your database environment. The consequences of overlooking the key ingredients of your deployment can result in misallocated resources, limited ability to change, or worse - facing an outage with critical data loss.
With solid Postgres deployment planning, you can reduce risks, spend less time troubleshooting in post-production situations, lower long-term maintenance costs, instill confidence, and be a superstar DBA.
****************************************
This presentation is helpful for DBAs, Data Architects, IT Managers, IT Directors, and IT Strategists who are responsible for supporting Postgres-based applications and deployment with ongoing maintenance of Postgres databases. It is equally suitable for organizations using community PostgreSQL as well as EDB’s Postgres Plus product family.
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R... (Senturus)
Learn the benefits and differences in functionality between Cognos reports and dashboards, the best place for experimental data discovery and what data modules and stories are. View the video recording and download this deck at: http://www.senturus.com/resources/cognos-analytics-dashboards-or-reports/
Senturus, a business analytics consulting firm, has a resource library with hundreds of free live and recorded webinars, blog posts, demos and unbiased product reviews available on our website at: http://www.senturus.com/senturus-resources/.
The document discusses machine learning and artificial intelligence applications inside and outside of Snowflake's cloud data warehouse. It provides an overview of Snowflake and its architecture, then discusses how machine learning can be implemented directly in the database using SQL, user-defined functions, and stored procedures. However, it notes that pure coding is not suitable for all users, and that automated machine learning outside the database may be preferable because it enables more business analysts and power users. It provides an example of using Amazon Forecast for time series forecasting and integrating it with Snowflake.
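As a flavor of the in-database approach described above, here is a minimal SQL UDF sketch; the function is a toy scoring example invented for illustration, not from the deck.

-- A toy scoring function implemented as a SQL UDF (illustrative only)
CREATE OR REPLACE FUNCTION churn_score(tenure_months FLOAT, support_tickets FLOAT)
  RETURNS FLOAT
  AS 'GREATEST(0.0, LEAST(1.0, 0.8 - 0.01 * tenure_months + 0.05 * support_tickets))';

-- Apply it in an ordinary query (hypothetical table and columns)
SELECT customer_id, churn_score(tenure_months, support_tickets) AS score
FROM customers;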
This document provides an overview of Alluxio, a unified data solution that allows applications to access data closer to the computation. It summarizes Alluxio's key innovations including providing a unified namespace, translating between different storage APIs, and using an intelligent caching system. The document also outlines several use cases where Alluxio has helped customers including accelerating machine learning and analytics workloads.
Building MuleSoft Applications with Google BigQuery Meetup 4MannaAkpan
Our main speaker, Eswara Pendli, is a Senior MuleSoft Consultant at Apisero with vast integration experience across different domains. In this session, we learn about key features of BigQuery and quick pointers for working with it.
Play-around with BigQuery in GCP (Google Cloud Platform)
Learn BigQuery API (Basic CRUD Operations)
Play with BigQuery in Anypoint Studio (Setup & Configure BigQuery Using MuleSoft)
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein... (Senturus)
Add interactivity to reports with OLAP data, create briefing book-style reports based on existing reports using report references and tables of contents, use the report pages framework to combine presentations into a single report and increase efficiency in report building. View the video recording and download this deck at: http://www.senturus.com/resources/cool-improvements-for-report-developers-in-cognos-analytics-r7/.
Senturus, a business analytics consulting firm, has a resource library with hundreds of free recorded webinars, trainings, demos and unbiased product reviews. Take a look and share them with your colleagues and friends: http://www.senturus.com/resources/.
How to grow to a modern workplace in 16 steps with Microsoft 365 (Tim Hermie ☁️)
In this session we will give actual insights on how we move customers to Microsoft 365 in a 15+ step approach. From identity to Endpoint Manager, security mechanisms, and migration of data, we’ll cover the whole stack.
Postgres Integrates Effectively in the "Enterprise Sandbox" (EDB)
This presentation provides guidance through these challenges and offers solutions that allow you to:
- Connect to multiple sources of data to support your growing business
- Integrate with existing incumbent systems that power your business
- Share siloed data among your technical teams to address strategic objectives
- Learn how customers integrated EDB Postgres within their corporate ecosystems that included Oracle, SQL Server, MongoDB, Hadoop, MySQL and Tuxedo
This presentation covers the solutions, services, and best practice recommendations you need to be a leader in today’s complex digital environment.
Target Audience: The content will interest both business and technical decision-makers or influencers responsible for the overall strategy and execution of a PostgreSQL and/or an EDB Postgres database.
Data loading – struggle to load, store and manage data
Data integration – struggle to unify and integrate disparate data sources
Analytics – struggle to analyze data quickly and effectively
Collaboration – because you’re spending so much time on the other three problems, it’s difficult to get everyone on the same page and work together to find insight in your data
Preparing disparate data to load
The struggle to load data begins with the need to prepare disparate datasets to load. Many organizations are dealing with a host of new semi-structured data in formats like JSON and Avro that require flattening to load into a relational database. Or, they choose to store semi-structured data separate from relational data in a NoSQL store, creating silos.
Capacity planning
Finding space for data can be another enormous challenge. Large numbers of complex datasets can quickly snowball into a storage capacity problem on fixed-size on-premises or cloud data platforms.
Resource contention
Loading large datasets also requires significant compute capacity. Many data warehouses are already strained under normal business workloads, and the compute needed for loading pushes those other processes down the priority queue.
All of these problems lead to difficult conversations about whose data or use case is most important. One project might need funding for an open source, semi-structured data store. Another wants to expand the on-premises data warehouse. One team wants to load clickstream data, and another needs finance data. Prioritizing completely different needs can be a minefield that leads to a host of struggles within and between teams.
Tackle loading challenges with Snowflake
Snowflake addresses each loading challenge with simplicity. Semi-structured data can be loaded natively alongside structured data, and queried together in one location. Because Snowflake is built for the cloud, you can store as much data as you want, with no need to prioritize between datasets. Best of all, you can create independent compute resources, called virtual warehouses, for each of your use cases, eliminating the need for queues.
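As a rough sketch of what this looks like in practice (warehouse and table names here are hypothetical), each use case gets its own virtual warehouse, and semi-structured data lands natively in a VARIANT column:

```sql
-- Independent compute per use case, so loading never queues behind BI.
CREATE WAREHOUSE load_wh WITH WAREHOUSE_SIZE = 'MEDIUM'
  AUTO_SUSPEND = 300 AUTO_RESUME = TRUE;
CREATE WAREHOUSE bi_wh WITH WAREHOUSE_SIZE = 'LARGE'
  AUTO_SUSPEND = 300 AUTO_RESUME = TRUE;

-- Semi-structured data loads natively alongside structured data.
CREATE TABLE raw_events (payload VARIANT);
```

Because each warehouse draws on its own compute, loading jobs and dashboards no longer contend for the same resources.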
Making sense of data in silos
With data scattered across NoSQL data lakes, cloud applications, and data warehouses (not to mention flat files and CSVs), organizations are struggling to combine and analyze their data in one cohesive picture.
Editing and transforming data
Every system that stores data has its challenges, but many organizations find it particularly hard to analyze and understand data in NoSQL systems like Hadoop. Semi-structured open source data stores require a large amount of custom configuration, uncommon skill sets, and transformation to combine successfully with other business data. They also rarely support the update, insert, and delete commands that are essential to data modeling and transformation.
Supporting evolving business logic and disparate use cases
It’s hard for the business to drive evolutions in business logic within the database when testing and updating requires an arduous manual process. Often, entire databases must be physically copied just to test a simple change to a table or derived field, which is extremely expensive and time-consuming. And because different people within the organization have different data needs, a “single source of truth” is often too ungainly and impractical for most organizations to maintain and use.
All of these problems make it difficult to generate a refined view of what the data actually says. Differing methods of transforming data arise, with competing factions promoting their own ways of working with, storing, and querying data. People throughout the business wonder where they can find the “right” version of their metrics and KPIs.
Improve data integration with Snowflake
Snowflake makes data integration straightforward. You can load all of your data, in almost any structured or semi-structured format, avoiding data silos. Transformation is easier with ANSI-standard SQL and dot notation for semi-structured data, and inserts, deletes, and other common operations are fully supported. You can even test and update rapidly with zero-copy cloning, driving faster iteration on business logic.
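For illustration, here is a minimal sketch (table, database, and field names are hypothetical) of the dot notation and zero-copy cloning mentioned above:

```sql
-- Query nested JSON with dot notation and an explicit cast.
SELECT payload:customer.name::STRING       AS customer_name,
       payload:order.total::NUMBER(10, 2)  AS order_total
FROM   raw_events;

-- Zero-copy clone: a writable copy for testing, created without
-- physically duplicating the underlying data.
CREATE DATABASE analytics_dev CLONE analytics;
```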
Queues
Analytics users are always at the bottom of the resource priority queue. It’s rarely designed that way on purpose, but if ETL, as a simplified example, needs to run for 45 minutes every hour, there’s little time left for the analytics team to access and iterate on the database.
Delays
Through the eyes of an analyst, nothing ever works fast enough. But disappointing performance often isn’t for lack of trying. Many data warehouses require hours of painstaking optimization, tuning, indexing, sorting, and vacuuming from a dedicated data engineer. To add to the pain, one optimization often leads to deoptimization in another area.
The struggle to analyze data is one of the most visible. Report consumers complain that the BI tool isn’t working fast enough. The BI team points the finger at the data engineers. But, at the end of the day, antiquated database technology is the real culprit.
Analyzing efficiently with Snowflake
Snowflake addresses efficient analytics in two ways. As we saw before, independent virtual warehouses help with concurrent queries, allowing ETL and BI to run side by side at the same time. Large or variable analytics workloads within a single warehouse can be handled with multi-cluster warehouses, which can even autoscale to automatically match compute resources to demand.
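A hedged sketch of such a warehouse follows (names and limits are illustrative, and multi-cluster warehouses assume a Snowflake edition that supports them):

```sql
-- Scales out to additional clusters under concurrency spikes,
-- and back in (and to sleep) when demand drops.
CREATE WAREHOUSE analytics_wh WITH
  WAREHOUSE_SIZE    = 'LARGE'
  MIN_CLUSTER_COUNT = 1
  MAX_CLUSTER_COUNT = 4
  SCALING_POLICY    = 'STANDARD'
  AUTO_SUSPEND      = 300
  AUTO_RESUME       = TRUE;
```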
The struggle to load, integrate, and analyze data leads to a fourth struggle that’s often the worst: collaboration.
Incessant fixing
If the organization spends all its time endlessly solving loading, integration, and analytics struggles, it’s impossible to break away and think at a higher level about what needs to be accomplished. Data is a constant flash point of disagreement, rather than a rallying point for collaboration.
Siloed teams
Historically, there’s been a dividing line between technical IT implementers and less technical, business-side consumers. This was partly driven by technology, but reinforced by organizational structures that don’t favor cross-team collaboration.
The lack of collaboration is the end result of the other struggles, and the most frustrating of them all. How can two disparate types of people, on two different teams (or many different teams), effectively work together when they are completely buried under the weight of their antiquated data platform?
Improve collaboration with Snowflake
As we noted previously, Snowflake can help solve loading, integration, and analytics struggles, freeing time for collaboration and higher-level planning. Working together in Snowflake, the dividing line between IT and BI becomes less important. IT can lead the business with technology and empower the BI team to analyze data. By the same token, with more accessible technology in the form of Snowflake, BI teams can take an active role in the curation and modeling of data that has historically rested solely on IT’s shoulders.
Week one is all about the team. It’s time to bring everyone around the same table to figure out the best way to move forward with your data. Keep your conversation focused on an achievable goal: trying to get an important dataset into Snowflake for analysis.
Discuss blocking issues, but be sure to define them in terms of technology, rather than people. Once you’ve got a plan to get around any blocking issues, set up Snowflake On-Demand for free and make a plan to bring the team together for status updates in the weeks to follow.
Pro tip: Think big. Every new Snowflake On-Demand customer gets $400 in free credits to play with, more than enough to load and store a massive dataset. One Snowflake customer performance-tested Snowflake against a $10,000,000 on-premises database with only $100 of credits. Snowflake was 100x faster.
Week 2 is when the practical, real-life work begins. Pick up where you left off with your team, and discuss the right data to load into Snowflake. Clearly define the scope within that dataset, so you settle on a dataset that is large enough to be useful but also flexible enough to get out of its current location within the week. Once you’ve got your data, it’s time to create a warehouse, database, and tables to load it into.
Pro tip: Remember to stay open-minded about semi-structured data, too; in fact, it might be the best dataset to get started with in Snowflake. Store semi-structured data in nested form in a VARIANT-type column, and transform it with dot notation using standard SQL statements, as in the sketch below.
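A minimal sketch of that workflow (stage, table, and field names are hypothetical):

```sql
-- Land raw JSON in a single VARIANT column.
CREATE TABLE clickstream (v VARIANT);

-- Load files from a stage; Snowflake parses the JSON on ingest.
COPY INTO clickstream
  FROM @my_stage/clickstream/
  FILE_FORMAT = (TYPE = 'JSON');

-- Dot notation reads nested fields without any upfront flattening.
SELECT v:page.url::STRING AS page_url,
       v:user.id::STRING  AS user_id
FROM   clickstream;
```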
By week 3, you should have data loaded, and perhaps you’ve already started querying and using it. If not, now is a good time to start. Take note of the business logic (in the form of calculations, derived fields, KPIs, and so on) that would make sense to add. Work with the team to further define this logic, and experiment with zero-copy cloning to test transformations to your production data from the safety of a cloned database, as sketched below. When you’ve got your business logic added, look to add an additional warehouse for ongoing loading and transformation needs.
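One way the clone-test-promote loop can look (database, view, and column names are hypothetical):

```sql
-- Clone production; the clone shares storage until data diverges.
CREATE DATABASE sales_dev CLONE sales;
USE DATABASE sales_dev;

-- Try the new derived field in the clone first.
CREATE OR REPLACE VIEW order_margins AS
  SELECT order_id, revenue - cost AS margin
  FROM   orders;

-- Once verified, run the same DDL against the production database.
```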
Pro tip: The value of Snowflake increases exponentially with the number of related data sources you’re able to load and integrate. In other words, sales data from Salesforce is more than twice as interesting when people can combine it with account-based web interaction data from Google Analytics.
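To make that concrete, here is a sketch of the kind of cross-source query this enables (table and column names are entirely hypothetical, assuming both sources have been loaded into Snowflake):

```sql
-- Combine CRM pipeline with web engagement per account.
SELECT o.account_id,
       SUM(o.amount)       AS pipeline_amount,
       COUNT(s.session_id) AS web_sessions
FROM   salesforce_opportunities o
LEFT JOIN ga_sessions s
       ON s.account_id = o.account_id
GROUP  BY o.account_id;
```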
As week 4 rolls around, it’s time to spread the value of your data as widely as possible. Add users to Snowflake, along with roles and permissions to match. Create auto-scaling warehouses for the BI, analytics and reporting teams to enable everyone to access data without contention. Connect Snowflake to your BI tool to begin creating the visualizations and dashboards that will power the insight you need.
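A minimal sketch of that access setup (role, user, and object names are hypothetical):

```sql
-- A role for report consumers, with read-only access.
CREATE ROLE reporting;
GRANT USAGE  ON WAREHOUSE bi_wh           TO ROLE reporting;
GRANT USAGE  ON DATABASE analytics        TO ROLE reporting;
GRANT USAGE  ON SCHEMA   analytics.public TO ROLE reporting;
GRANT SELECT ON ALL TABLES IN SCHEMA analytics.public TO ROLE reporting;

-- A user the BI tool connects as.
CREATE USER dashboard_user PASSWORD = '<placeholder>'
  DEFAULT_ROLE = reporting DEFAULT_WAREHOUSE = bi_wh;
GRANT ROLE reporting TO USER dashboard_user;
```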
Pro tip: Many organizations that have traditionally relied on extracts or in-memory data use Snowflake as a live connection within their BI tool. Experiment and take advantage of the speed and flexibility that Snowflake can give your team.
After 30 days, you should see some significant improvements. Your team should be talking about your data and collaborating more. You should be able to easily load and combine the data that matters to your business. There should be useful business logic within the data you loaded into Snowflake, and plans to test and expand even more. Your BI and analytics should be performing quickly on the data you’ve loaded, generating further interest in your overall plans for your data platform.
The most important change you should see after this 30-day plan is in your relationships. The struggles that defined your loading, integration, analytics, and collaboration should have given way to a new and promising spirit of mutual ownership.
Next steps
The next steps are up to you, but they look a lot like the first 30-day plan in elongated form. Continue the discussion. Load more data. Expand the number of users and groups that can access and benefit from the data you’ve loaded into Snowflake.
It’s also important to continually share and elevate the success and experiences you’ve had ending the struggle for data within your organization. Show executives and leaders the value of your data, and the time that you’ve put into perfecting it for analysis.
Lastly, make sure to share your experiences outside of your own organization. Speak at conferences and events so you can synthesize what you’ve learned and spread the benefit of your experience to people that are still struggling with their data.