sqlintersection do you know data in the microsoft cloud? upgrade your life bob ward, cto css,...

22
SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft [email protected] http://aka.ms/bobwardms Tuesday, 3:45-5:00pm The Experts: Jason Howell, Azim Uddin, Lisa Liu, Adam Saxton, Rohit Nayak

Upload: loren-phelps

Post on 19-Jan-2016

221 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

SQLintersectionDo you know Data in the Microsoft Cloud?Upgrade Your Life

Bob Ward, CTO CSS, [email protected]

http://aka.ms/bobwardms

Tuesday, 3:45-5:00pm

The Experts: Jason Howell, Azim Uddin, Lisa Liu, Adam Saxton,

Rohit Nayak

Page 2: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

What is Microsoft Azure?

Microsoft Azure is a growing collection of integrated cloud services—analytics, computing, database, mobile, networking, storage, and web—for moving faster, achieving more, and saving money…..

Use an open and flexible

platform

Extend your existing IT

Scale as you need, pay as

you go

Protect your data

Run your apps

anywhere

Make smarter

decisions

Rely on a trusted cloud

The Cloud for Modern Business

Page 3: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

The “No Excuse” Cloud Checklist

• Internal security measures, isolation and control, and Compliance “I’m concerned about cloud security and privacy”

• Azure Virtual Machines allow you complete control of your SQL Server instance and configuration

“The Cloud doesn’t give me the control I need”

• Azure ExpressRoute to get WAN network speeds up to 10Gbps• Data centers in 20 regions around the world

“I can’t rely on the speed of the public Internet”

• Azure SQL Database now contains features such as full-text search, TDE, and row-level security

“The cloud doesn’t offer the features I need”

• Hybrid is one of the key diffenteriators for Azure including Azure Backup, SQL Stretch Database, and secondary replicas in Azure VM

“I can’t move everything to the cloud”

Azure Trust Center

Page 4: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Data and Microsoft Azure

SQL in Azure Virtual

Machine

Azure SQL Database

Azure SQL Data

Warehouse Preview

Azure DocumentDB

Azure HDInsight

Azure Data Lake Preview coming

Azure Data Factory

Azure Machine Learning

Power BI

Data Analytics

Stream Analytics, Search, and Data Catalog Preview

Page 5: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

SQL Server in Azure Virtual MachineComplete SQL Server “box” running in a VM

The VM is hosted in a Microsoft data center with a variety of sizes

SQL Server license is “subscription based” or “bring your own”

VM usage is “pay as you use it”

Use the Marketplace to avoid the pain of setup and configuration

Hybrid scenarios available and welcome

DB

You Microsoft

VM

Infrastructure as a Service (IAAS)

Page 6: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Azure SQL Database Deploy a database and application. A managed service Built-in High Availability, Geo-Replication, Restore, and Auditing Basic, Standard, and Premium Service Tiers (determine features, price, and performance) Predictable Performance via DTU that can be dynamically adjusted Close to box parity and new features are here first (row level security and full-text) Elastic pools to manage large number of databases Elastic scale-out for distributed performance Stretch Database in SQL Server 2016 for historical archive scenarios

DB

You Microsoft

VM

Platform as a Service (PAAS)

Page 7: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Azure SQL Database is Ready

Premium now supports 1TB database

Point in Time Restore, Geo-Restore, Geo and Active Geo-Replication

Application isolation and predictable performance with Service Tiers

CLR, XML, Change Tracking, and Full-Text Search now supported

Azure Active Directory support as alternative for SQL authentication

Columnstore index support

Security features such as row-level security, auditing, TDE

Monitor and troubleshoot with Query Store, Index Advisor, and Extended Events

Page 8: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

Demo

Let’s explore Azure SQL Database

Page 9: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Azure SQL Data Warehouse Preview

Think APS and SQL Server together in

the cloud

Fully managed (PAAS) like Azure

SQL Database

Petabytes storage of data along with MPP for scale-out queries

Separate usage and costs for compute

(DWU) and storage

Full Polybase support for queries that span SQL and

Hadoop environments

Power BI direct connectivity

Solution Partners for ETL solutions

Page 10: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Azure DocumentDBA fully-managed NoSQL document database service

Why NoSQL document? Schema-free data storage (no tables and columns predefined) JSON document storage (preferred by many web developers)

Why database? Supports transaction processing Supports database consistency Automatically indexes JSON documents Engine programming via procs, triggers, and UDF written in JavaScript A SQL language query interface built-in

Who would use this? JSON has become a popular choice for schema-free, unstructured data Natural choice for any developer working with JSON formatted data

Don’t think Word document

{ "id": “1", “Team": “Dallas Cowboys", “Players": [ { “Name": “Tony Romo“, “Jersey”: 9, “Position” : “QB” }, { “Name": “Dez Bryant“, “Jersey”: 88, “Position” : “WR” },], “Conference": “NFC“, “Prediction”: 1}

A JSON DBMS

Try Query Playground

JSON support In SQL 2016

Page 11: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Azure HDInsight

• Distributed processing of large data (Big) across a cluster of computers• Support both Windows and Linux environments all running in Azure• Storage based on HDFS that is “schema on read”• Job scheduling and resource management through YARN• Parallel processing of data through MapReduce (think Batch processing)

A semi-managed Hadoop based data and analytics service

We simplify the deployment of Hadoop clusters Hortonworks Data Platform (HDP)

Core Engine – HDFS and YARN

Batch – MapReduce Script – Pig SQL – Hive NoSQL – Hbase Streaming –

StormIn-Memory -

Spark

Ambari Mahout Oozie Phoenix Sqoop Tez Zookeeper

Azure Storage Blob (WASB)

Page 12: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

MapReduce Explained

MapTake <key,value> input and “map” this into an output. Imagine multiple parallel “map” tasks doing this work.

ReduceTake the output of the map tasks and combine or process them into the final output Your schema is in

your code on “read”

ERRORLOG.1 ERRORLOG.100 ERRORLOG.1000

“Show me the count of ERRORLOG entries for spid3s”

HIVE provides a SQL “like” experience

Spid3s:5 Spid3s:10 Spid3s:2

17

Think “batch jobs”

Page 13: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

Demo

Using HDInsight HIVE to analyzeSQL ERRORLOG files

Azim UddinBig Data Support Blog

Page 14: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Azure Data Lake Preview Coming

A fully managed store and analytics service

Based on YARN and HDFS

Rich Visual Studio Development Environment

U-SQL makes it easy for the SQL Professional

@t = EXTRACT date string , time string , author string , tweet string FROM "/input/MyTwitterHistory.csv" USING Extractors.Csv(); @res = SELECT author , COUNT(*) AS tweetcount FROM @t GROUP BY author; OUTPUT @res TO "/output/MyTwitterAnalysis.csv"ORDER BY tweetcount DESCUSING Outputters.Csv();

Jobs instead of clusters Used by many services

Built on Azure Blob Storage

Page 15: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Azure Data Factory Think SSIS and SQLAgent capabilities as a managed cloud service

Linked Services – Data stores or compute service Datasets – Input Data Source or Output dest Pipelines – Orchestrated activities using datasets and linked

services Develop in Azure Portal, PowerShell, or Visual Studio Schedule and monitor your pipelines

Convert ERRORLOG

files to UTF-8

Copy to Azure

Storage

Execute HIVE DDL

Execute HIVE

Queries

Copy results into

Azure Database

Defined with JSON

Copy ActivityHIVE Query

Activity

Linked Service: onPremisesFileServer

Linked Service: AzureStorage

Linked Service: HDInsightOnDemand

Linked Service: AzureSQLDatabase

DataSet: FileShare DataSet: AzureBlob

Copy Activity

DataSet: AzureBlob DataSet: AzureSQLTable

Power BI DashboardDirect Connect

OnDemand – Data Factory creates and deletes HDInsight clusters as needed

Page 16: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Azure Machine Learning

Azure Machine Learning = cloud computing platform to build, test, deploy, and publish predictive models A predictive model contains…

Define your problem (“I want to predict the winner of Super Bowl 50”) Ingest, clean, and aggregate data from past experiences and current data Build a model using “learning tasks”, algorithms, modules, and a “flow” Deploy and run your model

Machine Learning Studio allows you to build and deploy prediction models Reuse libraries of algorithms and modules Enhance with custom R and Python scripts Publish your work as a web service to be consumed in the Marketplace Use Cortana Analytics Gallery to get jump started

T-SQL R language integration coming to SQL Server 2016

“Learning is any process by which a system improves performance from experience” – Herbert Simon

Azure ML Cheat Sheet

Try it free

experiments

Page 17: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Power BI

• Based on our own data and analysis technologies running in Azure• Focuses on dashboards, self-service reports, and Q&A

A true self-service BI and reporting solution

• Gateways to schedule refresh for on-premise data sources• Azure stores and services directly connected. Use Stream Analytics for a “live” feed

Get Data from on-premise, file, Azure, or SAAS stores and services

• Power BI Desktop app allows you to develop offline and publish• Content Packs make it easy to distribute, share, and consume• Want more visuals. Check out the new community gallery.

A rich development and publishing system

• IPhone and Android• Windows Application

Power BI Mobile

Free or Power BI Pro

Page 18: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

Demo

Using PowerBI and SQL Server

Page 19: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Does this stuff work together?

Page 20: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

References Understanding Azure SQL Database and SQL Server in Azure VMs Spotlight on SQL Database Active Geo-Replication Azure SQL Database Benchmark Overview Data Analytic Scenarios Getting Started with Azure SQL Data Warehouse Working with NoSQL Data in DocumentDB Hadoop tutorial: Get started using Hadoop with Hive in HDInsight on Linux Building Big Data Applications Using Azure HDInsight Service Build your first pipeline using Azure Data Factory Build your first Machine Learning Experiment Azure Friday Videos

Page 21: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

© SQLintersection. All rights reserved.http://www.SQLintersection.com

Review

Azure has the security, privacy, scale, flexibility, and speed to meet the needs of production applications

Azure SQL Database is a fully managed database service using the features of SQL Server

Azure SQL Data Warehouse combines the power of APS and SQL Server Azure HDInsight provides a semi-managed Hadoop suite of services Azure Data Lake is a fully-managed store and analytics service for big data at scale Azure Data Factory provides data pipeline and orchestration services PowerBI brings a true self-service analytics and reporting solution Azure Machine Learning is a fully managed predictive model cloud service

Sign-up free today and get $200 credit on Azure ServicesMSDN subscribers get free Azure benefits

Page 22: SQLintersection Do you know Data in the Microsoft Cloud? Upgrade Your Life Bob Ward, CTO CSS, Microsoft bobward@microsoft.com  Tuesday,

Don’t forget to complete an online evaluation on EventBoard!

Your evaluation helps organizers build better conferences and helps speakers improve their sessions.

Questions?

Thank you!

Do you know Data in the Microsoft Cloud?