understanding and applying cloud hybrid...

65
Understanding and Applying Cloud Hybrid Search @jefffried Jeff Fried CTO, BA Insight

Upload: others

Post on 12-Jul-2020

27 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Understanding and ApplyingCloud Hybrid Search

@jefffried

Jeff Fried CTO, BA Insight

Page 2: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

we love hybrid search - it's amazing how fast usage is growing

Jeff Teper @jeffteper

Page 3: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content
Page 4: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Focused on Search and

SharePoint since 2004

Longtime

Search Nerd

• CTO, BA Insight

• Senior PM, Microsoft

• VP, FAST

• SVP, LingoMotors

About Jeff Fried

Passionate About

• Search

• SharePoint

• Search-driven

applications

• Information Strategy

Blog:

BAinsight.com/blog

Technet Column

“A View from the

Crawlspace”

[email protected]

Page 5: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

About BA Insight

– Connectivity

– Applications -

– Classification -

– Analytics

Page 6: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

KCTCS (background)

Page 7: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content
Page 8: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Search is not stationary

Page 9: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Demo

9

Page 10: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Why Hybrid SharePoint?

Page 11: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

The

Evolution

of

SharePoint:

HYBRID Management ExtensibilityExperiences

| Server

Experiences Management Extensibility

| Server | Server

HYBRID

Team

Sites

Portals

Enterprise

Content Mngt

BI

Page 12: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Search Provides a Unified View

Page 13: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

SharePoint 2013/2016 Search Architecture

Web Service (CEWS)

Page 14: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

“Classic” Hybrid Search is Federated

not a single result set OOB

Page 15: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Cloud Hybrid Search

Page 16: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content
Page 17: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Benefits of Cloud Hybrid Search

2) Makes finding content easy, wherever the content lives

1) Simpler, easier, and less costly to run search

Page 18: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

SharePoint Server

(On-premises or Hosted)Office 365

SharePoint Online Content

Onedrive for Business ContentSharePoint Content

Cloud Hybrid Search

Page 19: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Case Study: Split Users with SharePoint

Page 20: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

SupportSales & Marketing

Knowledge Articles

Fileshares

OneDrive

Support forum

SPO

Search Farm

SP 2013 content SP 2010 content

On-premises

Office 365

SPO content

SP 2013/2016

Cloud SSA

Page 21: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Setting up Cloud Hybrid Search

1.

2.

3.

4.

Page 22: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content
Page 23: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Use search verticals with Cloud Hybrid Search

SharePoint Online

Custom result source using Local SharePoint results plus a filter which excludes results from on-premises

TIP: Can be used during validation of hybrid search in the production tenant.

Result source query:

{searchTerms} NOT(IsExternalContent:1)

Page 24: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Result Sources are your friend

The Support Search vertical only searches sites that are relevant to the Support team.

It uses Local SharePoint results plus a filter on which sites to include in the search results

Result source query:

{searchTerms} (

Path:»http://sp2010» OR

Path:»file://fileshare» OR

Path:»http://demohybrid.../../supportforum»)

SharePoint Online Support Search

Page 25: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Demo

25

Page 26: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Single node topology

VM

Crawler

CPC

(unused)

APC

(unused)

Indexer

(unused)QPC

Page 27: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Multi-node topology

1.

2.

3.

VM

Crawler

QPC

VM

Crawler

CPC

(unused)

APC

(unused)

Indexer

(unused)QPC

Page 28: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Reduce your footprint

Servers

Volume of Content(indexable items) Pattern

On-prem Search Farm

Cloud Hybrid Search

0-10 million items small 4 App + 2 DB 1 or 2

10-40 million items medium 12 App + 2 DB 2

40-100 million items large 28 App + 4 DB 2

400 million items XL example (SP2016) 86 App + 4DB 2 or 3

Page 29: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Item Limits and Pricing

Licensing: 1M items of external content in index for every 1TB storage in O365

1TB included by default

+ 0.5 GB per licensed O365 user

No limit on number of items from O365 in the index

Default throttling at 20M external items; current threshold at 25M

2000 users x 0.5 GB = 1TB

+ 1TB default = 2 TB total

-> 2M external items indexed

+ Can also buy the “Office 365 Extra File Storage” Add-on

$0.20/GB/Month = $200/TB/Month = $200/M items/Month

50,000 users x 0.5 GB = 25TB

+ 1TB default = 26 TB total

-> 26M external items indexed

Page 30: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

SharePoint 2016 Hybrid

Cloud Hybrid

Search User Profiles Following

Extranet

Compliance

(DLP/e-

Discovery)

Config

Experience

Built on Search

Page 31: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Advantages•

Disadvantages

Cloud SSA Pro/Con versus on-prem

Page 32: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content
Page 33: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

External Content

(on-premises and/or

in the cloud)

SharePoint Server

(On-premises or Hosted)Office 365

SharePoint Online Content

Onedrive for Business Content

Co

nn

ecto

rs

SharePoint Content

Adding External Content

Cloud Hybrid Search

Also drives:

• Office Graph (delve,..)

• Compliance (DLP, …)

Page 34: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Connectors to MANY Enterprise Systems

ERP and Portal Systems•••••

Page 35: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

External Content in O365 UX

Unified view across all content - on-premises and on-line- inside and outside SharePoint

Page 36: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

DLP Sensitive Data Search works with hybrid

Search for sensitive data across on-premises and SharePoint Online

All Built-in sensitive types

Identification and export

Extends to data in OneDrive

Sensitive Information type detection through KQL searches

Get instant statistics

Preview & export results

Page 37: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Current Caveats:

1) don’t see thumbnails, just file icons

2) Have to query for it to show up

Page 38: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Case Study: Cloud SSA, external content

Large global company

in materials science

Page 39: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

DirSync SP 2007/2010/2013 Fileshares BCS

Cloud SSA

SPO

Search Index

1

2

34

5

6

7

Logical architecture: crawling

Corporate

network

Office 365

3rd Party Connectors

Page 40: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

External Content

(on-premises and/or

in the cloud)

Custom

Processing

CEWS

Bottlenecks:

1) Source systems

2) Content Processing

3) Indexer

….

Page 41: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

External Content

(on-premises and/or

in the cloud)

Bottlenecks:

1) Uplink

2) Source systems

….

Page 42: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

42

Performance

Page 43: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

500K items crawled on an Azure D3

50 DPS 100 DPS

1 hour

Page 44: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

SCS under the hood

Crawler

Content

Indexing

API

Blob store

Document state table

Work queues

Backend

API

Index/Graph

On-Premises content source

Search farm

Azure

Broker

Crawler

Content

SPO content source

Page 45: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

What is pushed to the SCS Endpoint?

SharePoint 2013/ 2016

FileShares

Page 46: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Her user token gets rehydrated with her online claims as she is authenticated against Office 365.

Cloud SSA

SPO

Search Index

Logical architecture: query

Corporate network

SP 2013

1

2a

Jaden issues a query from Office 365.

Her user token contains her online identity and group memberships.

1

Jaden issues a query from a site on-premises. This sends over her on-premises claims to SPO

2a

2b

2b

Office 365

Page 47: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

SUPPORTED

– Custom IFilter

– BCS connectors

– Partner connectors

Customizations with Cloud Hybrid Search

SUPPORTED

– Tenant level schema mapping

– Query rules

– Result sources

Cloud SSA SCS/O365

NOT SUPPORTED

• Content that requires custom security trimming

NOT SUPPORTED

• Site collection level schema mapping

• Custom security trimming

• Custom entity extraction

• Content enrichment web service

Page 48: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Issues with Cloud Hybrid Search (1)Cloud Hybrid Search "annoyances"

Performance Characteristicsslower query latency for on-prem queries against Cloud SSA

SharePoint Online Limitationsno synonyms

no site-level schema

no full trust code access

Hybrid Administration Weaknessesclunky metadata mapping

can't remove on-premises search results from Cloud SSA

trickier to test & debug crawls

can't reset index from Cloud SSA

Be aware of these

& compensate for them

(Fixed in August PU)

(Semi-addressed in June PU)

And it’s getting better:

Page 49: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Should I run index reset?

NO!DeleteAllCloudHybridSearchContent()

https://blogs.technet.microsoft.com/beyondsharepoint/2016/07/07/cloud-hybrid-search-service-application-removing-items-from-the-office-365-search-index/

Page 50: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Issues with Cloud Hybrid Search (2)

50

Content Enrichmentno CEWS

no Entity Extraction

Securityno Custom Security Trimming

Can't crawl across Multiple Domains

Can't Crawl SP in Classic Auth Mode

Data Sovereigntyexport-restricted content

can't be put in O365 index

Limitations of Cloud SSA

Page 51: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

External Content

(on-premises and/or

in the cloud)

SharePoint Server

(On-premises or Hosted)

SPO Content

OneDrive Content

Co

nn

ecto

rs SharePoint Content

Connector

Framework

Office 365

AutoClassifier

(app version)

CEWS

Custom

Processing

Page 52: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Case study:Content Enrichment

Content

CloudSSA

Connector Framework

IndexingConnectors

Smart Pipeline

AutoClassifierCustom Stage A

CustomStage C

Custom Stage B

Page 53: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Online

On-Prem

Cloud Hybrid Search under the coversSecurity = identity sync + ACL mapping

Cloud SSACloud SSA

ParseCrawl

SCS

ACL Map Process

Blob store

queue

Page 54: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Directory Synchronization

SID S-1-5-21-1212121212-1212121212-1212

[email protected]

msOnline-OnPremiseSecurity

Identifier

S-1-5-21-1212121212-1212121212-1212

PUID PUID-XXXX-XXXXXXXXXX

Page 55: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Mapping of Access Control Lists

Allow: S-1-5-21-1212121212-1212121212-1212 Allow: PUID-XXXX-XXXXXXXXXX

• User SIDs are mapped to PUIDs

• Group SIDs are mapped to Object IDs

• «Everyone» and «Authenticated users» are mapped to

«Everyone except external users»

Only AD Users and Groups,

Only from one domain

Page 56: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Case Study: Crawling Cross-Domain

Page 57: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

A global single index solution

Cloud SSA

Cloud SSA

Cloud SSA

Cloud SSA

Cloud SSA

BUT export-restricted content

can’t be in the global index

Page 58: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Issues with Cloud Hybrid Search OOB

Content Enrichmentno CEWS

no Entity Extraction

Securityno Custom Security Trimming

Can't crawl across Multiple Domains

Can't Crawl SP in Classic Auth Mode

Data Sovereigntyexport-restricted content

can't be put in O365 index

Limitations of Cloud SSA BA Insight Solution

Connector Framework

AutoClassifier

Connector Framework

can 'map down' to AD groups

can 'map across' cross-domain

can crawl and map security

Federator

Page 59: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content
Page 60: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content

Key Considerations for Hybrid: Workloads, Environment, Data, Customizations

Availability of features Online versus

On-Premises on particular workloads

Significant investments in

customization of On-Premises

workloads

Concerns over global network

performance with remote sites

Regulatory

considerations

Manageability concerns

Page 61: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content
Page 62: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content
Page 63: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content
Page 64: Understanding and Applying Cloud Hybrid Searchnellisconsultingllc.com/Resources/SPTechConSF2016Slides...SharePoint Server (On-premises or Hosted) Office 365 SharePoint Online Content