Implementing Enterprise Search in SharePoint 2010
TRANSCRIPT
BEST PRACTICES CONFERENCE SHAREPOINT
Clarity. Direction. Confidence.
IMPLEMENTING ENTERPRISE SEARCH IN SHAREPOINT 2010
Ágnes Molnár
SharePoint Server MVP, Senior Solutions Consultant
About the Speaker...
Ágnes Molnár
SharePoint Server MVP
Senior Solutions Consultant, BA Insight
Budapest, Hungary
http://aghy.hu
E-mail: [email protected]
Twitter: @molnaragnes
Sessions
Title - Day / Time / Venue
Enterprise Document Management in SharePoint 2010 as You’ve Never Seen - Monday 10:15 AM, PAV3
Best Practices for Implementing Enterprise Search in SharePoint - Monday 2:30 PM, GBE
Best Practices for Organizing Documents in SharePoint 2010 - Tuesday 12:45 PM, GBE
Enterprise Search
Search Technology that your organization owns and controls
There’s usually a right document
Security is critical
Taxonomies and vocabularies are important
Dates are important
Corporate data has structure
Enterprise Search Benefits
Benefits to the Users
Quickly find what employees and customers are looking for
Financial Benefits (ROI)
Helps employees find information faster
Saves time by not having employees recreate content that already exists
Helps customers find things to buy more quickly
Suggests additional related products
Reduces support costs through self-service
Enterprise Search Benefits
Strategic Benefits (BI)
Learning what users are looking for
Finding what they are not finding (misspellings, vocabulary mismatch, non-existing content, etc.)
Checking what searches are coming from important customers
Learn things about your data you didn’t know
Check whether the terminology that content owners use matches the search terms being used
Improving site navigation
More consistent compliance
The Anatomy of Search
Source: http://searchpatterns.org
Search Technology Concept
Concepts
Content Sources - Host the content
Connectors - Know how to process different content sources
Crawling - Traverse URL space to record items in search catalog
Indexing - Extract information from items to enable efficient matching
Index Partition - Subset of the overall index
Query Servers - Accept query requests from users and return results
Search Center - UI for users to issue queries and interact with results
Query Federation - Return results from non-SharePoint Indexes
[Diagram labels: Content, OpenSearch Source, Crawler, Indexer, Index Partition, Query Servers, Query Object Model]
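The crawl, index, and query stages listed on this slide can be illustrated with a toy inverted index. This is only a minimal Python sketch of the general idea, not how SharePoint implements it; the URLs and the names `CONTENT`, `build_index`, and `query` are hypothetical and chosen for illustration.

```python
from collections import defaultdict

# Hypothetical content source: URL -> text (stands in for crawled items).
CONTENT = {
    "http://intranet/docs/erp-overview": "ERP systems overview for sales",
    "http://intranet/docs/search-guide": "Enterprise search guide",
    "http://intranet/docs/erp-search": "Search inside ERP systems",
}


def build_index(content):
    """Indexing: map each lowercase term to the set of URLs containing it."""
    index = defaultdict(set)
    for url, text in content.items():  # "crawling": visit every item
        for term in text.lower().split():
            index[term].add(url)
    return index


def query(index, terms):
    """Query: return the URLs that contain every term (implicit AND)."""
    sets = [index.get(t.lower(), set()) for t in terms.split()]
    if not sets:
        return set()
    return set.intersection(*sets)


index = build_index(CONTENT)
print(sorted(query(index, "erp search")))
```

The index is built once at crawl time so that a query only needs set lookups, which is the point of "extract information from items to enable efficient matching".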
SP2010 Search Improvements
Architecture:
Service Applications
Enterprise Scale-out (up to 100M docs)
Connectors for LOB Systems
PowerShell support
Query:
Boolean Query Syntax
Prefix Matching
Suggestion while Typing
SP2010 Search Improvements
Results:
New, rich User Interface
Refinement Panel
Improved People Search
Enhanced relevance
SP 2010 Content Sources
SharePoint sites
File Share
Business Data
Website
Exchange Public Folder
Lotus Notes database
...
SP 2010 Scopes
Refine the queries
Scope Rules:
Web address
Property query
Content source
All content
Scope operations:
Include
Require
Exclude
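How the three scope operations combine can be sketched in a few lines of Python. The semantics assumed here are: an item is in scope if it matches any Include rule, matches every Require rule, and matches no Exclude rule. The `ScopeRule` class, the `in_scope` function, and the sample items are hypothetical and only illustrate the logic; they are not a SharePoint API.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Item = Dict[str, str]


@dataclass
class ScopeRule:
    matches: Callable[[Item], bool]  # rule test, e.g. web address or property
    behavior: str                    # "include", "require", or "exclude"


def in_scope(item: Item, rules: List[ScopeRule]) -> bool:
    """Assumed semantics: any include, every require, no exclude."""
    includes = [r for r in rules if r.behavior == "include"]
    requires = [r for r in rules if r.behavior == "require"]
    excludes = [r for r in rules if r.behavior == "exclude"]
    if any(r.matches(item) for r in excludes):
        return False
    if not all(r.matches(item) for r in requires):
        return False
    # Assumption: with no include rules, the remaining rules define the scope.
    return not includes or any(r.matches(item) for r in includes)


# Hypothetical rules: a web address rule, a content source rule, a property rule.
rules = [
    ScopeRule(lambda i: i["url"].startswith("http://intranet/hr/"), "include"),
    ScopeRule(lambda i: i["content_source"] == "Intranet", "require"),
    ScopeRule(lambda i: i["file_type"] == "tmp", "exclude"),
]

doc = {"url": "http://intranet/hr/policy.docx",
       "content_source": "Intranet", "file_type": "docx"}
tmp = {"url": "http://intranet/hr/draft.tmp",
       "content_source": "Intranet", "file_type": "tmp"}
other = {"url": "http://intranet/it/setup.docx",
         "content_source": "Intranet", "file_type": "docx"}

print(in_scope(doc, rules), in_scope(tmp, rules), in_scope(other, rules))
```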
SP2010 Search Federation
Using remote index for SharePoint queries
Prefix match
For example: “weather”
Pattern match
For example: an e-mail query, ^([\w-\.]+)@([\w-]+\.)+([a-zA-Z]{2,4})$
Location Type:
SharePoint Search Index
FAST Index
OpenSearch 1.0/1.1
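The two trigger types on this slide can be sketched in Python. The assumption here is that a prefix trigger fires when the query starts with the prefix and forwards only the remaining text, while a pattern trigger fires when the whole query matches a regular expression. The function names are hypothetical, and the e-mail pattern is adapted from the slide so that Python's `re` module accepts it.

```python
import re

# Pattern adapted from the slide; "-" is moved to the end of the character
# class because Python's re module rejects a range that starts at \w.
EMAIL_PATTERN = re.compile(r"^([\w.-]+)@([\w-]+\.)+([a-zA-Z]{2,4})$")


def prefix_trigger(prefix, user_query):
    """Return the text after the prefix if the query starts with it, else None."""
    words = user_query.split()
    if words and words[0].lower() == prefix.lower():
        return " ".join(words[1:])
    return None


def pattern_trigger(pattern, user_query):
    """Return the trimmed query if all of it matches the pattern, else None."""
    q = user_query.strip()
    return q if pattern.match(q) else None


print(prefix_trigger("weather", "weather Budapest"))
print(pattern_trigger(EMAIL_PATTERN, "user@example.com"))
```

A query that fires no trigger returns `None` in this sketch, meaning the federated location would simply not be asked.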
SP 2010 Search Federation
Features (cont.)
Query Template
{searchTerms} scope:Documents
{searchTerms} type:.doc type:.docx type:.docm
“More Results” link template
Results formatting (XSL)
Usage restrictions (for sites)
Custom Credentials
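The query templates above work by substituting the user's query for the `{searchTerms}` placeholder. A minimal Python sketch of that substitution follows; the function names and the `search.example.com` URL are hypothetical, and URL-encoding the terms for a link template is an assumption about how an OpenSearch-style URL would be filled in.

```python
from urllib.parse import quote_plus


def expand_query_template(template, search_terms):
    """Substitute the user's terms for the {searchTerms} placeholder."""
    return template.replace("{searchTerms}", search_terms.strip())


def expand_url_template(template, search_terms):
    """Same substitution for a URL, with the terms URL-encoded first."""
    return template.replace("{searchTerms}", quote_plus(search_terms.strip()))


print(expand_query_template("{searchTerms} scope:Documents", "ERP rollout"))
# prints: ERP rollout scope:Documents

# Hypothetical "More Results" link template.
print(expand_url_template("http://search.example.com/results?q={searchTerms}",
                          "ERP rollout"))
# prints: http://search.example.com/results?q=ERP+rollout
```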
SP 2010 Search Federation
YES
• the remote site’s robots.txt blocks SharePoint’s crawler
• you need results only for specific keywords and/or keyword patterns in the query
• content changes very often and immediate crawling is needed
• queries run under a different security context
• infrequently queried content
• >500 content sources
NO
• you don’t have enough bandwidth
• content changes very often, but immediate crawling is NOT needed
• content that is not indexed by the remote server
• the remote server does not return RSS or Atom (OpenSearch 1.0/1.1)
Federated Location Connectors
http://technet.microsoft.com/en-us/enterprisesearch/ff727944.aspx
FAST Search Server for SharePoint 2010
FAST Search Server 2010 for SharePoint
Extra Capabilities:
Thumbnails + Previews
Visual Best Bets
Deep Refiners with counts
User context from User Profile
Sorting on any property
Similar search
Extreme scale-out (500M+ docs)
Content processing pipeline
Entity extraction
Easy configuration (user context, visual best bets, promotion/demotion, sorting, refinement)
FAST Search Server 2010 for SharePoint
“I’m a salesman. What should I know about ERP systems?”
“I’m an IT Consultant. What should I know about ERP systems?”
THANK YOU!
E-mail: [email protected]
Twitter: @molnaragnes
Blog: http://aghy.hu