enterprise search: a review of the state-of-the-art, and suggestions for search curation
TRANSCRIPT
Enterprise Search A Review of the State-of-the-Art,
and Suggestions for Search Curation
Jet Propulsion Laboratory California Institute of Technology
Camille Mathieu • 10 December 2014
Presentation Outline
1. Current practices in enterprise search management o Review of recent field developments and trends o How JSearch fits in this environment
2. Suggestions for JPL search curation o Developing workflow o Short- and long-term considerations
Searching the Enterprise: Components
"enables employees to find all the information that the
company possesses without the need to know where the information is stored"
(White 2013)
● content definition ● indexing
● query management ● ranking of results
Searching the Enterprise: History
1990s: Text REtrieval Conference (NIST) 2000: IDC’s The High Cost
mid-2000s: Search acknowledged 2010s: Resources developed
2013: “the year of enterprise search”?
Searching the Enterprise: Statistics
71%: search is vital or essential ● only 18% have cross-repository search
capabilities (Miles 2014)
67%: content management investments increasing by 15-20% annually (Gleanster 2013)
The Enterprise Search Environment
Currently optimal: ● scalability and interoperability ● increased search function ● semantic intelligence ● dedicated search staff
(Benghoz and Chamaret 2010)
The Enterprise Search Environment Trends in enterprise search strategy: ● expanded scope (esp. email) ● mobile search ● personalized search/social tagging ● LaSO for improving search
(Benghoz and Chamaret 2010)
Search “curation”
● efforts to mediate search problems and improve search experience for specific users or to meet enterprise goals o user-focused o organization-specific o “findability”
State-of-the-art in search “curation”
● query log analysis ● measuring and improving relevancy
o metrics solutions
● metadata improvement and standardization o content curation solution
Query Log Analysis - Account for modes of searching
- navigational, transactional, informational, etc.
- Obtain clickthrough, reformulation, and other data to tease out patterns
- Identify user groups and patterns
- Perform query clustering for LaSO
Relevancy Measures and Improvements
- Test regularly for system precision and recall
- Measure curation success from these data - Expert/user mediation can be effective
Metadata/Taxonomy
- Assess standards in different departments - Automate cleaning/standardization practices
where possible - “Junk in, junk out” principle
Final Thoughts Search is BIG! Managed search is essential
● Never finished; a “process”
Search curation is centered on the user experience/content findability.
Search strategies should be developed with future trends in mind.
References Benghoz, Pierre-Jean, and Cécile Chamaret. "Economic Trends in Enterprise Search Solutions " In JRC Scientific and Technical Reports, 2010. Feldman, Susan, and Chris Sherman. "The High Cost of Not Finding Information: An IDC White Paper." International Data Corporation, 2001. Gleanster. "Deep Dive: Future-Proof Your investments in DAM." Gleanster, LLC, 2013. Miles, Doug. "Aiim Industry Watch Search and Discovery - Exploiting Knowledge, Minimizing Risk." Silver Spring, MD: AIIM: The Global Community of Information Professionals, 2014. Rosenfeld, Louis. Search Analytics for Your Site: Conversations with Your Customers. Rosenfeld Media, 2011. White, Martin S. Enterprise Search. Beijing: O'Reilly, 2013.