Mark Davis: Advanced Search and Analytics in 20 Minutes

Upload: lucidworks-archived

Post on 11-May-2015

Category:

Technology


DESCRIPTION

Kitenga's ZettaVox and ZettaSearch products support the Solr and Lucene ecosystems both at the point of ingestion and for the search user. In this talk, I will show how ZettaVox, our professional content mining platform on Hadoop, can be used to index content and rich metadata into a LucidWorks Enterprise installation. Because it is built on Hadoop, ZettaVox scales up by scaling out. I will then create an end-user search and analytics experience with our ZettaSearch solution, which uses the faceted metadata to enhance information discovery and analysis. All in about 20 minutes.

TRANSCRIPT

Page 1: Davis mark   advanced search analytics in 20 minutes
Page 2: Davis mark   advanced search analytics in 20 minutes

Kitenga reinventing information

Mark Davis Founder/CTO

Page 3: Davis mark   advanced search analytics in 20 minutes

Advanced Search and Analytics

in 20 minutes

Page 4: Davis mark   advanced search analytics in 20 minutes

Scalable Big Data Analytics

Advanced Search

Built on an open source foundation

Page 5: Davis mark   advanced search analytics in 20 minutes

Conquer “Big Data”

Overcome information overload

Transform data into actionable intelligence

Find the needle in the haystack

Page 6: Davis mark   advanced search analytics in 20 minutes

Big Data

Enormous transactional data
Enormous unstructured information
Too big for databases
New tools are needed

Page 7: Davis mark   advanced search analytics in 20 minutes

Get Document

Extract Information

Index
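
The three steps on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the sample documents, and the in-memory inverted index are all hypothetical stand-ins, and nothing here reflects how ZettaVox or LucidWorks Enterprise actually perform these steps.

```python
import re
from collections import defaultdict


def get_documents():
    """'Get Document' step: a hypothetical source returning (id, text) pairs."""
    return [
        ("doc1", "Hadoop scales out across commodity machines"),
        ("doc2", "Solr and Lucene power faceted search"),
    ]


def extract_information(text):
    """'Extract Information' step: here, just lowercase word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents):
    """'Index' step: map each term to the sorted ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in documents:
        for term in extract_information(text):
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


index = build_index(get_documents())
print(index["hadoop"])
```

A real deployment would replace the toy tokenizer with richer extraction (the deck mentions natural language processing) and would send documents to a search server rather than a Python dictionary.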

Page 8: Davis mark   advanced search analytics in 20 minutes
Page 9: Davis mark   advanced search analytics in 20 minutes

• Scalable
• Fault-tolerant
• Network/rack aware

• Parallel programming model: MapReduce

• Cottage industry
• Complex MapReduce model

• Stability
• Command-line tools
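
For readers unfamiliar with the MapReduce model named on this slide, the following is a minimal single-process sketch of its three phases (map, shuffle, reduce) applied to word counting. It assumes nothing about Hadoop's actual API; the function names and sample input are hypothetical, and a real cluster would run the map and reduce calls in parallel across machines.

```python
from collections import defaultdict


def map_phase(doc_id, text):
    """Map: emit a (word, 1) pair for every word in one document."""
    for word in text.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single total."""
    return key, sum(values)


def run_job(documents):
    """Run map, shuffle, and reduce in sequence over a dict of documents."""
    mapped = [
        pair
        for doc_id, text in documents.items()
        for pair in map_phase(doc_id, text)
    ]
    return dict(reduce_phase(key, values) for key, values in shuffle(mapped).items())


counts = run_job({"a": "big data big tools", "b": "big search"})
print(counts["big"])
```

Even this toy job needs three cooperating functions for a simple count, which hints at why the slide lists the model's complexity as a drawback.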

Page 10: Davis mark   advanced search analytics in 20 minutes

The voice of Big Data

Out-of-the-box MapReduce components for content mining

Reduce time-to-action

Integrated visualization and analytics

ZettaVox

Page 11: Davis mark   advanced search analytics in 20 minutes
Page 12: Davis mark   advanced search analytics in 20 minutes

Faceted search for complex metadata

Analytics and search together

Revolutionize enterprise search

Built on open source success

ZettaSearch
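
As a rough illustration of what faceted search over metadata means, the sketch below filters a few documents by a search term and then counts the values of a metadata field among the hits. The documents, field names, and functions are made up for this example; it is not ZettaSearch's or Solr's implementation, only the general idea of combining a search result with per-field counts.

```python
from collections import Counter

# Hypothetical documents with metadata fields (author, year).
DOCS = [
    {"id": 1, "text": "hadoop cluster monitoring", "author": "davis", "year": 2011},
    {"id": 2, "text": "faceted search with solr", "author": "lee", "year": 2011},
    {"id": 3, "text": "search analytics on hadoop", "author": "davis", "year": 2010},
]


def search(docs, term):
    """Return the documents whose text contains the term as a whole word."""
    return [doc for doc in docs if term in doc["text"].split()]


def facet_counts(hits, field):
    """Count how often each value of a metadata field occurs among the hits."""
    return Counter(doc[field] for doc in hits)


hits = search(DOCS, "hadoop")
author_facets = facet_counts(hits, "author")
year_facets = facet_counts(hits, "year")
print(author_facets, year_facets)
```

The facet counts are what a search interface would display next to the results, letting a user narrow by author or year and giving charts something to plot.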

Page 13: Davis mark   advanced search analytics in 20 minutes
Page 14: Davis mark   advanced search analytics in 20 minutes

START THE TIMER

Advanced Search and Analytics

in 20 minutes

Page 15: Davis mark   advanced search analytics in 20 minutes

DID IT WORK?

Advanced Search and Analytics

in 20 minutes

Page 16: Davis mark   advanced search analytics in 20 minutes

ZettaVox 1.4
• Drag-and-drop Hadoop analysis
• Natural Language Processing
• Cluster monitoring
• HDFS-aware analysis tools
• Integrated information visualization

ZettaSearch 1.0
• JSP Custom Taglib search designer
• Charts and tools tied to metadata
• Available for free (soon)

ZettaSearch 2.0
• Drag-and-drop user search designer
• Personalization
• Rich visualization options

Page 17: Davis mark   advanced search analytics in 20 minutes

Questions?  

Page 18: Davis mark   advanced search analytics in 20 minutes