© 2007 megaputer intelligence inc. polyanalyst data and text mining tool your knowledge partner tm ...
Post on 16-Dec-2015
227 Views
Preview:
TRANSCRIPT
© 2007 Megaputer Intelligence Inc.
PolyAnalyst Data and Text Mining tool
Your Knowledge Partner TM
www.megaputer.com
TM
© 2007 Megaputer Intelligence Inc.
Megaputer Intelligence
• Knowledge discovery tools for business users• Easy-to-understand actionable results
Data OverloadUseful Knowledge
© 2007 Megaputer Intelligence Inc.
PolyAnalyst
• Enterprise level client-server analytical and reporting system– Unlocks business value hidden in massive volumes of data– Efficiently analyses both structured data and free-form text– Allows business users to easily generate actionable results – Simplifies complex business analysis– Offers simple visual means for building reusable analytic scripts– Readily scales with growing volumes of data– Provides executives with re-executable custom analytic reports
© 2007 Megaputer Intelligence Inc.
Two types of PolyAnalyst users
Data Analyst Decision Maker
Visual analytic scenario Interactive up-to-date reports
© 2007 Megaputer Intelligence Inc.
PolyAnalyst application domains
Government
Insurance
Financial
High Tech
Consumer Products
Manufacturing
© 2007 Megaputer Intelligence Inc.
Technical capabilities
• Visual building of reusable analytical scripts that include:– Intelligent spell checking– Categorization– Clustering– Entity extraction– Natural language search– Interactive multi-dimensional analysis– Visual link analysis– Scheduled re-execution of analysis
• Development of reusable self-updating report templates– Up-to-date dashboards displaying key factors– Interactive presentation of results of analysis
© 2007 Megaputer Intelligence Inc.
Handled business tasks
• Survey data analysis• Call Center data analysis• E-mail target routing• Repair notes analysis• Incident report analysis• Claims notes analysis• Competitive intelligence• Fraud detection• Intellectual property research
© 2007 Megaputer Intelligence Inc.
Generated term replacements
Substitution rules are applied automatically when script is re-executed
© 2007 Megaputer Intelligence Inc.
Advanced search engine
Supports natural language and PDL-based searching
© 2007 Megaputer Intelligence Inc.
Pattern Definition Language
Pattern Definition Language operators
© 2007 Megaputer Intelligence Inc.
Dictionary Manager
PolyAnalyst Dictionary Manager: pre-loaded and user-defined dictionaries
© 2007 Megaputer Intelligence Inc.
Dictionary Editor
PolyAnalyst Dictionary Editor: list of synonyms and other hierarchical semantic relationships
© 2007 Megaputer Intelligence Inc.
Keyword extraction
Frequently encountered terms (and their linguistic modifications)
© 2007 Megaputer Intelligence Inc.
Link Analysis
Correlations of terms on the document, paragraph or sentence level
© 2007 Megaputer Intelligence Inc.
Document clustering - statistics
Distribution of documents by discovered clusters
© 2007 Megaputer Intelligence Inc.
Document clustering - results
Discovered groups of similar documents
© 2007 Megaputer Intelligence Inc.
Taxonomy building
Hierarchical clustering helps build a tentative taxonomy that can be edited
© 2007 Megaputer Intelligence Inc.
Taxonomy categorization
Monitoring data for known issues of importance
© 2007 Megaputer Intelligence Inc.
Self-learning categorization
Document categorization based on training examples:
Support Vector Machine and Naïve Bayesian algorithms
© 2007 Megaputer Intelligence Inc.
Multi-dimensional analysis
Displays distribution of cases across multiple dimensions
© 2007 Megaputer Intelligence Inc.
Drill-down on text dimensions
Drill-down to 622 cases that have problems with modem
© 2007 Megaputer Intelligence Inc.
Drill-down on text dimensions
Further drill-down to No dial tone problem (20 cases)
© 2007 Megaputer Intelligence Inc.
Scheduling script re-execution
Step 1: Select nodes to re-execute Step 2: Set execution time
© 2007 Megaputer Intelligence Inc.
Report Editor collects results
Nicely arrange key results of performed analysis
© 2007 Megaputer Intelligence Inc.
Interactive Dashboard for execs
Decision makers see and manipulate up-to-date key results
© 2007 Megaputer Intelligence Inc.
Benefits
• Dramatic cost reduction
• Increase in quality and speed of the analysis
• Objective and uniform data-driven analysis
• Discovery of even unexpected issues suggested by data
• Automated monitoring of known problems
• Timely discovery of newly developing issues
• Utilization of 100% of available data: structured and text
• Up-to-date reports for executives
• Easy to use and inexpensive to maintain solution
top related