text analytics & linked data management as-a-service
TRANSCRIPT
![Page 1: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/1.jpg)
Text Analytics & Linked Data Management As-a-Service
Marin Dimitrov, Alex Simov, Yavor Petkov
May 31st, 2015
Text Analytics & Linked Data Management -aaS / Wasabi’2015 #1 May 2015
![Page 2: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/2.jpg)
About Ontotext
• Provides products & solutions for content enrichment and metadata management
– 70 employees, headquarters in Sofia (Bulgaria)
– Sales presence in London, NYC & Boston
• Major clients and industries
– Media & Publishing
– Health Care & Life Sciences
– Cultural Heritage & Digital Libraries
– Government
– Education
#2 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 3: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/3.jpg)
• Semantic Technology adoption challenges
• The Self-Service Semantic Suite (S4)
• Lessons learned
Contents
#3 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 4: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/4.jpg)
Semantic Technology Adoption Challenges
#4 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 5: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/5.jpg)
Time-to-value gap (Gartner)
#5 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
From Wasabi @ ESWC’2014
Performance, Integration, Penetration,
Payback & ROI
![Page 6: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/6.jpg)
• Limiting factors
– Complexity & cost of existing solutions
– Limited resources to evaluate novel technologies (startups)
– Slow procurement processes, risk aversion (enterprises)
• How can we…
– Reduce time-to-market
– Reduce adoption risks
– Optimise costs
Semantic Technology adoption
#6 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 7: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/7.jpg)
The Self-Service Semantic Suite (S4)
#7 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 8: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/8.jpg)
• Capabilities for text analytics, content enrichment and smart data management
– Text analytics for news, life sciences and social media
– RDF graph database as-a-service
– Access to large open knowledge graphs
• Available on-demand, anytime, anywhere
– Simple RESTful services
• Simple pay-per-use pricing
– No upfront commitments
What is S4?
#8 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 9: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/9.jpg)
What is S4?
#9 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 10: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/10.jpg)
• Enables quick prototyping
– Instantly available, no provisioning & operations required
– Focus on building applications, don’t worry about infrastructure
• Free tier!
• Easy to start, shorter learning curve
– Various add-ons, SDKs and demo code
• Based on enterprise semantic technology
Benefits
#10 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 11: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/11.jpg)
• Text analytics services
– News annotation
– News categorisation
– Biomedical
• Entity linking & disambiguation
– Mappings to DBpedia & GeoNames instances
– Mappings to biomedical data sources (LinkedLifeData)
• HTML, MS Word, XML, plain text input
• Simple JSON output
Text analytics with S4
#11 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 12: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/12.jpg)
News analytics example
#12
S4 result
Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 13: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/13.jpg)
• Low-cost graph DBaaS available 24/7
• Ideal for small & moderate data volumes
– database options: 1M, 10M, 50M, 250M and 1B triples
• Instantly deploy new databases when needed
• Zero administration: automated operations, maintenance & upgrades
• Users pay only for the actual database utilisation
– Number of triples stored + number of queries per month
• OpenRDF REST API
Fully managed RDF DB in the Cloud
#13 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 14: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/14.jpg)
Fully managed RDF DB in the Cloud
#14 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 15: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/15.jpg)
• SPARQL query endpoint to the FactForge semantic data warehouse
– 500 million entities / 5 billion triples
• Key LOD datasets integrated
– DBpedia, Freebase/WikiData, GeoNames, WordNet
– Dublin Core, SKOS, PROTON ontologies and vocabularies
Knowledge graphs with S4
#15 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 16: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/16.jpg)
Cloud native architecture of S4
#16 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
Elasticity vs High Availability vs
Cost Efficiency
![Page 17: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/17.jpg)
Lessons Learned
#17 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 18: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/18.jpg)
• You must build a “cost aware” cloud platform
• Cloud-native architectures are more efficient, but more difficult to build
• A microservices architecture improve system resilience & agility, but difficult to design right
• Extensive and continuous benchmarking & monitoring
– Some problems emerge only at large scale
• Assume failures will happen & design for resilience
Lessons learned
#18 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015
![Page 19: Text Analytics & Linked Data Management As-a-Service](https://reader035.vdocuments.site/reader035/viewer/2022062420/55c168b0bb61ebb66e8b45f9/html5/thumbnails/19.jpg)
Thank you!
#19 Text Analytics & Linked Data Management -aaS / Wasabi’2015 May 2015