Download - Artificial Intelligence What’s Possible, What’s Not, How Do We Move Forward? Adam Cheyer Co-Founder, VP Engineering Siri Inc

Artificial Intelligence

What’s Possible, What’s Not, How Do We Move Forward?

Adam Cheyer

Co-Founder, VP EngineeringSiri Inc

The Future of AI• AI: What are we trying to achieve?• What does it require?• Why is it hard?• What approaches are there?• What works well?• What doesn’t?• Where is the state of the art today?

– Commercial “AI Engines”• What are our best hopes of success?• What holds us back?• What does the future look like?

– In 5 years.... In 15… In 25…

AI: What are we trying to achieve?

Apple’s Knowledge Navigator (1987)

Interaction with the Assistant

• Touch screens and cinematic animation• Global network for info and collaboration• Awareness of temporal and social context• Continuous Speech in and out• Conversational Interface - assistant talks back• Delegation of tasks to the assistant• Assistant use of personal data

Is the Knowledge Navigator vision possible today?

But we're getting there.

No.

How Close are we Today?

• Touch screens• Cinematic effects• Global network• Location and time awareness• Speech out, on demand• Continuous speech to text

But where is the interface for assistance?

You just can't talk to a search engine this way.

"The future of search is a conversation with someone you trust."

-- John Battelle, The Search

And it all needs to work together…

Location AwarenessConversational Interface

Speech to TextTime Awareness

Text to Intent

Access to Personal Information

Dialog flowTask Awareness

Semantic Data

Services APIs

Task & DomainModels

Reasoning

Preferences

What does it require?

Planning Scheduling Learning

Why is it hard?

• Each component technology is complex

• Informal, incomplete grammar of English is larger than 1,700 pages

R. Quirk et al., A Comprehensive Grammar of the English Language, Longman, 1985.

book 4 star restaurant in Boston

cityRestaurant name

+ 43 other fragment interpretations…

8 Boston’s in US…

These combine into many valid interpretations

Why is it hard?

• “Common sense” knowledge is fundamental to all components– Don’t yet have sufficient representations for logical

reasoning– *Huge* amounts of knowledge required, where

does it come from?– How to manage the scale of the two?

• Each component area uses different technologies, languages, methods yet deep integration is fundamentally required

What approaches are there?

• Simple heuristic rules plus enormous computation (search)

• “Deep” knowledge approach– Typically relies on hand-coded grammars,

ontologies, and rules• Statistical approach relying on learning

probabilities from large corpora

What works well?

• All the approaches work well – for some problems

– Massive search with simple heuristics• Deep Blue beats world chess champion• Genetic Finance beats benchmarks on stock prediction

– Statistical training based on massive data• Speech recognition• Machine translation• Web search• Read: “The Unreasonable Effectiveness of Data”

– “Deep” knowledge approach• Urban Challenge/Robotics• Multiplayer Virtual Games

What doesn’t?

• But they have their limitations

– Massive search with simple heuristics• Only certain problems fit into this category

– Statistical training based on massive data• Again, works only for certain problems due to availability of data

and shallowness of scope

– “Deep” knowledge approach• Too brittle• How to get the data?

State of the Art: AI Engines• Google – A “Search Engine”• Wolfram Alpha – A “Compute Engine”

– Mathematica at the core– Huge, curated fact base

• True Knowledge – An “Inference Engine”– Chains inferences together, reasons about time– Collaborative knowledge entry for enhancing KB

• IBM’s Deep QA – An “Entity Retrieval Engine”– Statistical retrieval of concepts/entities

• Siri Virtual Personal Assistant – A “Do Engine”– Reasons about capabilities of external services– Conversational (spoken) interaction, with context– Personalized: learns and applies info about user

http://www.trueknowledge.com/demo-video/

http://www.youtube.com/watch?v=oORzFifLFGI

http://www.youtube.com/watch?v=FC3IryWr4c8

http://www.youtube.com/watch?v=MpjpVAB06O4

Architecture connects services to what people do, where they are.

Web Services Delivery Platforms Domain Modules

CORTEX™

entertaining

places

traveling

shopping

scheduling people

what SIRI delivers what SIRI knows where SIRI is

http://images.google.com/imgres?imgurl=http://www.utexas.edu/its/blackberry/images/blackberry.gif&imgrefurl=http://www.utexas.edu/its/blackberry/index.php&h=463&w=310&sz=118&hl=en&start=4&sig2=f3N_E4dd8rE0XEIzZHY6zg&tbnid=2ZJ1HllgAnH0nM:&tbnh=128&tbnw=86&ei=TrbARqTKKZ7eggOI7uCaCA&prev=/images?q=blackberry&svnum=10&hl=en

http://images.google.com/imgres?imgurl=http://www.techshout.com/images/yahoo-microsoft.jpg&imgrefurl=http://community.eons.com/groups/topic/210511-Instant-Messaging-&h=280&w=280&sz=29&hl=en&start=17&um=1&tbnid=u1gAD_1RtLTxsM:&tbnh=114&tbnw=114&prev=/images?q=instant+message&svnum=10&um=1&hl=en&safe=off

http://www.gearfuse.com/wp-content/uploads/andrew/4_mar07/thinkcentre_m52_tower_1.jpg

http://images.google.com/imgres?imgurl=http://www.connexion.be/blog/user/files/iphone_home.gif&imgrefurl=http://www.connexion.be/blog/permalinks/2007/01/11/Great-marketing-move-iPhone-wars/&h=495&w=300&sz=41&hl=en&start=16&sig2=4Xyt65E0jmiIQXQyCgItJg&tbnid=xiD2aOVKhhxQHM:&tbnh=130&tbnw=79&ei=0rXARtfCMZ7eggOI7uCaCA&prev=/images?q=iPhone&svnum=10&hl=en

http://images.google.com/imgres?imgurl=http://www.techcrunch.com/wp-content/yelplogo.jpg&imgrefurl=http://www.techcrunch.com/2005/10/12/yelps-local-reviews/&h=104&w=210&sz=8&hl=en&start=7&sig2=ooPaf6d12DTaneRJ22rx2A&tbnid=6M0nhsAJOpw23M:&tbnh=52&tbnw=106&ei=QajARsbPMozSgAO_qICmCA&prev=/images?q=%60yelp%60&svnum=10&hl=en

http://www.hotelsatanywhere.com/

http://images.google.com/imgres?imgurl=http://www.otair.com/images/upload/Fandango.gif&imgrefurl=http://www.otair.com/index.cfm?action=shortCodes&h=92&w=150&sz=5&hl=en&start=2&sig2=_evkgmBWwgIFadDArXwMdw&tbnid=1UFfaqh-4jTDYM:&tbnh=59&tbnw=96&ei=C6nARs2ELaX4gQO9m92KCA&prev=/images?q=fandango+movies&imgsz=small%7Cmedium%7Clarge%7Cxlarge&svnum=10&hl=en

http://images.google.com/imgres?imgurl=http://www.ism.ws/ISMApps/Admin/SupplierDirectory/images/Rearden%20Commerce.gif&imgrefurl=http://www.ism.ws/SupplierDirectory/DetailRecord.cfm?ID=173&CAT=&h=75&w=298&sz=5&hl=en&start=17&sig2=3qUxDSxoqKcw87A3GRMMaA&tbnid=LP5ifsfKhq8M5M:&tbnh=29&tbnw=116&ei=I6nARvKXIqKQggOp_Z2QCA&prev=/images?q=rearden+commerce&imgsz=small%7Cmedium%7Clarge%7Cxlarge&svnum=10&hl=en

http://images.google.com/imgres?imgurl=http://www.mobiledesigntech.com/images/Logo-Handango.jpg&imgrefurl=http://www.mobiledesigntech.com/corporate/partners.htm&h=121&w=335&sz=7&hl=en&start=1&um=1&tbnid=tq9HoySB-XYseM:&tbnh=43&tbnw=119&prev=/images?q=logo+handango&svnum=10&um=1&hl=en&safe=off&sa=N

http://images.google.com/imgres?imgurl=http://www.mobiletracker.net/archives/images/tv-guide-logo.jpg&imgrefurl=http://www.mobiletracker.net/archives/2006/08/23/tvguide-mobile-search&h=130&w=171&sz=8&hl=en&start=3&um=1&tbnid=V_FYBdJdZvS4XM:&tbnh=76&tbnw=100&prev=/images?q=logo+tv+guide&svnum=10&um=1&hl=en&safe=off

http://images.google.com/imgres?imgurl=http://blogs.eurielec.etsit.upm.es/miotroblog/images/logo_facebook-rgb-7inch.jpg&imgrefurl=http://blogs.eurielec.etsit.upm.es/miotroblog/?m=200609&h=790&w=2100&sz=135&hl=en&start=1&um=1&tbnid=bZ8LB-N8mFkevM:&tbnh=56&tbnw=150&prev=/images?q=logo+facebook&svnum=10&um=1&hl=en&safe=off

http://images.google.com/imgres?imgurl=http://www.japanprobe.com/2007/01/youtube-logo.jpg&imgrefurl=http://www.bitsofbeats.com/&h=146&w=220&sz=10&hl=en&start=3&um=1&tbnid=A0USjXelPq5bwM:&tbnh=71&tbnw=107&prev=/images?q=youtube+logo&svnum=10&um=1&hl=en&safe=off

http://images.google.com/imgres?imgurl=http://www.paladarrestaurant.com/LOGO-zagat.gif&imgrefurl=http://www.paladarrestaurant.com/press.htm&h=35&w=82&sz=1&hl=en&start=9&um=1&tbnid=2fGYG81gIlKt2M:&tbnh=32&tbnw=75&prev=/images?q=logo+zagat&svnum=10&um=1&hl=en&safe=off

http://www.sfstation.com/

http://images.google.com/imgres?imgurl=http://andrewteman.org/blog/images/citysearch_logo.gif&imgrefurl=http://andrewteman.org/blog/?p=135&h=75&w=100&sz=3&hl=en&start=2&um=1&tbnid=gAhYZrcWsILc-M:&tbnh=62&tbnw=82&prev=/images?q=citysearch+logo&svnum=10&um=1&hl=en&safe=off

KnowledgeDataLanguage

DialogPersonalizationLearning

Knowledge Data Language

Context Dialog Learning

Task Models Service Coordination Transactions

Siri’s Cortex Platform

Unified Platform

Integrating AI Technologies

What are our best hopes of success?

• Integrating many AI components into single system• Learning from Massive Data

– Web, but soon all books, music, tv/video, …• Learning from Massive Usage

– The internet population is growing at enormous rate• Learning from Active Teaching & Collaborative

Intelligence• Hybrid probabilistic/logical approaches

• Or… something completely different – Allen institute for brain science?

What holds us back?

• Software – Brittle/fragile– “Anti-Moore’s Law” – gets slower– Ex: boot MS Word

• Human understanding moves slowly– Engelbart: co-evolution of technology and human

understanding/adoption– Ex: collective intelligence progress…

AI in the future: 5 Years…

• Everyone will have a Siri-like assistant and will rely on it increasingly for – mobile tasks– internet tasks (e.g. travel, e-commerce)– communication tasks– entertainment/attention


• Common sense knowledge models and reasoning components begin to be more feasible – systems seem “smarter”, more general, are less brittle, make less stupid mistakes– Contributions from the masses– Scale issues in probabilistic/logic start to resolve


Who Knows?

Poll Question• Robocup goal successful?

– By mid-21st century, a team of fully autonomous humanoid robot soccer players shall win the soccer game, complying with the official rule of the FIFA, against the winner of the most recent World Cup.

http://en.wiktionary.org/wiki/Autonomy

http://en.wikipedia.org/wiki/Humanoid

http://en.wikipedia.org/wiki/Robot

http://en.wikipedia.org/wiki/Soccer

http://en.wikipedia.org/wiki/Soccer

http://en.wikipedia.org/wiki/FIFA

http://en.wikipedia.org/wiki/FIFA_World_Cup

http://www.youtube.com/watch?v=Ca0-v0OTml0&feature=player_embedded

Adam Cheyer

[email protected]

Siri available for free in the Apple App Store

Download - Artificial Intelligence What’s Possible, What’s Not, How Do We Move Forward? Adam Cheyer Co-Founder, VP Engineering Siri Inc

Top Related