is opencyc doomed to be the new esperanto, or is oor doomed to be the new electronic data...
TRANSCRIPT
Is OpenCyc doomed to be the new Esperanto, or is OOR doomed to be the new Electronic Data Interchange, or -- even worse -- both!
Doug Lenat
Cycorp
Our content
What we’d want a good host to provide
Given the other, funded, open ontology repository projects going on in the world (e.g. OKKAM), does it need one more?
2
Our Content
OpenCyc (www.opencyc.org)The Cyc Ontology made 100% freely available (yes, 100% free even for commercial purposes)Available for download on SourceForgeOver 30,000 “users”
ResearchCyc (researchcyc.cyc.com)OpenCyc + millions of hand-engineered assertionsFree for R&D purposes Current users: 300 research groups (1/2 academic)
3
What are people doing with it?
• USAF 45th Space Wing: Decision Support• USNavy: Threat Scenario Detection• US Forest Service: Regulatory Compliance• LarKC: Large Knowledge Collider• Medical Research Center: Clinical Trial Cohort Selection (doctors can now directly formulate complex FOPC queries via interactive clarification dialogue; DBs)• Glaxo: semi-automatic ontology alignment across multiple large domain-specific info sources
4
What’s in OpenCyc
(#$isa 596215)
(#$genls 99198)
(#$disjointWith 6114)
(#$resultIsa 4277)
(#$resultGenl 1206)
(#$argIsa 35617
(#$argGenl 5398)
(#$arg1Isa 16748)
(#$arg1Genl 2354)
(#$arg2Isa 14114
(#$arg2Genl 2283)
(#$arg3Isa 3486)
(#$argFormat 5493)
(#$arg2Format 3320)
(#$functionalInArgs 1427)
(#$arity 16416)
(#$arityMin 958)
(#$comment 57305)
(#$genlPreds 7440)
(#$negationInverse 990)
(#$genlMt 26078)
(#$denotationInEnglish 409745)
(#$synonymousExternalConcept 13916)
Explicitly: 300k terms; 14k predicates; 57k classes; 2 million assertions; infin. more nonatomic terms and inferred assertions
5
Systems and Processes
‘lifetime’ of system
energy source
boundary
resource conveyer
resource synthesizer
providerOfMotiveForce
doneBy
transporter
eventOccursAt
6
FunctionalSystem
Specializations
AutocatalyticProcess
Ecosystem
EcologicalProcess
Organization
Organism
Culture-Practice
Metabolism
componentInSystem
agentInEcosystem
hasMembers
anatomicalParts
7
Ecosystem Classes
Ecosystem
BiomeAquatic
LifeZone
DesertEcosystem
TropicalRainforestEcosystem
ChaparralEcosystem
TundraEcosystem
TaigaEcosystem
GrasslandEcosystem
genlsgenls
genls
8
ChaparralEcosystem
MediterraneanClimateCycleclimateOfEcosystemType
MediterraneanScrub
terrainClimateType
GeographicalRegion
Eco-system
genls
genls
Territory Of Santa
Barbara, CA
hasClimateType
9
What We’d Want a Good Host to Provide
A commitment to use – to have contributors all provide content under – some Creative Commons license, as opposed to e.g. a GNU license
Retention of the provenance/lineage of contributed ontological content
Agreement on some of the most fundamental ontological relations
Agreement on a small set of inter-ontology alignment relations
10
Given the other, funded, open ontology repository projects going on in the world (e.g. OKKAM), does it need one more?
OKKAM is already a funded UE FP7 project (~$10M, 3-years) that started 2 months ago. Ontologizing individuals (including organizations such as the USArmy and IBM as individuals), providing a unique identifier and agreed-on set of properties for each individual
DBpedia extracted the content of fact boxes from Wikipedia + 35 open-source ontologies; KBpedia EU STREP ($3M) follow-on and will include true ontology-merging
Lots of other projects which other speakers in this panel will no doubt mention
11
12
FP7 IP - LarKC ConsortiumOrganisation Country
Universität Innsbruk Austria
AstraZenica AB, R&D Sweden
CEFRIEL S.c.r.l. Italy
Cycorp, Raziskovanje in Eksperimentalni Razvoj, d.o.o. Slovenia
Universität Stuttgart, HPCC Germany
Max Plank Gesellshaft Germany
Sirma Group, Ontotext Lab Bulgaria
Saltlux Korea
Siemens Aktiengesellshaft Germany
University of Sheffield United Kingdom
Vrije Universiteit Amsterdam Netherlands
Beijing University of Technology PRC
WHO: International Agency for Research on Cancer France
13
14