iso/iec 11179, part 2: classification schemes jim carpenter bureau of labor statistics nancy lawler...
TRANSCRIPT
ISO/IEC 11179, Part 2: Classification Schemes
Jim CarpenterBureau of Labor Statistics
Nancy LawlerDepartment of Defense
Open Forum on Metadata Registries 2003January 20 – 24, 2003Santa Fe, New Mexico
Disclaimer
• Part 2 is a working draft– Significant deficiencies
• This presentation proposes a new model– Not compliant with Part 3 (International
Standard)– Strategy is to decide what is needed & then
write a compromise compliant version of Part 2– New version to be backwards compatible with
compliant version
Thanks to many folks in the following communities that contributed ideas at this Open Forum
• Terminology Standards (TC 37, SC4)
• Statistical Community (UN Standards)
• Others in JTC1, SC 32, WG2 & WG4
Apologies if we missed some.
Classification SchemeISO standard definition
the descriptive information for an
arrangement or division of objects
into groups based on characteristics,
which the objects have in common.
Classification Scheme• the descriptive information for an arrangement or
division of objects into groups – information for
• a process – classifying objects
• a result – characterization of objects
• based on characteristics, which the objects have in common – Classification methods are out of scope
– Counter examples • 19th Century RR ticket fee classification (a turtle is an insect) –
The Platypus and the Mermaid: And Other Figments of the Classifying Imagination by Harriet Ritvo
• Aborigine category for “women, fire, and dangerous things” (book by same title by G. Lakoff)
• Fuzzy categories, e.g., bald men (Fuzzy set theory, Lotfi Zadeh )
utility: find & understand
Benefits of Part 2
• Classifying administered items promotes– Understanding– Finding
• Registering classification schemes promotes– Sharing – Improving
• Classification schemes
• Systems
Classification_Scheme_Item_Relationshiptype_description [1..1] : String
Classification_Schemeadministration_record [1..1] : Administration_Recordtype_name [1..1] : String
Classification_Scheme_Itemvalue [1..n] : Stringtype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Category concept
Item to classify
Classification RegionConformant Version
Where is the informationfor classifying?
Classification RegionProposed Version
(tentative)
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Administration Record
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Attributes Inherited Through Administration Record
(No changes.)
• Terminology Attributes
• Administration Attributes
• Reference Document Attributes
• Submission Attributes
• Stewardship Attributes
• Registration Authority Attributes
Terminology Attributes• name• preferred name (yes or no)• language identifier of name• definition • preferred definition (yes or no)• language identifier of definition• source reference• context administration record• context description• context language identifier
Designation
Definition
Administration Attributes• item identifier• registration status• administration status• creation date• last change date• effective date• until date• change description• note & explanatory comment• unresolved issue• origin
Reference Document Attributes
• identifier
• type description
• language identifier
• title
• organization name
• organization mail address
Submission Attributes
• organization name
• organization mail address
• contact name
• contact title
• contact information
Stewardship Attributes
• organization name
• organization mail address
• contact name
• contact title
• contact information
Registration Authority Attributes
• identifier
• organization name
• organization mail address
• documentation language identifier
• registrar identifier
• registrar contact name
• registrar contact title
• registrar contact information
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Other attributes in theClassification Region
Source artifact type
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Classification Scheme
Source Artifact Type• The type of artifact that contains the names
and/or definitions of each category• Suggests features of the classification
scheme, for example:– subject matter– intended use– structure– content
Classification Scheme
Source Artifact Type
• Class type attributes:– Generic artifact type
– Standard or Product name
– XML Schema reference
Examples of Generic Artifact Types
• key word list• thesaurus• glossary• project charter• yellow pages• product exhibit• organization chart• flow chart
• E-R model• class diagram• state diagram• activity diagram• parts diagram• taxonomy• ontology
Classification Scheme
Source Artifact Type
• UML Class Diagram• IDEF0 Process Model• Microsoft Project activity diagram• Rational Rose class diagram• XML schema URL of tool that produced the artifact• XML scheme URL of artifact
Examples
?
Classification Scheme
Source Artifact Type
Structure type
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Classification scheme
for
classification schemes
Classification Scheme
Structure Type
Classification Scheme
Structure Type• Relationships exist (yes or no)• Predominant relationship type• Hierarchy
– Single vs. multiple inheritance– Levels
• Graph (network)– Cyclic– Acyclic
• Other: category order, overlap, completeness
?
• Assume MDR has a registered classification scheme for classification scheme structures.
• Classify the structure as– Classification scheme administered item
identifier– Category concept identifier
Classification Scheme
Structure TypeSelf-referential alternative
Notation system
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Classification Scheme
Notation System(Non-term signs)
• Icon
• Sound
• Animation sequence
• Formula
• others
Standards may apply
Variance estimate
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Reference Document class type– identifier– type description– language identifier– title– organization name– organization mail address
Classification Scheme
Variance Estimate Reference
Identifier
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Category Concept
Identifier
• Unique within classification scheme
• Global identifier (within MDR)– Classification scheme identifier plus– Category concept identifier
Term Entry
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Definitiondefinition_text [1..1] : Stringpreferred_definition [0..1] : True_Falsesource_reference [0..1] : Reference_Document
Terminological_Entry
Language_Sectionlanguage_identifier [1..1] : Language_Identification
Designationname [1..1] : Stringpreferred_designation [0..1] : True_False
1..1
0..*
1..1
0..*
terminological_entry_languages
1..1
0..*
1..1
0..*
name_entry
1..1
0..*
1..1
0..*
definition_entry
0..10..1
0..10..1
term_definition_pairing
May replace with ISO 16642 model:Terminology MarkupFramework
•Cardinalities are controversial.•In Part 3, Terminological Entity is a relationship between admin. item & context.•Need more than definition for classification.
Sign Entry
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
• Sign Entry class type– Notation system identifier– Reference to the sign in the notation system
• Example– Category concept = White Owl– Notation system = Peterson’s Bird Calls– Sign Reference = Song of White Owl WAV
Category Concept Sign Entry
Type Name
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Category Concept
Type Name
Generic term for the type of category
associated with the classification scheme type
Category Concept
Type Name Examples
Classification
Scheme Name
Classification Scheme Type
Category Concept Type Names
Dewey Decimal System hierarchical Subject, Discipline
International Biological Taxonomical System
Taxonomy Taxon
ISO/IEC 11179 Registry UML Model
UML Model Class
Washington DC Yellow Pages
Yellow Pages Business category, Professional category
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
type description
Category Concept Relationship
Type Description
• Text describing relationship type of a category concept
• Description varies by – Type of classification scheme– Application– Each pair of category concepts
Category Concept Relationship
Type DescriptionExamples
Classification
Scheme Name
Category Concept pair
Category Concept Relationship Type Description
Dewey Decimal System Politics & Religion
None
International Biological Taxonomical System
Homo & sapiens Subtype
ISO/IEC 11179 Registry UML Model
Classif. Scheme & Category concept
Membership
Washington DC Yellow Pages
Airport & Airport transportation
Business subtype
Category_Concept_Relationshiptype_description [1..1] : Stringfacet [0..1] : String( More attributes)
Classification_Schemeadministration_record [1..1] : Administration_Recordsource_artifact_type [0..1] : Artifact_Typestructure_type [1..1] : C_S_Structure_Typenotation_system [0..n] : Notation_Systemvariance_estimate_reference [0..n] : Reference Document
Category_Conceptidentifier [1..1] : Stringterm_entry [0..1] : Terminological_Entrysign_entry [0..n] : Sign_Entrytype_name [0..1] : String
1..*
0..*
1..*
0..*
membership
0..*
0..*
0..*
0..*
Administered_Itemadministered_item-administration_record [1..1] : Administration_Record
0..*
0..*
0..*
0..*
classification
Facet
Category Concept Relationship
Facet• Characteristic used to separate categories
within a category
• Other terms for facet: dimension, slot, …
• Example– Parent category: writing instrument– Facet: marking material– Child categories: chalk, ink pen, lead pencil, …
Questions• Jim Carpenter
– [email protected]– Bureau of Labor Statistics
• Nancy Lawler– [email protected]– Department of Defense