learning abstraction and representation knowledge: an ......does not correspond to the term of...

18
1 Learning Abstraction and Representation Knowledge: an Application to Cartographic Generalisation Jean-Daniel Zucker Univ. Paris VI – LIP6 & CNRS 4, place Jussieu 75252 Paris FRANCE [email protected] Sébastien Mustière IGN – COGIT & Univ. Paris VI – LIP6 2-4, avenue Pasteur 94165 St-Mandé Cedex FRANCE (33)-1-43-98-85-45 [email protected] Lorenza Saitta Univ. del Piemonte Orientale Dip. di Scienze e Tecnologie Avanzate Corso Borsalino 54 15100 Alessandria, ITALY [email protected] ABSTRACT This article proposes a machine learning approach to overcome the knowledge acquisition bottleneck that limits the automation of cartographic generalisation. It first explains why this automation must be guided by a differentiation of two main types of knowledge involved in this process. More precisely, it shows that cartographic generalisation can be accomplished by a combination of two processes: representing (formulating, renaming knowledge) and abstracting (simplifying a given representation). The whole process of creating maps fits into an abstraction framework we developed to account for the difference between knowledge abstraction and knowledge representation. The utility of this framework lies in its efficiency to support the automation of knowledge acquisition for cartographic generalisation as a combined learning of both abstraction and representation knowledge. The results experiments show the interest of this approach. Keywords: abstraction, change of representation, cartographic generalisation, supervised learning. 1 INTRODUCTION Cartographic generalisation 1 is the process of creating maps from over-detailed geographic data. Automating cartographic generalisation is a problem of first importance in cartography. First of all, automatic cartographic tools allow to decrease cost and time necessary to produce paper maps. Then, new maps are increasingly electronic maps. Because of screen resolution these maps ask even stronger generalisation than paper maps. Map-users will also create themselves their maps. Most of the maps are so created by geographic domain experts or decision makers, who are not necessarily specialists in cartography. As a consequence today’s maps are often well dedicated to specific user needs but have a poor cartographic quality. This quality can be re- 1 To avoid any confusion, we should mention here that the term “generalization” used in the field of Cartography does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the notion of abstraction. [Nyerges, 91]

Upload: others

Post on 17-Jun-2020

10 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

1

Learning Abstraction and Representation Knowledge:an Application to Cartographic Generalisation

Jean-Daniel Zucker

Univ. Paris VI – LIP6 &CNRS

4, place Jussieu75252 ParisFRANCE

[email protected]

Sébastien Mustière

IGN – COGIT &Univ. Paris VI – LIP62-4, avenue Pasteur

94165 St-Mandé CedexFRANCE

(33)[email protected]

Lorenza Saitta

Univ. del PiemonteOrientale

Dip. di Scienze eTecnologie AvanzateCorso Borsalino 54

15100 Alessandria, [email protected]

ABSTRACT

This article proposes a machine learning approach to overcome the knowledge acquisitionbottleneck that limits the automation of cartographic generalisation. It first explains why thisautomation must be guided by a differentiation of two main types of knowledge involved in thisprocess. More precisely, it shows that cartographic generalisation can be accomplished by acombination of two processes: representing (formulating, renaming knowledge) and abstracting(simplifying a given representation). The whole process of creating maps fits into an abstractionframework we developed to account for the difference between knowledge abstraction andknowledge representation. The utility of this framework lies in its efficiency to support theautomation of knowledge acquisition for cartographic generalisation as a combined learning ofboth abstraction and representation knowledge. The results experiments show the interest of thisapproach.

Keywords: abstraction, change of representation, cartographic generalisation, supervisedlearning.

1 INTRODUCTION

Cartographic generalisation1 is the process of creating maps from over-detailed geographic data.Automating cartographic generalisation is a problem of first importance in cartography. First ofall, automatic cartographic tools allow to decrease cost and time necessary to produce papermaps. Then, new maps are increasingly electronic maps. Because of screen resolution these mapsask even stronger generalisation than paper maps. Map-users will also create themselves theirmaps. Most of the maps are so created by geographic domain experts or decision makers, who arenot necessarily specialists in cartography. As a consequence today’s maps are often welldedicated to specific user needs but have a poor cartographic quality. This quality can be re-

1 To avoid any confusion, we should mention here that the term “generalization” used in the field of Cartographydoes not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more relatedto the notion of abstraction. [Nyerges, 91]

Jean-Daniel Zucker
Jean-Daniel Zucker
Zucker, J.-D., S. Mustiere and L. Saitta (2000). Learning Abstraction and Representation Knowledge: An Application to Cartographic Generalisation. The fifth International Workshop on Multistrategy Learning (MSL'2000), Guimarães, Portugal.
Page 2: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

2

introduced in these maps if cartographic knowledge is the basis for designing GIS tools.Furthermore, the opportunity to create and display maps quickly enhances the need for flexibilityand, especially, multi-level space analysis. For example, João (98) emphasises the need formulti-scale analysis in environmental impact assessment studies. She also notices that, for timeand cost reasons, users prefer more easily-available data than data with the right level of details.GISs should then offer efficient tools to automatically change level of details.In this paper, we are interested in both representing and learning cartography concepts. Inparticular, we consider the task of (supervised) learning from examples, which consists ininducing an intensional description (a “hypothesis”) of a concept from a set of positive andnegative instances of it (Michalski, 1983; Mitchell, 1982). Because of the complexity of the task,it appears that a modification of the language is a promising approach to increase both accuracyand rule comprehensibility. In Machine Learning, many approaches follow the methodologycalled constructive induction, which tries to augment the initial language with newly inventedfeatures (Muggleton, 1988; Matheus & Rendell, 1989; Wnek & Michalski, 1994; Zucker &Ganascia, 1994). In this context the modification of the language used to represent the learningexamples and the concepts needs also to be learned.

Section 2 presents the difficulty of automating cartographic generalisation process, especiallybecause of the difficulty to formalise cartographic rules. Why machine learning can help toformalise these rules is then explained. Based on the lessons learned in the field of KnowledgeAcquisition (Clancey,83; Thomas et al.,93), the necessity to differentiate, separate, and structurethe different types of knowledge involved in cartography is discussed. As an alternative to theclassical distinction used in cartographic generalisation between geometric, structural andprocedural knowledge (e.g. see Armstrong, 91; Weibel et al, 95), Section 3 proposes acharacterisation of the knowledge used in cartographic generalisation better fitted to itsacquisition. This characterisation is based on the distinction between two dimensions: knowledgeabstraction and knowledge representation (Saitta and Zucker, 98). Section 4 describes acartographic generalisation process developed from the previous considerations and explains howto perform knowledge acquisition. Section 5 describes an experiment to acquire these kind ofknowledge with machine learning techniques.

2 KNOWLEDGE BASED SYSTEM AND KNOWLEDGE ACQUISITION FORCARTOGRAPHIC GENERALISATION

2.1 Problem specification

In the scope of this article, we focus on one of the important problem for automating cartographicgeneralisation: given one geographical object (a bend, a road, a town…) how to represent it on amap, knowing the specifications of this map? Other problems such as "the interrelations betweenobjects" or "the way to link the representation of an object to the specification of a map" (Ruas,99) are beyond the scope of this work.

Several approaches have been proposed to address this problem. In particularly, many operationshave been developed to specific types of object. A test on several generalisation platformsaccounts for these specific operations and for there scope (OEEPE, 98). Many space analysistools have been also developed to describe geographic objects. However, identifying theoperation to apply to a given object is still an open question.

Page 3: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

3

An approach to face the need for automation of cartographic generalisation is to build expertsystems. Indeed, they have proved to be efficient in numerous fields where knowledge requires tobe introduced. There are situations where it is difficult to acquire from experts the knowledgenecessary to the system. This problem is well known as the «knowledge acquisition bottleneck»and has been underlined in the field of cartographic generalisation by Weibel et al (95).

Cartographic knowledge needs to be elicited so as to be used by expert systems for cartographicgeneralisation. Cartographic knowledge acquisition is problematic because cartographic rules arenumerous, contradictory and not formalised. For example, in the rule “enlarge significantly nonlegible road bends so that they become legible but keep the planimetric accuracy and the shape asmuch as possible”, how to formalise “significantly” “enlarge but keep accuracy” “legible”“shape” “as much as possible”, etc. ? Most of the time, cartography experts cannot formalisethese rules in such a way that they become understandable by a computer.

Machine learning techniques are one of the solutions developed in Artificial Intelligence area tosolve this knowledge acquisition bottleneck. Their aim is to automatically build some rules froma set of examples given by an expert. The expert provides some examples in a form of, on the onehand a description of an object and, on the other hand a classification of this object. Machinelearning techniques automatically build some rules from these examples to explain theclassification from the object description. These rules can be then used to classify new examplesprovided to the system.

This paper proposes a knowledge based system to answer to our problem and we study, assuggested by Weibel et al (95), the opportunity to use Machine Learning to support theknowledge acquisition process.

2.2 Second generation expert systems and knowledge acquisition

The ultimate goal of our research being the development of a generic cartographic expert system,this chapter briefly recalls the principles of knowledge acquisition in Artificial Intelligence.

Most traditional expert systems follow the same schema. They contain a knowledge base, a factbase and an inference engine. The knowledge base contains (usually as a set of rules) knowledgeacquired from experts. Each rule is supposed to be a piece of knowledge independent from theother pieces. The fact base contains knowledge describing the initial facts about a problem to besolved. The inference engine is the program building the reasoning of the system by applyingrelevant rules of the knowledge base to the facts of the fact base to deduce new factscorresponding to a possible problem solution. The inference engine is supposed to beindependent from the problem to be solved, for example the order of use of the rules isdetermined by the inference engine with generic techniques valid for all problems.

One of the key application field for Machine Learning has been and still is to automatically buildrules from examples (Bergadano, Giordana and Saitta, 91). Most rule learning systems use asequential covering algorithm where each rule is considered as an independent part of the expertknowledge to be acquired. Critics of first generation expert systems by Clancey (83) showed thatthese traditional systems implicitly mix a lot of different kind of knowledge. In particular they

Page 4: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

4

mix knowledge about basic inferences to be drawn and knowledge about how to organise theseinferences. This mix of knowledge types is a limitation to the easy maintainability of the systemand the comprehensibility of the reasoning done by the system.

These critics paved the way to new researches in knowledge acquisition area, and led to thedevelopment of second generation expert systems (David et al, 93) where control over inferencesto be drawn is considered as a kind of knowledge in itself and explicitly introduced in expertsystems. It becomes necessary to understand inference steps involved in a problem solving (likegeneralising a map) and acquire (by learning or direct Knowledge Acquisition techniques)distinctly knowledge necessary to draw each inference. Rules are then more comprehensible (sothey are easier to understand, validate, and update) and more easily acquired.

The following section first identifies which types of knowledge are involved in cartographicgeneralisation and how they are related each other. A knowledge based system to answer ourcartographic generalisation problem is then designed from this analysis. An approach to useMachine Learning techniques to learn each identified type of knowledge that is not easy todirectly acquire from experts is finally proposed.

3 CARTOGRAPHY IN A KNOWLEDGE REPRESENTATION AND ABSTRACTIONMODEL

3.1 Differentiating representation and abstraction

Representing knowledge is one of the main research topic in AI since its birth. In the past fiftyyears, a large variety of languages that are more or less adapted to represent different field ofhumans knowledge have been proposed (Ginsberg, 93). Although a large amount of humanexpertise can be formulated as a set of specific procedures or inferences in one given language,the cartographic generalisation process clearly requires several knowledge representationlanguages to capture the different types of knowledge manipulated, ranging from the raw data toits final representation as a usable map. We have proposed a model of abstraction (hereaftercalled the KRA model), supporting reasoning in a wide context (Saitta and Zucker, 98). In thismodel, we distinguish two fundamental processes, namely the process of changing the nature ofthe language of representation and the process of abstracting it

The KRA model originates from the observation that the conceptualisation of a domain involvesat least four different levels. Underlying any source of experience there is the world (W), whereconcrete objects reside. However, the world is not really known, because we only have amediated access to it, through our perception. Then, what is important for an observer is not theworld in se, but the perception P(W) that s/he has of it. At this level the percepts «exist» only forthe observer and only during their being perceived. Their reality consists in the “physical” stimuliproduced on the observer. In order to let these stimuli become available over time, for retrievaland further reasoning, they must be memorised and organised into a structure S. This structure isan extensional representation of the perceived world, in which stimuli related one to another arestored together into tables. This set of tables constitutes a relational database. Then, in order tosymbolically describe the perceived world, and to communicate with other agents, a language Lis needed. L allows the perceived world to be described intensionally. Finally, a theory T mightbe needed to reason about the world. The theory may contain general knowledge, which does not

Page 5: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

5

belong to the specific domain, and allows inferences to be drawn. Let us define R = < P(W), S, L,T > as a Reasoning Context. The relationships among the considered levels are represented inFigure 1.

W S=MMMM(P) L=DDDD(S)Description

DDDDP=PPPP(W) T=FFFF(L)

FormalizationFFFF

MemorizationMMMM

PerceptionPPPP

Background Knowledge

Figure 1 Knowledge representation

There is an infinity of ways in which the world can be perceived by an intelligent agent,according to the observation tools, the goal of the observation, the agent’s cultural background,and so on. This variability is captured by the diversity of the world perceptions P(W). It is at thislayer that is established the type and amount of information the agent will memorise, speakabout, and reason about later on. The less detailed the perception, the more abstract. Sometimesthe agent has control over the perception, in such a way to collect exactly the information itneeds to achieve its goals. Sometimes the agent can not control the perception, so that it mayreceive much more information than it currently needs, or maybe it wants to perform severaltasks, each one requiring different pieces of information, which, on the other hand, are easier tocollect together. The preceding considerations suggest that it would be very useful to havemethods to actually or virtually transform a perception into a more abstract one. The abstractionprocess, as defined by Saitta and Zucker (98), starts at the perception level, but propagatestoward the layers of Figure 1. In Figure 2, the view on abstraction on which rely this paper issynthetically described. Within the context of cartographic generalization, the important aspectof this model is to support the idea that abstraction ought to be first defined at the perceptionlevel.

Sa=σσσσ(Sg)Sg=MMMM(PPPPg(W))

PPPPg(W) Pa=ωωωω((((PPPPg(W))

W

Lg=DDDD(Sg) La=λλλλ(Sg)

Tg=FFFF(Lg) Ta=ττττ(Tg)

A

ωωωω

σσσσ

λλλλ

ττττ

Figure 2 Knowledge abstraction and representation

Page 6: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

6

3.2 Cartography in the KRA model

The topographic map production process closely parallels the KRA model, because it can beanalysed according to the two dimensions, representation and abstraction. Let us first consider thescheme of Figure 1 applied to cartography. The first step of cartography is to collect data fromthe geographic world or part of it (W). This is usually done through aerial photographs, satelliteimages or field survey which are the perceived world P(W). Objects contained in thesephotographs are located and labelled to create a geographic database (GDB). This GDB is the setof geographic data organised in a Structure (S). This GDB is then displayed by means ofcartographic symbols applied to objects stored in it. This is the creation of a map, an iconiclanguage (L). Finally, maps are created for specific tasks (e.g. space analysis, search foritineraries, geographic theory construction). The theory (T) is the result of map analysis and isguided by all the background facts and laws allowing one to reason about geographicconfiguration. But, Cartography is not just knowledge representation. All steps of map creation do not onlyinvolve knowledge representation, but also knowledge abstraction, because each step retains onlypart of available information (e.g. a GDB does not contain all information seen on an image). Inparticular, map creation (which contains the generalisation process that we detail in the nextchapter) is both a knowledge representation process, when objects are symbolised, and aknowledge abstraction process, when objects relevant to the theory construction are identified. Sodescribed, map creation is represented as a diagonal process in Figure 3.

S

P(W)

Data Collection

Map creation

Identification

Map analysis

GeographicWorld

Image

Geographic DBor DLM

Geographic theory

Mapor DCM

KnowledgeRepresentationAxis

Knowledge Abstraction Axis(level of details decreases)

L

T

Figure 3. Cartography in the abstraction/representation space

The KRA model exhibits key properties for cartography. It allows the process of representation(change of language) to be distinguished from the process of abstraction (change of level ofdetail). These two processes are usually very much entangled in cartography. This distinctionprovides the basis for automating cartographic knowledge acquisition, as a combined acquisitionof specific knowledge for abstraction and knowledge for changing representation, as we willexplain in the following section.

3.3 Generalisation process in the KRA model

Knowledge abstraction in cartographic generalisation is the identification of abstractedgeographic objects relevant to the theory construction that will be done from the map. By

Page 7: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

7

abstracted object we mean an abstracted description of the characters of the objects. For example,when we consider that the shape of a road is more important than its accurate location, we makean abstraction as we do enhance only one of the properties of the considered objects. Theabstraction process goes from a detailed description of a geographic object, considering each partof the object, to a more abstract description of the object, retaining only properties of the objectrelevant to the map user’s needs.

Knowledge representation in cartographic generalisation is the process of symbolisinggeographical object and is guided by the abstracted object. For example, the representationprocess is to determine how to represent a road (the geographical object) so that its shape (theabstracted object) is well legible. This choice is guided by the necessity to well represent theabstracted properties and is restricted by the drawing possibilities (we cannot represent all thebends of the road and keep the planimetric accuracy as we have to enlarge non legible bends).

It is important to notice that knowledge abstraction and representation are not independent,nor that when abstraction has been done the “ground” GDB is no more necessary. For example,considering a road as a "sinuous road" (the abstracted object) help us to change our view of theworld and to decide how to represent it, but the representation process needs to look again at theground GDB to create a simplified representation of the actual geographic object. In this way weimitate the human perception, which continuously changes the level of abstraction to analysespace. These inter-links between abstraction and representation explain why, manually, thecartographer has always performed these two steps in one time. However, this distinctionbetween abstraction and representation is necessary for the creation of an automated process ofcartographic generalisation. 4 DESIGN OF THE CARTOGRAPHIC KNOWLEDGE BASED SYSTEM

4.1 Knowledge based system description

A simple and preliminary design of a cartographic knowledge based system answering ourproblem (how to represent on a map a geographic object?) could be the step of figure 4.

GeographicObject

CartographicObject

Cartographicgeneralisation

Figure 4 First design of the system

However, as explained in Section 2, knowledge used by the system cannot be well understood inthis process. Moreover, knowledge acquisition for this system is very difficult. This comes fromthe fact that a direct knowledge acquisition from experts is problematic, as shown by researcheson cartographic generalisation (Weibel et al, 95). Machine Learning is also made difficultbecause the description space of inputs and outputs of the system are huge. The more complexthese spaces are, the more examples are needed in order to expect an efficient learning (be it withsymbolic machine learning or neural networks) and the less we can expect good quality learnedrules (see theoretical works on computational learning theory, e.g. Valiant, 84, and practicalexperiments Saitta and Neri 98 ).

Page 8: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

8

According to considerations of section 2 (separating different types of knowledge) and tosection3 (modelling generalisation as a combination of abstraction and representation). A moreappropriate process requires two steps: knowledge abstraction and knowledge representation(figure 5). As defined in section 3, knowledge abstraction input is a geographic object, and itsoutput is an abstracted qualitative description of it. Knowledge representation inputs are theabstracted description (to determine how to represent) and the geographic object itself (toperform the geometric transformation). Knowledge representation output is the cartographicobject representing the geographic object. Nevertheless knowledge acquisition for thisarchitecture is still problematic, for the same reasons than the one explained for the previous one-step architecture (figure 4).

GeographicObject

Qualitative Description

CartographicObject

Abstraction Representation

Figure 5 First refinement of the system

In order to facilitate the knowledge acquisition process we eventually designed a systemarchitecture shown in figure 6.

Page 9: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

9

Applicabilitydetermination

Applicability of Operation

1

GeographicObject

Measurements

Qualitative Description

CartographicObject

MeasuresAbstraction

Expert interview

QualitativeAbstraction

Machine Learningfrom examples

Machine Learning from examples

Transformation Expert interview

Knowledgeacquisitiontechniques

Applicabilitydetermination

Applicability of Operation

3

Applicabilitydetermination

Applicability of Operation

2

ABSTRACTION

REPRESENTATION

KNOWLEDGE

KNOWLEDGE

(a)(a)

(b)

(c)

(d)

Figure 6. Inferences drawn by the system and knowledge acquisition techniques used (Mustiere et al. 98)

The knowledge abstraction is in two steps: the first one (figure 6,a) is the description of an objectas a set of measures (this abstraction is a way to supply the system with characteristics thecartographer may focus on). The measures to be used are made by an expert of the domain. Thedescription of the considered object as a set of measures is therefore greatly simplified over theinitial vector representation. Then the second abstraction step (figure 6,b) goes from this set ofmeasures (surface, elongation…) to a qualitative description of the object. That means that a setof abstracted qualitative descriptors (size, shape…) are determined to enhance the most importantcharacters of the object to deal with (this is the description the cartographer will use to reasonabout).

The knowledge representation step is decomposed in several steps. First, for a given object, and aset of possible transformation operations to apply on it, the system determined if each operationis applicable or not (figure 6,c). A distinct set of rules is necessary to determine the applicabilityof each operation, but this set of rules use the same abstracted description. Then from this set ofoperation applicability the system determine which operation to use (by mean of a strategy to bedefined), and apply it on the geographic object (figure 6,d).

Page 10: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

10

The reason why a one-step knowledge representation has not been chosen is that it is most of thetime impossible for an expert to say which operation is the best to apply on an object. Indeed inmost cases several operations are equally applicable, learning which operation is the best is then a“non-exclusive classes” problem (for one object several classes are possible), which is a ratheroriginal setting for machine learning and is a more difficult problem to handle. Designing thesystem the way we did allows us to work with several “exclusive classes” problem, because theapplicability of an operation cannot be at the same time “applicable” and “not applicable”. Thisdesign allows us to acquire necessary knowledge, as explained in the next chapter.

4.2 Knowledge acquisition for the system

Among the sources of knowledge used to fill in the knowledge based system some may beacquired through interview and others automatically learned (right part of figure 6). This sectionpresents different sources used to acquire this knowledge.

The first abstraction step of the system requires a selection of measures to describe an object.Knowledge necessary to this step can be directly acquired from experts on space analysis whocan determine relevant measures to well describe a type of objects. This knowledge is clearlydifferent for each type of object. For example, if we deal with buildings an expert will tell us toprefer measures of area, elongation, compactness, etc…If we deal with roads an expert will tellus to prefer measures of length, number of bends, etc

The second abstraction step goes from measurements to a qualitative abstraction of the object.Part of this knowledge can be directly acquired from experts in cartography. They can specifywhich characters of the object are relevant. For example, if we deal with buildings a cartographyexpert will tell us that an important character is the shape, and more precisely that rectangle-shaped and L-shaped buildings are both important types of building. But the expert incartography is most of the time unable to link these qualitative descriptions to the previous set ofmeasures. For example, to describe a building an expert will say whether it is small, medium orbig, but he will not be able to say that a small building is one with an area smaller than 300 m2.He is able to show examples of small buildings but usually unable to provide a threshold on thesurface measurement to characterise “small” buildings. We will so therefore learn automaticallythis knowledge from examples, as described in the next part.

Then, for each possible transformation operation, we need to know whether it is applicable oneach given object. Experts are not able to formalise this knowledge because either they do notknow the operations or operations are too complex to be easily controlled. However, if we giveexamples of objects transformed by a specific operation, a cartography expert is always able tosay for each example if the result is acceptable or not and therefore if this operation is applicable.This “applicability” knowledge may be learned from examples, as described in the next part.Finally, the system has to make the choice, from the set of applicable operations, of an operationto effectively apply on an object. Some complex reasoning, acquired from cartography experts,can be done to choose this operation (Ruas 99) but this aspect is out of the scope of this paper.

Knowledge acquisition for each part of the system may not come from the same expert, as someexperts are specialised in space analysis (particularly in measures) and others are more

Page 11: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

11

specialised in cartography (and particularly in generalisation). As the described approachseparates each type of knowledge, it is possible to acquire them from different experts.

5 EXPERTIMENTSThis section presents an experiment performed to automatically learn knowledge that the systemrequires to deal with the particular case of cartographic generalisation of buildings. As specifiedabove both qualitative abstraction knowledge and applicability knowledge are to be learned.

5.1 Experimental settingIn order to collect data necessary for our test, a "space analysis and algorithms expert"2 wasasked to define:- a set of measures describing buildings (see Table 1. below). Algorithms chosen to compute

these measures are defined in Regnauld (97) ,- a set3 of "operations" applicable to buildings (listed in Table 3).

Then a "cartography expert"4 was asked to define a set of qualitative abstract descriptors for agiven building that are somehow related to the above-mentioned measures (see Table 2. below),

Measure Name unit Possible valuesMinimum length m a positive realSurface m2 a positive realMinimum width m a positive realConcavity none a real in [0,1]Compactness none a real in [0,1]Depth of the biggest yard m a positive realElongation none a real in [0,1]Squareness radian a real in [0,π/2]Number of points none a positive integer

Table 1 Measures (Regnauld, 97)

Name Possible values NameGlobal shape Rectangle, L-shape, stair-

like bar, otherSimple dilatation

Size Small, Medium, Big SimplificationNumber of main orientation One, Several SquaringGranularity High, Low Squaring / Simplification / EnlargementImportant details None, Some Simplification / Squaring / EnlargementExistence of special shapes None, Triangle or Circle Table 2 Possible operations

(Regnauld et al, 98)Table 3 Abstracted descriptors

The cartography expert was then provided with a set of 80 observations of buildings and he wasfirst asked to describe each building with the defined qualitative descriptors (this building has theshape “L- shaped”, the size “medium”, it contains “no big wings”…). The same set of buildings,each one generalised by each operation was presented to him and he was asked to say for each

2 N. Regnauld from Edinburgh University3 Regnauld et al. (98) describe this combination of algorithms and show that each combination is efficient on somebuildings, nevertheless they do not know when a particular combination is efficient.4 S. Mustière from IGN-France

Page 12: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

12

generalised building if the result was acceptable or not. Meanwhile the set of measures chosen bythe expert (see Table 1) was computed on each building.

Measurements:Min length: 4.0 m Surface: 715.5 m2Min width: 4.35 mConcavity: 0.69Compactness: 0.38Yard: 32 mElongation: 0.89Squareness: 0.19 radNb points: 15

Abstracted desription:Global Shape: otherSize: mediumMain direction: severalGranularity: highWings: someNeck: noneSpecial shape: None

Accepted operations:No, Yes, Yes, Yes, No, No, No, No

Measurements:Min length: 5.4 m Surface: 6312.4 m2Min width: 7.0 mConcavity: 0.74Compactness: 0.25Yard: 46.2 mElongation: 0.29Squareness: 0.07 radNb points: 37

Abstracted desription:Global Shape: otherSize: bigMain direction: oneGranularity: highWings: someNeck: noneSpecial shape: None

Accepted operations:No No No No Yes No Yes Yes

OriginalBuilding

TransformedBuildings

OriginalBuilding

TransformedBuildings

Example 1

Example 2

Figure 7 Two of the eighty examples of buildings used in the machine learning test

Two examples of buildings are shown in figure 7. Eighty examples were used to first learn howto link measures to abstracted descriptors, then to link abstracted descriptors and applicability toeach operations.

As we intend to learn comprehensible rules, we applied on these examples symbolic machinelearning tools. We used indifferently the C4.5 (Quinlan, 92) and Ripper (Cohen, 95) algorithmsto learn the qualitative abstraction step expertise. We then used Ripper (Cohen, 95) or ENIGME(Thomas et al, 93) to learn the applicability of operations expertise. C4.5 produces a decisiontree, Ripper decision rules, and ENIGME production rules. To select when to use a decision treelearner or a rule learner, we did some empirical tests. We noticed that for our problem rules wereeasier to read for the applicability step.

Some results of these tests (performed on eighty buildings) are presented in the next section.

5.2 Learning results : qualitative evaluation

Page 13: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

13

Due to space limitations, we are unable to comment all the learned rules and did extract some ofthe most typical learned rules. Figure 8 shows a decision tree automatically learned by C4.5 fromexamples to determine the abstracted character Size from a given set of measures. This simpletree demonstrates several advantages of using of Machine Learning and of our approach:- the system learns the relevant measures for determining the size (Area and Concavity).- the system learns relevant thresholds although this is always a difficult problem in

cartography. For example it learns that the expert considered buildings smaller than 137 m2 assmall building (they are actually the individual houses), and buildings bigger than 1168 m2 asbig buildings (most of the time industrial buildings).

- The system learns rules that the expert would not have thought useful to express. For exampleit learns that the concavity influences the perception of buildings size. This is well known inpsychology that the shape of an object influences its size perception, but there are greatchances that an expert does not explicitly express this rule.

Surface

Concavity

Size =Small

Size =Medium

Size =Medium

Size =Big

<137 m2

> 1168 m2

> 137 m2 and < 1168 m2

<0.93

>0.93

Figure 8 Learned decision tree for Size determination

Figure 9 shows some rules automatically learned by ENIGME to determine applicability ofoperations. These rules show some advantages of our approach:- Learned rules are simple (with a maximum of three abstracted characters included in the

premises). They are easy to understand, which is not the case for rules learned in one-steplearning experiments (see Appendix).

- Some rules have been created to determine the applicability of complex operations (like“Squaring and Simplification”) which is not easy to do without machine learning because it isdifficult to control what this operation does.

If Size = Big and Wings = None and Granularity = small and special_shape = nonethen Squaring is applicable

If Shape = Rectangle and Granularity = small then Squaring is applicable

If Shape = Big and Main_direction=Several then Squaring is not applicable

If Size = Big and Special_shape = circle or triangle then Simplification is applicable

If Global_shape = L-shape and Size = Small and Special_shape = circle or triangle then operation « Squaring, Simplification, Enlargement »is applicable

Figure 9 Learned rules for determining operations applicability

Page 14: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

14

Detailed results of the rules learned by Ripper at the different stages b) and c) of figure 6 arepresented in Appendix.

5.3 Learning results : quantitative evaluation

We performed an evaluation of the predictivity of the different rules learned. Because of thesmall number of available examples, we did perform this evaluation by cross-validation (10-fold). Figure 10 summarises the evaluation results of rules learned by Ripper (presented inAppendix). The abstraction rules have in average an error rate of 21% (more or less 5%depending on the abstracted descriptor learned) and the applicability rules have in average anerror rate of 16% (more or less 5% depending on the operator considered). These relatively higherror rates are explained by the fuzziness of the examples.

Measurements

Applicability ofoperations

Qualitative description

Abstraction

Applicabilitydetermination

Average ofErrors

21 %

21 %

16 %

Measurements

Applicability ofoperations

Applicabilitydetermination

Average ofErrors

27 %

Learning applicabilitywith abstraction

Learning applicabilitywithout abstraction

Figure 10 Errors evaluation

For our purpose, we are intersted in evaluating the error rate of the whole process either using“abstraction +applicability determination” or using a direct approach. Let us recall that each stephas also been also evaluated through a cross-validation. Without detailing the results, we cannotice that the process with abstraction has a lower error rate (21%) than the process withoutabstraction (27%). Introducing the abstraction knowledge while learning rules does moreoverincrease the quality of the learned hypothesis.

6 CONCLUSION

This paper contributes to support the view that in complex domain, the learning task may beviewed as a combined process of learning to change the representation and then learning in themore appropriate representation (Michalski, 1994). It is also a contribution to the research on theautomation of cartographic generalisation, and more specifically to the knowledge acquisitionprocess that is required to develop cartographic expert systems. Knowledge acquisition is a keyproblem for cartographic generalisation. We showed that this knowledge acquisition bottleneckmight be overcome by analysing the different types of knowledge involved in the cartographyprocess. Such an analysis based on an abstraction model we developed lead to considercartographic generalisation as a combination of abstraction and representation. First experimental

Page 15: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

15

results on the knowledge acquisition for isolated buildings generalisation are very encouraging.Ongoing work includes an extensive evaluation of the quality of these acquired knowledge aswell as the evaluation of the cartographic quality of new buildings generalised following theexpert system proposed operations.

ACKNOWLEDGEMENTWe wish to thank Nicolas Regnauld, from Edinburgh University, for providing all our test data,and for being our expert who determined relevant operations and measures on buildings.

Page 16: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

16

REFERENCESArmstrong M.P. 1991 “Knowledge Classification and organisation”. in Map Generalization : Making Rules for Knowledge

Representation. McMaster & Buttendfield (eds) Longman Scientific & Technical pp 86-102

Bergadano, F., Giordana A., Saitta L. (1991). Machine Learning: An Integrated Framework and its Application. Chichester : EllisHorwood.

Clancey, W. (1983). “The Epistemology of a Rule Based Expert System – A framework for Explanation”, Artificial Intelligence,Vol. 20, pp. 215-251

Cohen, W. (1995). “Fast Effective Rule Induction.” Proceedings of 12th Machine Learning Conference, 1995

David J.-M., Krivine J.-P. and Simmons R. (eds.) (1993). Second Generation Expert Systems, Springer Verlag.

Ginsberg M., (1993). Essentials of Artificial Intelligence, San Francisco: Academic Press / Morgan Kaufmann.

Matheus, C., Rendell, L. (1989). "Constructive Induction in Decision Trees", Proc. IJCAI-89, (Detroit, USA), pp. 645-650.

Michalski, R. (1983). “A Theory and Methodology of Inductive Learning”. In R. Michalski, T. Mitchell and J. Carbonell (Eds.),Machine Learning, Vol. I, Tioga Publ. Co., Palo Alto, CA, pp. 83-134.

Michalski, R. S. (1994). Inferential Theory of Learning: Developping foundations for Multistrategy Learning. Machine Learning:An artificial intelligence approach. R. Michalski and G. Tecuci. San Mateo, CA, Morgan Kaufmann. IV: 3-61.

Mitchell, T. (1982). "Generalization as Search", Artificial Intelligence,18, 203-226.

Muggleton, S., Buntine, W. (1988). "Machine Invention of First-Order Predicates by Inverting Resolution", Proc. Fifth Int. Conf.on Machine Learning, (Ann Arbor, MI), pp.339-352.

Mustière, S., J.-D. Zucker , L. Saitta (2000). An Abstraction-Based Machine Learning approach to Cartographic Generalization.Spatial Data Handling 2000 (SDH), Beijing, CHINA.

OEEPE (1998 ) “First results on the OEEPE test on generalisation” OEEPE Newsletter 1998 (1)

Quinlan J.R. (1992). C4.5 : Programs for Machine Learning. Morgan Kaufmann Series in Machine Learning.

Regnauld N. (1997). Généralisation du bâti: Structure spatiale de type graphe et représentation cartographique. PhD Thesis.LIRM – Marseille, France

Regnauld N., Edwardes A. and Barrault M., (1998) “Strategies in Building Generalisation: Modelling the Sequence, Constrainingthe Choice”. ICA Workshop on Progress in Automated Map Generalization. Ottawa

Ruas, A. (1997). “Strategies for Urban Map Generalisation”. Proceedings of ICC Vol. 3 pp. 1387-1394

Ruas, A. (1999). Modèles de généralisation de données géographiques à base de contraintes et d’autonomie. PhD Thesis,University Marne-la-Vallée, France.

Saitta L., Zucker J.-D. (1998). “Semantic Abstraction for Concept Representation and Learning”. Proceedings Symposium onAbstraction, Reformulation and Approximation Pacific Grove, California

Saitta, L. and Neri F. (1998). “Learning in the "Real World".” Machine Learning 30(2/3): 133-163, February 1998.

Thomas, J., P. Laublet, et J-G. Ganascia. (1993). A Machine Learning Tool Designed for a Model-Based Knowledge AcquisitionApproach,. EKAW’93,, Toulouse-Caylus, France, Springer-Verlag.

Valiant, L. (1984). “A theory of the learnable”. Communications of the ACM, 27(11), p. 1134-1142

Weibel R., Keller S. & Reichenbacher T. (1995) “Overcoming the Knowledge Acquisition Bottleneck in Map Generalization :the Role of Interactive Systems and Computionnal Intelligence.” Proceedings of COSIT’95, p. 139-156.

Wneck J. and Michalski R. (1994). “Hypothesis-Driven Constructive Induction in AQ17-HCI: A Method and Experiments”.Machine Learning, 14, 139-168.

Zucker, J.-D., and Ganascia, J.-G. 1994. Selective Reformulation of Examples in Concept Learning. Proc. InternationalConference on Machine Learning, pp. 352-360, New-Brunswick, Morgan Kaufmann.

Nyerges (1991) Representing Geographic Meaning in Map Generalization Ed Longman Scientific & Technical pp 59-85

Page 17: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

17

Appendix

This appendix shows detailed results of knowledge learned in our buildings examples. Table 1shows “applicability” rules learned if no abstraction is performed. Table 2 shows rules learned toexplain the abstracted description of an object from the measures (the concrete language). Table 3shows rules learned to explain the “applicability” of each operation from the abstracted language.

Table 2 and 3 (corresponding to the scheme ofGeographic

ObjectQualitative Description

CartographicObject

Abstraction Representation

Figure 5) contain rules more understandable by cartography experts than rules of Table 1

(corresponding to the scheme of

GeographicObject

CartographicObject

Cartographicgeneralisation

Figure 4)

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SIMPLE-DILATIONyes :- Compacity>=0.745, Squareness<=0.11 (8/2).yes :- Area>=83.33 (1/0).default no (68/0).

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SIMPLIFICATIONyes :- Elongation>=0.639 (23/9).yes :- Area>=15.96, Compacity<=0.369 (7/0).yes :- Concavity>=0.856, Area>=2.32 (6/0).yes :- Depth>=0.339, Area<=1.11 (2/0).default no (31/1).

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SIMPLIF.-SQUAR.-ENLARG.yes :- Concavity>=0.856 (23/3).yes :- Depth>=0.345, Nb_points<=14 (7/1).yes :- Area>=52.55 (2/0).default no (40/3).

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SQUARINGyes :- Depth<=0.304, Elongation<=0.529 (14/4).yes :- Min_length>=0.86 (12/0).yes :- Area<=4.94, Area>=1.81, Area<=2.41 (5/3).yes :- Area<=4.94, Nb_points>=16 (3/1).yes :- Area<=0.87, Min_length>=0.55 (2/0).yes :- Concavity<=0.674, Min_length>=0.34 (2/1).default no (32/0).

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SQUAR.-SIMPLIF.-ENLARG.yes :- Squareness<=0.23, Squareness<=0.05 (7/0).yes :- Concavity>=0.863 (13/6).yes :- Min_width<=0.78, Min_width>=0.68, Depth>=0.282 (7/2).default no (37/7).

Table 1. Rules describing the applicability in the concrete language

RULES LEARNED TO CHARACTERISE THE ATTRIBUTE GLOBAL-SHAPEStair-like bar :- Elongation<=0.392, Area<=5.76 (6/1).

Page 18: Learning Abstraction and Representation Knowledge: an ......does not correspond to the term of generalization used in Artificial Intelligence. Its meaning is much more related to the

18

L-shape :- Depth>=0.25, Nb_points<=8 (5/0).Rectangle :- Depth<=0.207 (20/3).default Other (32/12).

RULES LEARNED TO CHARACTERISE THE ATTRIBUTE SIZESmall :- Area<=1.37 (16/1).Medium :- Area<=5.76 (28/2).default Big (30/2).

RULES LEARNED TO CHARACTERISE THE ATTRIBUTE SPECIAL-SHAPETriangle_or_Circle :- Min_length<=0.38, Nb_points>=31 (7/1).default None (62/9).

RULES LEARNED TO CHARACTERISE THE ATTRIBUTE GRANULARITY

High :- Min_length>=0.86 (12/0).High :- Area<=0.29 (2/0).default Low (60/0).

RULES LEARNED TO CHARACTERISE THE ATTRIBUTE IMPORTANT DETAIL

Some :- Depth>=0.178, Min_width>=0.75 (24/10).default none (32/13).

Table 2. Rules describing the abstract language in the concrete language

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SIMPLE DILATIONyes :-granularity=high, global_shape=rectangle, special_shape=none, size=small (5/0).yes :- size=small, global_shape=L-shape (1/0).default no (70/3).

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SIMPLIFICATIONyes :- size=big (20/10).yes :- mportant_detail=some, special_shape=none (16/7).default no (23/3).

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SIMPLIF.-SQUAR.-ENLARG.yes :- important_detail=none, global_shape=rectangle (21/0).yes :- important_detail=none, special_shape=none (9/3).yes :- global_shape=rectangle, size=big (1/0).yes :- global_shape=L-shape, size=medium (2/1).default no (40/2).

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SQUARINGyes :- granularity=igh (15/2).yes :- size=medium, important_detail=some (17/3).yes :- global_shape=L-shape, size=small (1/0).yes :- important_detail=some, global_shape=L-shape (2/0).yes :- global-shape=other, size=medium, special_shape=triangle_or_circle (1/0).default no (36/2).

RULES LEARNED TO CHARACTERISE THE APPLICAIBILITY OF THE ALGORITHM SQUAR.-SIMPLIF.-ENLARG.yes :- important_detail=none (29/13).default no (32/5).

Table 3. Rules describing the applicability in the abstract language