footprints in instance space: steps otwards a free lunch · introduction methodology (tsp case...

101

Upload: others

Post on 09-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Footprints in Instance Space:Steps Towards a Free Lunch

Kate Smith-Miles

School of Mathematical SciencesMonash University

South Paci�c Optimisation Meeting (SPOM), Newcastle Univ., Feb. 2013

Footprints in Instance Space 1 / 57

Page 2: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Acknowledgements

This research is funded by ARC Discovery Project grant

DP120103678 �Footprints in Instance Space: Visualising the

Suitability of Optimisation Algorithms�

The Travelling Salesman results are based on work with Dr.

Jano van Hemert (University of Edinburgh), and my former

student Thomas T. Tan

The Graph Colouring results are based on work with Dr.

Davaa Baatar (Monash University), Dr. Rhyd Lewis

(University of Cardi�), Dr. Leo Lopes (SAS, USA), and former

students Nur Insani and Brendan Wreford

Thanks to Prof. Gordon Royle (University of Western

Australia) for informing us about the �energy� of a graph

Footprints in Instance Space 2 / 57

Page 3: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Acknowledgements

This research is funded by ARC Discovery Project grant

DP120103678 �Footprints in Instance Space: Visualising the

Suitability of Optimisation Algorithms�

The Travelling Salesman results are based on work with Dr.

Jano van Hemert (University of Edinburgh), and my former

student Thomas T. Tan

The Graph Colouring results are based on work with Dr.

Davaa Baatar (Monash University), Dr. Rhyd Lewis

(University of Cardi�), Dr. Leo Lopes (SAS, USA), and former

students Nur Insani and Brendan Wreford

Thanks to Prof. Gordon Royle (University of Western

Australia) for informing us about the �energy� of a graph

Footprints in Instance Space 2 / 57

Page 4: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Acknowledgements

This research is funded by ARC Discovery Project grant

DP120103678 �Footprints in Instance Space: Visualising the

Suitability of Optimisation Algorithms�

The Travelling Salesman results are based on work with Dr.

Jano van Hemert (University of Edinburgh), and my former

student Thomas T. Tan

The Graph Colouring results are based on work with Dr.

Davaa Baatar (Monash University), Dr. Rhyd Lewis

(University of Cardi�), Dr. Leo Lopes (SAS, USA), and former

students Nur Insani and Brendan Wreford

Thanks to Prof. Gordon Royle (University of Western

Australia) for informing us about the �energy� of a graph

Footprints in Instance Space 2 / 57

Page 5: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Acknowledgements

This research is funded by ARC Discovery Project grant

DP120103678 �Footprints in Instance Space: Visualising the

Suitability of Optimisation Algorithms�

The Travelling Salesman results are based on work with Dr.

Jano van Hemert (University of Edinburgh), and my former

student Thomas T. Tan

The Graph Colouring results are based on work with Dr.

Davaa Baatar (Monash University), Dr. Rhyd Lewis

(University of Cardi�), Dr. Leo Lopes (SAS, USA), and former

students Nur Insani and Brendan Wreford

Thanks to Prof. Gordon Royle (University of Western

Australia) for informing us about the �energy� of a graph

Footprints in Instance Space 2 / 57

Page 6: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

No-Free-Lunch Theorem (Wolpert & Macready, 1997)

Standard practice is to use benchmark instances of

optimisation problems to report strengths (rarely weaknesses!)

of algorithms.

NFL Theorem warns us against expecting a single algorithm to

perform well on all instances of a problem, regardless of their

structure and characteristics.

The properties (or measurable features) of an instance tell us a

lot about how an algorithm is expected to perform across a

range of instances.

Reference

Smith-Miles, K. & Lopes, L., �Measuring Instance Di�culty for CombinatorialOptimization Problems�, Comp. & Oper. Res., vol. 39(5), pp. 875-889, 2012.

Footprints in Instance Space 3 / 57

Page 7: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

No-Free-Lunch Theorem (Wolpert & Macready, 1997)

Standard practice is to use benchmark instances of

optimisation problems to report strengths (rarely weaknesses!)

of algorithms.

NFL Theorem warns us against expecting a single algorithm to

perform well on all instances of a problem, regardless of their

structure and characteristics.

The properties (or measurable features) of an instance tell us a

lot about how an algorithm is expected to perform across a

range of instances.

Reference

Smith-Miles, K. & Lopes, L., �Measuring Instance Di�culty for CombinatorialOptimization Problems�, Comp. & Oper. Res., vol. 39(5), pp. 875-889, 2012.

Footprints in Instance Space 3 / 57

Page 8: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

No-Free-Lunch Theorem (Wolpert & Macready, 1997)

Standard practice is to use benchmark instances of

optimisation problems to report strengths (rarely weaknesses!)

of algorithms.

NFL Theorem warns us against expecting a single algorithm to

perform well on all instances of a problem, regardless of their

structure and characteristics.

The properties (or measurable features) of an instance tell us a

lot about how an algorithm is expected to perform across a

range of instances.

Reference

Smith-Miles, K. & Lopes, L., �Measuring Instance Di�culty for CombinatorialOptimization Problems�, Comp. & Oper. Res., vol. 39(5), pp. 875-889, 2012.

Footprints in Instance Space 3 / 57

Page 9: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

No-Free-Lunch Theorem (Wolpert & Macready, 1997)

Standard practice is to use benchmark instances of

optimisation problems to report strengths (rarely weaknesses!)

of algorithms.

NFL Theorem warns us against expecting a single algorithm to

perform well on all instances of a problem, regardless of their

structure and characteristics.

The properties (or measurable features) of an instance tell us a

lot about how an algorithm is expected to perform across a

range of instances.

Reference

Smith-Miles, K. & Lopes, L., �Measuring Instance Di�culty for CombinatorialOptimization Problems�, Comp. & Oper. Res., vol. 39(5), pp. 875-889, 2012.

Footprints in Instance Space 3 / 57

Page 10: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Travelling Salesman Problem (TSP) Example

Easy Hard

Footprints in Instance Space 4 / 57

Page 11: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

What makes the TSP easy or hard?

A TSP Formulation (not the only one)

Let Xi ,j = 1 if city i is followed by city j in the tour; 0 otherwise

minimiseN

∑i=1

N

∑j=1

Di ,jXi ,j

subject to

∑i

Xi ,j = 1 ∀j

∑j

Xi ,j = 1 ∀i

∑i∈S

∑j∈S

Xi ,j ≤ |S |−1 ∀S 6= {0},S ⊂ {1,2, . . . ,N}

TSP is NP-hard, but some instances are easy depending on

properties of the inter-city distance matrix D

Footprints in Instance Space 5 / 57

Page 12: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

What makes the TSP easy or hard?

A TSP Formulation (not the only one)

Let Xi ,j = 1 if city i is followed by city j in the tour; 0 otherwise

minimiseN

∑i=1

N

∑j=1

Di ,jXi ,j

subject to

∑i

Xi ,j = 1 ∀j

∑j

Xi ,j = 1 ∀i

∑i∈S

∑j∈S

Xi ,j ≤ |S |−1 ∀S 6= {0},S ⊂ {1,2, . . . ,N}

TSP is NP-hard, but some instances are easy depending on

properties of the inter-city distance matrix D

Footprints in Instance Space 5 / 57

Page 13: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Questions

How do instance features help us understand the strengths and

weaknesses of optimisation algorithms?

How can we infer and visualise algorithm performance across a

huge �instance space�?

How can we measure objectively the relative performance of

algorithms?

How easy or hard are the benchmark instances in the

literature?

Footprints in Instance Space 6 / 57

Page 14: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Questions

How do instance features help us understand the strengths and

weaknesses of optimisation algorithms?

How can we infer and visualise algorithm performance across a

huge �instance space�?

How can we measure objectively the relative performance of

algorithms?

How easy or hard are the benchmark instances in the

literature?

Footprints in Instance Space 6 / 57

Page 15: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Questions

How do instance features help us understand the strengths and

weaknesses of optimisation algorithms?

How can we infer and visualise algorithm performance across a

huge �instance space�?

How can we measure objectively the relative performance of

algorithms?

How easy or hard are the benchmark instances in the

literature?

Footprints in Instance Space 6 / 57

Page 16: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Questions

How do instance features help us understand the strengths and

weaknesses of optimisation algorithms?

How can we infer and visualise algorithm performance across a

huge �instance space�?

How can we measure objectively the relative performance of

algorithms?

How easy or hard are the benchmark instances in the

literature?

Footprints in Instance Space 6 / 57

Page 17: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Aims

Develop a new methodology to

I visualise �instance space� based on instance featuresI visualise algorithm performance across the instance spaceI de�ne where algorithm performance is expected to be �good�(called the �algorithm footprint�)

I measure the relative size of an algorithm's footprint

Enable objective assessment of the power of optimisation

algorithms.

Understand and report the boundary of good performance of

an algorithm � essential for good research practice, and to

avoid deployment disasters.

Footprints in Instance Space 7 / 57

Page 18: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Aims

Develop a new methodology to

I visualise �instance space� based on instance featuresI visualise algorithm performance across the instance spaceI de�ne where algorithm performance is expected to be �good�(called the �algorithm footprint�)

I measure the relative size of an algorithm's footprint

Enable objective assessment of the power of optimisation

algorithms.

Understand and report the boundary of good performance of

an algorithm � essential for good research practice, and to

avoid deployment disasters.

Footprints in Instance Space 7 / 57

Page 19: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Aims

Develop a new methodology to

I visualise �instance space� based on instance featuresI visualise algorithm performance across the instance spaceI de�ne where algorithm performance is expected to be �good�(called the �algorithm footprint�)

I measure the relative size of an algorithm's footprint

Enable objective assessment of the power of optimisation

algorithms.

Understand and report the boundary of good performance of

an algorithm � essential for good research practice, and to

avoid deployment disasters.

Footprints in Instance Space 7 / 57

Page 20: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Algorithm Selection Problem, Rice (1976)

Footprints in Instance Space 8 / 57

Page 21: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications of Rice's Framework: PDEs

Rice and colleagues used this approach to predict the

performance of the many methods (A) for numerical solution

of elliptic partial di�erential equations (PDEs).

Reference

Weerana, Rice, et al., �PYTHIA: a knowledge-based system to select scienti�calgorithms�, ACM Trans. on Math. Software, vol. 22(4), pp. 447-468, 1996.

PYTHIA matches the characteristics (F ) of a given problem

(x) with those of PDEs in an existing problem population (P)

It then uses learned performance pro�les (S) of the varioussolvers to select the appropriate method given user-speci�ed

error and solution time bounds (Y ).

Footprints in Instance Space 9 / 57

Page 22: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications of Rice's Framework: PDEs

Rice and colleagues used this approach to predict the

performance of the many methods (A) for numerical solution

of elliptic partial di�erential equations (PDEs).

Reference

Weerana, Rice, et al., �PYTHIA: a knowledge-based system to select scienti�calgorithms�, ACM Trans. on Math. Software, vol. 22(4), pp. 447-468, 1996.

PYTHIA matches the characteristics (F ) of a given problem

(x) with those of PDEs in an existing problem population (P)

It then uses learned performance pro�les (S) of the varioussolvers to select the appropriate method given user-speci�ed

error and solution time bounds (Y ).

Footprints in Instance Space 9 / 57

Page 23: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications of Rice's Framework: PDEs

Rice and colleagues used this approach to predict the

performance of the many methods (A) for numerical solution

of elliptic partial di�erential equations (PDEs).

Reference

Weerana, Rice, et al., �PYTHIA: a knowledge-based system to select scienti�calgorithms�, ACM Trans. on Math. Software, vol. 22(4), pp. 447-468, 1996.

PYTHIA matches the characteristics (F ) of a given problem

(x) with those of PDEs in an existing problem population (P)

It then uses learned performance pro�les (S) of the varioussolvers to select the appropriate method given user-speci�ed

error and solution time bounds (Y ).

Footprints in Instance Space 9 / 57

Page 24: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications of Rice's Framework: Machine Learning

For the last two decades, the �eld of meta-learning (learning

about learning algorithms) has emerged.

Here, measurable features of classi�cation and prediction

problems are used to predict the performance of machine

learning algorithms.

While Rice recommended regression models to model the

relationship between features and algorithm performance, we

can apply more powerful statistical or machine learning

methods to this task.

Reference

Smith-Miles, K. A., �Cross-disciplinary perspectives on meta-learning for algorithmselection�, ACM Computing Surveys, vol. 41(1), 2008.

Footprints in Instance Space 10 / 57

Page 25: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications of Rice's Framework: Machine Learning

For the last two decades, the �eld of meta-learning (learning

about learning algorithms) has emerged.

Here, measurable features of classi�cation and prediction

problems are used to predict the performance of machine

learning algorithms.

While Rice recommended regression models to model the

relationship between features and algorithm performance, we

can apply more powerful statistical or machine learning

methods to this task.

Reference

Smith-Miles, K. A., �Cross-disciplinary perspectives on meta-learning for algorithmselection�, ACM Computing Surveys, vol. 41(1), 2008.

Footprints in Instance Space 10 / 57

Page 26: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications of Rice's Framework: Machine Learning

For the last two decades, the �eld of meta-learning (learning

about learning algorithms) has emerged.

Here, measurable features of classi�cation and prediction

problems are used to predict the performance of machine

learning algorithms.

While Rice recommended regression models to model the

relationship between features and algorithm performance, we

can apply more powerful statistical or machine learning

methods to this task.

Reference

Smith-Miles, K. A., �Cross-disciplinary perspectives on meta-learning for algorithmselection�, ACM Computing Surveys, vol. 41(1), 2008.

Footprints in Instance Space 10 / 57

Page 27: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications of Rice's Framework: Machine Learning

For the last two decades, the �eld of meta-learning (learning

about learning algorithms) has emerged.

Here, measurable features of classi�cation and prediction

problems are used to predict the performance of machine

learning algorithms.

While Rice recommended regression models to model the

relationship between features and algorithm performance, we

can apply more powerful statistical or machine learning

methods to this task.

Reference

Smith-Miles, K. A., �Cross-disciplinary perspectives on meta-learning for algorithmselection�, ACM Computing Surveys, vol. 41(1), 2008.

Footprints in Instance Space 10 / 57

Page 28: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Extending Rice's Framework

Footprints in Instance Space 11 / 57

Page 29: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications to Optimisation

Represents a new direction for the OR community.

Much needed, given

I huge range of algorithmsI frequent statements like �currently there is still a strong lack of. . . understanding of how exactly the relative performance ofdi�erent meta-heuristics depends on instance characteristics.�

Can also resolve longstanding debate about how instance

choice a�ects evaluation of algorithm performance

Reference

Hooker, J.N., �Testing heuristics: We have it all wrong�, Journal of Heuristics, vol. 1,pp. 33-42, 1995.

Footprints in Instance Space 12 / 57

Page 30: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications to Optimisation

Represents a new direction for the OR community.

Much needed, given

I huge range of algorithmsI frequent statements like �currently there is still a strong lack of. . . understanding of how exactly the relative performance ofdi�erent meta-heuristics depends on instance characteristics.�

Can also resolve longstanding debate about how instance

choice a�ects evaluation of algorithm performance

Reference

Hooker, J.N., �Testing heuristics: We have it all wrong�, Journal of Heuristics, vol. 1,pp. 33-42, 1995.

Footprints in Instance Space 12 / 57

Page 31: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

Applications to Optimisation

Represents a new direction for the OR community.

Much needed, given

I huge range of algorithmsI frequent statements like �currently there is still a strong lack of. . . understanding of how exactly the relative performance ofdi�erent meta-heuristics depends on instance characteristics.�

Can also resolve longstanding debate about how instance

choice a�ects evaluation of algorithm performance

Reference

Hooker, J.N., �Testing heuristics: We have it all wrong�, Journal of Heuristics, vol. 1,pp. 33-42, 1995.

Footprints in Instance Space 12 / 57

Page 32: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

TSP Algorithms

Amongst the many algorithms, we consider two variants of the

famous Lin-Kernighan heuristic

I Lin-Kernighan with Cluster Compensation (LKCC)I chained Lin-Kernighan (CLK)

Lin-Kernighan is an edge-swapping heuristic

Which algorithm is better, and for which types of instances?

Footprints in Instance Space 13 / 57

Page 33: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

MotivationAimsFramework

TSP Algorithms

Amongst the many algorithms, we consider two variants of the

famous Lin-Kernighan heuristic

I Lin-Kernighan with Cluster Compensation (LKCC)I chained Lin-Kernighan (CLK)

Lin-Kernighan is an edge-swapping heuristic

Which algorithm is better, and for which types of instances?

Footprints in Instance Space 13 / 57

Page 34: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 1: Summarising Instances as Feature Vectors

There are many statistical properties of an instance that

correlate with �hardness� of an instance

I generic properties like number of variables, constraintsI problem-speci�c properties like eigenvalues of adjacency matrixin graph colouring; slackness in rooms for timetabling; etc.

Suppose an instance is summarised by n (topology preserving)

features:

I each instance can be represented as a point in RnI two instances that are similar should be close in RnI two instances that are very dissimilar should be far apart in Rn

Similar instances elicit similar behaviour from algorithms,

except where we observe a phase transition

Footprints in Instance Space 14 / 57

Page 35: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 1: Summarising Instances as Feature Vectors

There are many statistical properties of an instance that

correlate with �hardness� of an instance

I generic properties like number of variables, constraintsI problem-speci�c properties like eigenvalues of adjacency matrixin graph colouring; slackness in rooms for timetabling; etc.

Suppose an instance is summarised by n (topology preserving)

features:

I each instance can be represented as a point in RnI two instances that are similar should be close in RnI two instances that are very dissimilar should be far apart in Rn

Similar instances elicit similar behaviour from algorithms,

except where we observe a phase transition

Footprints in Instance Space 14 / 57

Page 36: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 1: Summarising Instances as Feature Vectors

There are many statistical properties of an instance that

correlate with �hardness� of an instance

I generic properties like number of variables, constraintsI problem-speci�c properties like eigenvalues of adjacency matrixin graph colouring; slackness in rooms for timetabling; etc.

Suppose an instance is summarised by n (topology preserving)

features:

I each instance can be represented as a point in RnI two instances that are similar should be close in RnI two instances that are very dissimilar should be far apart in Rn

Similar instances elicit similar behaviour from algorithms,

except where we observe a phase transition

Footprints in Instance Space 14 / 57

Page 37: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 1: TSP

A comprehensive set of n = 40 features were used to

summarize the properties of TSP instances:

I Standard deviation of inter-city distancesI Radius of TSP instance (mean distance from cities to centroid)I Fraction of distinct distances in the distance matrixI Rectangular area in which the cities lieI Normalized variance and coe�cient of variation of the nNNd's(normalized nearest neighbor distances)

I Number of clusters found using GDBSCAN, max=10I Cluster ratio (number of clusters to the number of cities)I Outlier ratio (number of outliers to number of cities)I Variance of the number of cities in each clusterI Ratio of nodes near the edges of the planeI Mean radius of the clustersI etc.

Footprints in Instance Space 15 / 57

Page 38: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 2: Evolving Easy and Hard Instances

To see the strengths and weaknesses of algorithms we need

diverse instance, well-spread across feature spaceI also important for statistical generalisation

Randomly generated or benchmark instances are frequently not

diverse enough on the spectrum of di�culty

We also generate instances that have been evolved from

random instances to be intentionally easy or hard for a given

algorithm under considerationI to create hard instances, the �tness function can be therun-time to �nd an optimal solution, or the solution gap to aknown optimal solution after a �xed run-time

I inverse for easy instancesI take ratio of one algorithm's performance to another togenerate uniquely easy or hard instances

Footprints in Instance Space 16 / 57

Page 39: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 2: Evolving Easy and Hard Instances

To see the strengths and weaknesses of algorithms we need

diverse instance, well-spread across feature spaceI also important for statistical generalisation

Randomly generated or benchmark instances are frequently not

diverse enough on the spectrum of di�culty

We also generate instances that have been evolved from

random instances to be intentionally easy or hard for a given

algorithm under considerationI to create hard instances, the �tness function can be therun-time to �nd an optimal solution, or the solution gap to aknown optimal solution after a �xed run-time

I inverse for easy instancesI take ratio of one algorithm's performance to another togenerate uniquely easy or hard instances

Footprints in Instance Space 16 / 57

Page 40: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 2: Evolving Easy and Hard Instances

To see the strengths and weaknesses of algorithms we need

diverse instance, well-spread across feature spaceI also important for statistical generalisation

Randomly generated or benchmark instances are frequently not

diverse enough on the spectrum of di�culty

We also generate instances that have been evolved from

random instances to be intentionally easy or hard for a given

algorithm under considerationI to create hard instances, the �tness function can be therun-time to �nd an optimal solution, or the solution gap to aknown optimal solution after a �xed run-time

I inverse for easy instancesI take ratio of one algorithm's performance to another togenerate uniquely easy or hard instances

Footprints in Instance Space 16 / 57

Page 41: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Evolving Instances

Reference

Smith-Miles, K. & van Hemert, J., �Discovering the Suitability of OptimisationAlgorithms by Learning from Evolved Instances�, Annals of Mathematics and Arti�cial

Intelligence, vol. 61, no. 2, pp. 87-104, 2011.

Footprints in Instance Space 17 / 57

Page 42: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 2: TSP

We consider only N = 100 city TSP instances, and have three

types of instances:

I randomly generated: 190 instances randomly placing 100 citiesin a 400 x 400 plane

I TSPLIB benchmark instances: 6 instances with exactly 100cities (kroA100, kroB100, kroC100, kroD100, kroE100, andrd100)

I evolved easy and hard for each algorithm: 190 instances ofeasy and hard for each of CLK and LKCC

I evolved uniquely easy and hard for each algorithm: 190instances that are easy for CLK but hard for LKCC, and 190instances that are hard for CLK but easy for LKCC

In total we have 1336 100-city TSP instances that are

intentionally diverse

Footprints in Instance Space 18 / 57

Page 43: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 2: TSP

We consider only N = 100 city TSP instances, and have three

types of instances:

I randomly generated: 190 instances randomly placing 100 citiesin a 400 x 400 plane

I TSPLIB benchmark instances: 6 instances with exactly 100cities (kroA100, kroB100, kroC100, kroD100, kroE100, andrd100)

I evolved easy and hard for each algorithm: 190 instances ofeasy and hard for each of CLK and LKCC

I evolved uniquely easy and hard for each algorithm: 190instances that are easy for CLK but hard for LKCC, and 190instances that are hard for CLK but easy for LKCC

In total we have 1336 100-city TSP instances that are

intentionally diverse

Footprints in Instance Space 18 / 57

Page 44: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 3: Visualising Instances in Feature Space

Suppose we have a set of m instances and n measurable

features for each instanceI we store the data in a matrix X ∈ Rm×n

We use Principal Components Analysis (PCA) to project X

from Rm×n to Rm×2

I each instance can now be visualised in R2 (axes are the toptwo eigenvectors of XTX )

I the noise in the data has been reducedI essential relationships de�ning similarities and di�erencesbetween instances are preserved

I we call this 2-d projected feature space the instance space

If similar instances are not grouped together in instance space,

we must question the discriminatory power of our feature set

and revisit Step 1.

Footprints in Instance Space 19 / 57

Page 45: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 3: Visualising Instances in Feature Space

Suppose we have a set of m instances and n measurable

features for each instanceI we store the data in a matrix X ∈ Rm×n

We use Principal Components Analysis (PCA) to project X

from Rm×n to Rm×2

I each instance can now be visualised in R2 (axes are the toptwo eigenvectors of XTX )

I the noise in the data has been reducedI essential relationships de�ning similarities and di�erencesbetween instances are preserved

I we call this 2-d projected feature space the instance space

If similar instances are not grouped together in instance space,

we must question the discriminatory power of our feature set

and revisit Step 1.

Footprints in Instance Space 19 / 57

Page 46: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 3: Visualising Instances in Feature Space

Suppose we have a set of m instances and n measurable

features for each instanceI we store the data in a matrix X ∈ Rm×n

We use Principal Components Analysis (PCA) to project X

from Rm×n to Rm×2

I each instance can now be visualised in R2 (axes are the toptwo eigenvectors of XTX )

I the noise in the data has been reducedI essential relationships de�ning similarities and di�erencesbetween instances are preserved

I we call this 2-d projected feature space the instance space

If similar instances are not grouped together in instance space,

we must question the discriminatory power of our feature set

and revisit Step 1.

Footprints in Instance Space 19 / 57

Page 47: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 3: TSP Instances in Instance Space

Footprints in Instance Space 20 / 57

Page 48: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 4: Visualising Algorithm Footprints in Instance Space

We can now superimpose the performance of algorithms in the

instance space by evaluating them on our m instances

We �rst need to de�ne �good� performance, where the

algorithm solves an instance easily (e.g. 1% optimality gap)

All instances are labelled 1 (�good�) or 0 (�bad�), and we

create a binary matrix Y ∈ Rm×|A|, where |A| is the number of

algorithms we are considering

For a given algorithm, we consider points labelled as good, andI remove outliers through clustering,I calculate the convex hull to de�ne a generalised area ofexpected good performance

I remove the convex hull of contradicting pointsI validate the accuracy of the remaining �footprint� throughout-of-sample testing

Footprints in Instance Space 21 / 57

Page 49: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 4: Visualising Algorithm Footprints in Instance Space

We can now superimpose the performance of algorithms in the

instance space by evaluating them on our m instances

We �rst need to de�ne �good� performance, where the

algorithm solves an instance easily (e.g. 1% optimality gap)

All instances are labelled 1 (�good�) or 0 (�bad�), and we

create a binary matrix Y ∈ Rm×|A|, where |A| is the number of

algorithms we are considering

For a given algorithm, we consider points labelled as good, andI remove outliers through clustering,I calculate the convex hull to de�ne a generalised area ofexpected good performance

I remove the convex hull of contradicting pointsI validate the accuracy of the remaining �footprint� throughout-of-sample testing

Footprints in Instance Space 21 / 57

Page 50: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 4: Visualising Algorithm Footprints in Instance Space

We can now superimpose the performance of algorithms in the

instance space by evaluating them on our m instances

We �rst need to de�ne �good� performance, where the

algorithm solves an instance easily (e.g. 1% optimality gap)

All instances are labelled 1 (�good�) or 0 (�bad�), and we

create a binary matrix Y ∈ Rm×|A|, where |A| is the number of

algorithms we are considering

For a given algorithm, we consider points labelled as good, andI remove outliers through clustering,I calculate the convex hull to de�ne a generalised area ofexpected good performance

I remove the convex hull of contradicting pointsI validate the accuracy of the remaining �footprint� throughout-of-sample testing

Footprints in Instance Space 21 / 57

Page 51: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 4: Visualising Algorithm Footprints in Instance Space

We can now superimpose the performance of algorithms in the

instance space by evaluating them on our m instances

We �rst need to de�ne �good� performance, where the

algorithm solves an instance easily (e.g. 1% optimality gap)

All instances are labelled 1 (�good�) or 0 (�bad�), and we

create a binary matrix Y ∈ Rm×|A|, where |A| is the number of

algorithms we are considering

For a given algorithm, we consider points labelled as good, andI remove outliers through clustering,I calculate the convex hull to de�ne a generalised area ofexpected good performance

I remove the convex hull of contradicting pointsI validate the accuracy of the remaining �footprint� throughout-of-sample testing

Footprints in Instance Space 21 / 57

Page 52: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 4: Algorithm Footprints in TSP Instance Space

Footprints in Instance Space 22 / 57

Page 53: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Footprints in Instance Space 23 / 57

Page 54: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Convex and Concave Hulls

Footprints in Instance Space 24 / 57

Page 55: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Convex and Concave Hulls

Footprints in Instance Space 24 / 57

watch out forcontradictions

Page 56: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 5: Measuring the Size of Algorithm Footprints

Now we need only to calculate the area de�ning the footprint

I our metric of the power of an algorithm is the ratio of this areato the total area of the instance space

Area of Algorithm Footprint

Let H(S) be the convex hull of a region de�ned by a set of

points S = {(xi ,yi )∀i = 1, . . .η}

Area(H(S)) =1

2

k

∑j=1

(xjyj+1− yjxj+1)+(xky1− ykx1)

with the subset {(xj ,yj)∀j = 1, . . .k} and k ≤ η de�ning the

extreme points of H(S)

Footprints in Instance Space 25 / 57

Page 57: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 5: Measuring the Size of Algorithm Footprints

Now we need only to calculate the area de�ning the footprint

I our metric of the power of an algorithm is the ratio of this areato the total area of the instance space

Area of Algorithm Footprint

Let H(S) be the convex hull of a region de�ned by a set of

points S = {(xi ,yi )∀i = 1, . . .η}

Area(H(S)) =1

2

k

∑j=1

(xjyj+1− yjxj+1)+(xky1− ykx1)

with the subset {(xj ,yj)∀j = 1, . . .k} and k ≤ η de�ning the

extreme points of H(S)

Footprints in Instance Space 25 / 57

Page 58: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Step 5: CLK and LKCC Footprint Size

Good performance de�ned as 1% gap to known optimal solution

(via Concorde TSP solver)

Footprints in Instance Space 26 / 57

Page 59: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Step 1: Instance FeaturesStep 2: Instance GenerationStep 3: Visualising Instance SpaceStep 4: Visualising Algorithm FootprintsStep 5: Measuring Algorithm Footprints

Footprint size versus goodness de�nition

Relative area of footprint versus optimality gap %

Top line: Middle line: Bottom line:

good uniquely good uniquely good with no contradictions

Footprints in Instance Space 27 / 57

CLK

LKCC

Page 60: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Discussion

How do instance features help us understand the strengths and

weaknesses of optimisation algorithms?

I Provided we have the right feature set, we can create atopology-preserving instance space

I The boundary between good and bad performance can be seenI Feature selection methods may improve topology-preservation

How can we infer and visualise algorithm performance across a

huge �instance space�?

I PCA has been used to visualise instances in 2-d (or 3-d)I More than 90% of variation in data was preserved, but someimportant information (as well as noise) is naturally lost

I If the 4th largest eigenvalue is still large, then we loose toomuch detail, and other dimension reduction methods areneeded

Footprints in Instance Space 28 / 57

Page 61: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Discussion

How do instance features help us understand the strengths and

weaknesses of optimisation algorithms?

I Provided we have the right feature set, we can create atopology-preserving instance space

I The boundary between good and bad performance can be seenI Feature selection methods may improve topology-preservation

How can we infer and visualise algorithm performance across a

huge �instance space�?

I PCA has been used to visualise instances in 2-d (or 3-d)I More than 90% of variation in data was preserved, but someimportant information (as well as noise) is naturally lost

I If the 4th largest eigenvalue is still large, then we loose toomuch detail, and other dimension reduction methods areneeded

Footprints in Instance Space 28 / 57

Page 62: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Discussion, continued

How can we objectively measure algorithm performance?I We have proposed a method to calculate the relative size ofthe area of algorithm footprints

I Convex or concave hulls can be used depending ongeneralisation comfort (out-of-sample testing can help)

I The area of the footprint depends on the de�nition of �good�I LKCC has a larger footprint, for a broader de�nition of good,than CLK

How easy or hard are the benchmark instances?I TSPLIB and random instances are all in the middle (averagefeatures), and lie within the footprint of both algorithms

I More discriminating instances can be generated intentionallyusing evolutionary algorithms

I The diversity of instances is critical to achieve a generalisedinstance space

Footprints in Instance Space 29 / 57

Page 63: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Discussion, continued

How can we objectively measure algorithm performance?I We have proposed a method to calculate the relative size ofthe area of algorithm footprints

I Convex or concave hulls can be used depending ongeneralisation comfort (out-of-sample testing can help)

I The area of the footprint depends on the de�nition of �good�I LKCC has a larger footprint, for a broader de�nition of good,than CLK

How easy or hard are the benchmark instances?I TSPLIB and random instances are all in the middle (averagefeatures), and lie within the footprint of both algorithms

I More discriminating instances can be generated intentionallyusing evolutionary algorithms

I The diversity of instances is critical to achieve a generalisedinstance space

Footprints in Instance Space 29 / 57

Page 64: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Graph Colouring

Footprints in Instance Space 30 / 57

Given an undirected graph G (V ,E )with |V |= n, colour the vertices such

that no two vertices connected by an

edge share the same colour

Try to �nd the minimum number of

colours needed to colour the graph

(chromatic number)

NP-hard problem → numerous

heuristics for large n

Many applications, such as timetabling

where edges represent con�icts

between events

Page 65: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

What makes graph colouring hard?

In total we have 18 features that describe a graph instance

G (V ,E )

5 features relating to the nodes and edges

I The number of nodes or vertices in a graph: n = |V |I The number of edges in a graph: m = |E |I The density of a graph: the ratio of the number of edges tothe number of possible edges.

I Mean node degree: the degree of a node is the number ofconnections a node has to other nodes.

I SD of node degree: the average node degree and its standarddeviation can give us an idea of how connected a graph is.

Footprints in Instance Space 31 / 57

Page 66: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

What makes graph colouring hard?

In total we have 18 features that describe a graph instance

G (V ,E )

5 features relating to the nodes and edges

I The number of nodes or vertices in a graph: n = |V |I The number of edges in a graph: m = |E |I The density of a graph: the ratio of the number of edges tothe number of possible edges.

I Mean node degree: the degree of a node is the number ofconnections a node has to other nodes.

I SD of node degree: the average node degree and its standarddeviation can give us an idea of how connected a graph is.

Footprints in Instance Space 31 / 57

Page 67: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Graph features (continued)

8 features related to cycles and paths on the graphI The diameter of a graph: max shortest path distance betweenany two nodes.

I Average path length: average length of shortest paths for allnode pairs.

I The girth of a graph: the length of the shortest cycle.I The clustering coe�cient: a measure of node clustering.I Mean betweenness centrality: average fraction of all shortestpaths connecting all pairs of nodes that pass through a givennode.

I SD of betweenness centrality: with the mean, the SD gives ameasure of how central the nodes are in a graph.

I Szeged index / revised Szeged index: generalisation of Wienernumber to cyclic graphs (correlates with bipartivity)

I Beta: proportion of even closed walks to all closed walks(correlates with bipartivity)

Footprints in Instance Space 32 / 57

Page 68: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Graph features (continued)

5 features related to the Adjacency and Laplacian matrices

I Mean eigenvector centrality: the Perron-Frobenius eigenvectorof the adjacency matrix, averaged across all components.

I SD of eigenvector centrality: together with the mean, thestandard deviation of eigenvector centrality gives us a measureof the importance of a node inside a graph.

I Mean spectrum: the mean of absolute values of eigenvalues ofthe adjacency matrix (a.k.a �energy� of the graph).

I SD of the set of absolute values of eigenvalues of theadjacency matrix.

I Algebraic connectivity: the 2nd smallest eigenvalue of theLaplacian matrix, re�ecting how well connected a graph is.Cheeger's constant, another important graph property, isbounded by half the algebraic connectivity.

Footprints in Instance Space 33 / 57

Page 69: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Graph Colouring Instances

We use a set of 6788 instances from a variety of well-studied

sources, and others we have generated to explore bipartivity

DataSet # instances Description

B 1000 Bipartivity Controlled

C1 1000 Culberson: cycle-driven

C2 932 Culberson: geometric

C3 1000 Culberson: girth and degree inhibited

C4 1000 Culberson: IID edge probabilities

C5 1000 Culberson: weight-biased

D 743 DIMACS instances

E 20 Social Network graphs

F 80 Sports Scheduling

G 13 Exam Timetabling

Footprints in Instance Space 34 / 57

Page 70: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Graph Colouring Algorithms

We use the same 8 algorithms considered by Lewis et al.I DSATUR: Brelaz's greedy algorithm (exact for bipartitegraphs)

I RandGr: Simple greedy �rst-�t colouring of randompermutations of nodes

I Bktr: a backtracking version of DSATUR (Culberson)I HillClimb: a hill-climbing improvement on initial DSATURsolution

I HEA: Hybrid evolutionary algorithmI TabuCol: Tabu search algorithmI PartCol: Like TabuCol, but doesn't restricts to feasible spaceI AntCol: Ant Colony meta-heuristic

Reference

Lewis, R. et al. �A wide-ranging computational comparison of high-performance graphcolouring algorithms�. Computers & Operations Research 39(9), pp. 1933-1950, 2012.

Footprints in Instance Space 35 / 57

Page 71: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Graph Colouring Algorithms

We use the same 8 algorithms considered by Lewis et al.I DSATUR: Brelaz's greedy algorithm (exact for bipartitegraphs)

I RandGr: Simple greedy �rst-�t colouring of randompermutations of nodes

I Bktr: a backtracking version of DSATUR (Culberson)I HillClimb: a hill-climbing improvement on initial DSATURsolution

I HEA: Hybrid evolutionary algorithmI TabuCol: Tabu search algorithmI PartCol: Like TabuCol, but doesn't restricts to feasible spaceI AntCol: Ant Colony meta-heuristic

Reference

Lewis, R. et al. �A wide-ranging computational comparison of high-performance graphcolouring algorithms�. Computers & Operations Research 39(9), pp. 1933-1950, 2012.

Footprints in Instance Space 35 / 57

HEA reported as bestoverall

Page 72: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Visualising the instance space

Footprints in Instance Space 36 / 57

Page 73: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

De�ning goodness of algorithm performance

Footprints in Instance Space 37 / 57

Identify best solution from all 8

algorithms

Algorithm is good if gap to best

solution is ≤ α%

Blue means good=easy,

Red means bad=hard

Bktr footprint:α = 0.04

Page 74: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

De�ning di�culty of instances

If less than a given fraction β of the 8 algorithms �nd an

instance easy, then we label the instance as hard for the

portfolio of algorithms

I e.g. if β = 0.5 then an instance will be labelled hard if less thanhalf (only 1, 2 or 3 of the total eight algorithms) �nd it easy

It is important that we understand where good algorithm

performance is uninteresting (if all algorithms �nd the

instances easy) or interesting (if other algorithms struggle)

Footprints in Instance Space 38 / 57

Page 75: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

De�ning di�culty of instances

If less than a given fraction β of the 8 algorithms �nd an

instance easy, then we label the instance as hard for the

portfolio of algorithms

I e.g. if β = 0.5 then an instance will be labelled hard if less thanhalf (only 1, 2 or 3 of the total eight algorithms) �nd it easy

It is important that we understand where good algorithm

performance is uninteresting (if all algorithms �nd the

instances easy) or interesting (if other algorithms struggle)

Footprints in Instance Space 38 / 57

Page 76: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

How many algorithms �nd an instance hard? (α = 0)

Footprints in Instance Space 39 / 57

Page 77: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Algorithm Footprints

Footprints in Instance Space 40 / 57

Area of Footprintscan give quantitativemeasure of power

Page 78: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Algorithm Footprints

Footprints in Instance Space 40 / 57

Warning: Blue (easy)may be on top of Red(hard)

Page 79: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Learning to predict easy or hard instances for a given α,β

Footprints in Instance Space 41 / 57

Naive Bayes classi�er inR2 is 85% accurate

Page 80: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Recommending algorithms

Footprints in Instance Space 42 / 57

But why?

Page 81: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

On which instance classes is each algorithm best suited?

Footprints in Instance Space 43 / 57

Page 82: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Characterising algorithm suitability based on features

Enables us to see what properties (not instance class labels)

explain algorithm performance.

Representation of instance space (location of instances)

depends on feature set.

We have used a GA to select optimal feature subset to

maximise separability (reduce contradictions) in footprints to

enable cleaner calculation of area of footprints.

Using all 18 features, some interesting feature distributions

clearly show the properties of instances that create easy or

hard instances for each algorithm.

Footprints in Instance Space 44 / 57

Page 83: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Characterising algorithm suitability based on features

Enables us to see what properties (not instance class labels)

explain algorithm performance.

Representation of instance space (location of instances)

depends on feature set.

We have used a GA to select optimal feature subset to

maximise separability (reduce contradictions) in footprints to

enable cleaner calculation of area of footprints.

Using all 18 features, some interesting feature distributions

clearly show the properties of instances that create easy or

hard instances for each algorithm.

Footprints in Instance Space 44 / 57

Page 84: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Characterising algorithm suitability based on features

Enables us to see what properties (not instance class labels)

explain algorithm performance.

Representation of instance space (location of instances)

depends on feature set.

We have used a GA to select optimal feature subset to

maximise separability (reduce contradictions) in footprints to

enable cleaner calculation of area of footprints.

Using all 18 features, some interesting feature distributions

clearly show the properties of instances that create easy or

hard instances for each algorithm.

Footprints in Instance Space 44 / 57

Page 85: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Characterising algorithm suitability based on features

Enables us to see what properties (not instance class labels)

explain algorithm performance.

Representation of instance space (location of instances)

depends on feature set.

We have used a GA to select optimal feature subset to

maximise separability (reduce contradictions) in footprints to

enable cleaner calculation of area of footprints.

Using all 18 features, some interesting feature distributions

clearly show the properties of instances that create easy or

hard instances for each algorithm.

Footprints in Instance Space 44 / 57

Page 86: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Feature Distributions in Instance Space

Footprints in Instance Space 45 / 57

Page 87: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Footprints in Instance Space 46 / 57

Page 88: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Footprints in Instance Space 47 / 57

Page 89: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Footprints in Instance Space 48 / 57

Page 90: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Footprints in Instance Space 49 / 57

Page 91: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Footprints in Instance Space 50 / 57

Page 92: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Footprints in Instance Space 51 / 57

Page 93: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Reference

Balakrishnan, R. �The energy of a graph�. Linear Algebra and its applications, vol.387, pp. 287-295, 2004.

Footprints in Instance Space 52 / 57

Page 94: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Reference

Pisanski, T., & Randi¢, M. �Use of the Szeged index and the revised Szeged index formeasuring network bipartivity�. Disc. Appl. Math, vol. 158, pp. 1936-1944, 2010.

Footprints in Instance Space 53 / 57

Page 95: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

Reference

Estrada, E., & Rodríguez-Velázquez, J. A. �Spectral measures of bipartivity in complexnetworks�. Physical Review E, vol. 72(4), 046105, 2005.

Footprints in Instance Space 54 / 57

Page 96: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Meta-DataApplying the methodologyVisualising Strengths and Weaknesses

HEA is not best everywhere (NFL) ... why not?

Footprints in Instance Space 55 / 57

Page 97: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Conclusions

The proposed methodology is a �rst step towards providing

researchers with a tool to

I report the strengths and weaknesses of their algorithmsI show the relative power of an algorithm either

across the entire instance space, orin a particular region of interest (e.g. real world problems).

We are currently developing the key components of the

methodology (evolved instances, feature sets) for a number of

broad classes of optimization problems.

The approach generalises to parameter selection within

algorithms as well, and to choice of formulation.

We hope to be providing a free lunch for optimisation

researchers soon!

Footprints in Instance Space 56 / 57

Page 98: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Conclusions

The proposed methodology is a �rst step towards providing

researchers with a tool to

I report the strengths and weaknesses of their algorithmsI show the relative power of an algorithm either

across the entire instance space, orin a particular region of interest (e.g. real world problems).

We are currently developing the key components of the

methodology (evolved instances, feature sets) for a number of

broad classes of optimization problems.

The approach generalises to parameter selection within

algorithms as well, and to choice of formulation.

We hope to be providing a free lunch for optimisation

researchers soon!

Footprints in Instance Space 56 / 57

Page 99: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Conclusions

The proposed methodology is a �rst step towards providing

researchers with a tool to

I report the strengths and weaknesses of their algorithmsI show the relative power of an algorithm either

across the entire instance space, orin a particular region of interest (e.g. real world problems).

We are currently developing the key components of the

methodology (evolved instances, feature sets) for a number of

broad classes of optimization problems.

The approach generalises to parameter selection within

algorithms as well, and to choice of formulation.

We hope to be providing a free lunch for optimisation

researchers soon!

Footprints in Instance Space 56 / 57

Page 100: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Conclusions

The proposed methodology is a �rst step towards providing

researchers with a tool to

I report the strengths and weaknesses of their algorithmsI show the relative power of an algorithm either

across the entire instance space, orin a particular region of interest (e.g. real world problems).

We are currently developing the key components of the

methodology (evolved instances, feature sets) for a number of

broad classes of optimization problems.

The approach generalises to parameter selection within

algorithms as well, and to choice of formulation.

We hope to be providing a free lunch for optimisation

researchers soon!

Footprints in Instance Space 56 / 57

Page 101: Footprints in Instance Space: Steps oTwards a Free Lunch · Introduction Methodology (TSP Case Study) Discussion Case Study: Graph Colouring Conclusions Footprints in Instance Space:

IntroductionMethodology (TSP Case Study)

DiscussionCase Study: Graph Colouring

Conclusions

Further Reading

K. Smith-Miles & L. Lopes, �Measuring Instance Di�culty for Combinatorial OptimizationProblems�, Comp. & Oper. Res., vol. 39(5), pp. 875-889, 2012.K. Smith-Miles, �Cross-disciplinary perspectives on meta-learning for algorithm selection�, ACMComputing Surveys, vol. 41, no. 1, article 6, 2008.Travelling Salesman Problem

I K. Smith-Miles and J. van Hemert, �Discovering the Suitability of OptimisationAlgorithms by Learning from Evolved Instances�, Annals of Mathematics and Arti�cialIntelligence, vol. 61, no. 2, pp. 87-104, 2011.

Quadratic Assignment Problem

I K. Smith-Miles, �Towards insightful algorithm selection for optimisation usingmetalearning concepts,� IEEE Int. Joint Conf. Neural Networks, pp. 4118-4124, 2008.

Job Shop Scheduling

I K. Smith-Miles, R. James, J. Gi�n, Y. Tu, Y, �A Knowledge Discovery Approach toUnderstanding Relationships between Scheduling Problem Structure and HeuristicPerformance,� Proc 3rd Learning and Intelligent Optimization conference, LNCS, vol.5851, pp. 89-103, 2009.

Timetabling

I K. Smith-Miles and L. Lopes, �Generalising algorithm performance in instance space: atimetabling case study,� Proc. 5th Learning and Intelligent Optimization conference,LNCS, vol. 6683, pp. 524-538, 2011.

Graph Colouring

I K. Smith-Miles, B. Wreford, L. Lopes and N. Insani, �Predicting metaheuristicperformance on graph coloring problems using data mining�, in E. Talbi, (Ed.), HybridMetaheuristics, Springer, pp. 417-432, 2013.

Footprints in Instance Space 57 / 57