
Mälardalen University Press Licentiate Theses No. 141

STEREO VISION ALGORITHMS IN RECONFIGURABLE HARDWARE FOR ROBOTICS APPLICATIONS

Jörgen Lidholm

2011

School of Innovation, Design and Engineering


Copyright © Jörgen Lidholm, 2011
ISBN 978-91-7485-033-8
ISSN 1651-9256
Printed by Mälardalen University, Västerås, Sweden

Abstract

This thesis presents image processing solutions in FPGA-based embedded vision systems. Image processing is demanding, but the information that can be extracted from images is very useful and can be used for many tasks such as mapping and navigation, object detection and recognition, collision detection and more.

Image processing or analysis involves reading images from a camera system, improving an image with respect to colour fidelity and white balance, removing distortion and extracting salient information. These steps are often referred to as low to medium level image processing and involve large amounts of data and fairly simple algorithms suitable for parallel processing. Medium to high level processing involves a reduced amount of data and more complex algorithms. Object recognition, which involves matching image features to information stored in a database, is of higher complexity.

A vision system can be used in anything from a car to industrial processes to mobile robots playing soccer or assisting people in their homes. A vision system often works with video streams that are processed to find pieces that can be handled in an industrial process, to detect obstacles that may be potential hazards in traffic, or to find and track landmarks in the environment that can be used to build a map and navigate from it. This involves large amounts of calculations, which is a problem: even though modern computers are fast, they may still not be able to execute the desired algorithms at the wanted frequency. Even the computers that are fast enough are bulky and require a lot of power, and are therefore not suitable for incorporation on small mobile robots.

In this thesis I will present the image processing sequence to give an understanding of the complexity of the processes involved, and I will discuss some processing platforms suitable for image processing. I will also present my work, which is focused on image algorithm implementations for reconfigurable hardware suitable for mobile robots with requirements on speed and power consumption.


Swedish Summary - Svensk Sammanfattning

Robots are becoming increasingly common in our society. They range from traditional industrial robots to robots that help us at home with the most tedious chores, robots used to monitor large areas, robots we keep purely for pleasure, and a whole host of other variants.

A robot is more than both a machine and an intelligent computer; a robot is a robot precisely when it can sense, plan and act. To sense means that the robot uses sensors to detect physical phenomena in its surroundings, anything from measuring temperature, determining the distance to something, recognising a person and so on. To plan means that the robot, using intelligence and what it knows about its surroundings from sensor data, plans how it should act. Finally, to act means that the robot carries out a physical action, which may consist of moving, lifting an object of some kind, or following a ball with its eyes (cameras).

A central part of a robot is consequently an intelligent sensor that ideally can be used for as much as possible, can deliver the information the robot wants quickly enough for the robot to make a decision, and moreover delivers information the robot actually has use for. The eyes serve an important function for humans: we use them primarily to see what is in our surroundings, we can also determine distances with sufficiently good accuracy, and the eyes also help us keep our balance. To build a system for artificial vision for robots, cameras are needed first of all, but also methods for analysing the images the cameras capture. Analysing image information requires large resources in terms of computational power, partly because the methods normally used are relatively advanced, but mainly because of the large amount of information.


An image contains millions of small colour values; to determine depth in the images, two images from different perspectives are also required, which doubles the amount of information. For a robot to be able to move reasonably fast, an update rate of tens of images per second is also required.

An ordinary modern personal computer contains a general-purpose processor that works at a high clock frequency. Despite the high clock frequency, computations on large amounts of data take time. A so-called FPGA is a device that can be programmed to solve a number of specific tasks and only these. The FPGA allows an enormous number of computations to run in parallel and can therefore yield a huge improvement in computational capacity compared with a standard computer, and at lower energy consumption as well.

In this thesis I give an overall discussion of what a robot is and what different kinds of robots exist. The focus of the thesis, however, is on image analysis for robot applications in FPGAs. Among other things, I discuss problems and solutions for image analysis in FPGAs and also cover related research areas that can be applied to this problem.


Acknowledgements

I would like to take the opportunity to mention the people who have supported my work at Mälardalen University in one way or another. I have really enjoyed the years, which have passed by too fast, and I would not hesitate to make the same journey again.

First I would like to thank those who have provided the financial support that enabled this: the Knowledge Foundation, Robotdalen and Xilinx (which has contributed by providing tools and hardware).

My main supervisor Lars Asplund has not only been my supervisor, he has also been a never-ending source of ideas. It has not always been so easy to keep up, but it has always been great fun! I also consider Lars a dear friend, and I have enjoyed the many discussions we have shared during the years. A great thank you also goes to my two assisting supervisors: Mikael Ekström, who became more and more involved and was of great help while I was writing my final paper, and Giacomo Spampinato, who brought fresh research thinking into the group; I hold you to your word that if I need help, your Sicilian family can help.

During the course of my studies, both at graduate and undergraduate level, I have had the pleasure to meet many great friends. I would like to thank some of you by sending an extra large hug to you. Thank you Andreas Hjertström for all the interesting talks and for being a great friend since we started the computer engineering program in 2001. Fredrik Ekstrand, my comrade in arms, thank you for all the great talks on both private and professional matters. Hüseyin Aysan, the man of strong prepositions and probably the most curious person I've ever met, I'm looking forward to riding Siljan runt with you! Carl Ahlberg, a good friend and an inspiration when it comes to enjoying life; hopefully I can convince you to ride Finnmarskturen or Cykelvasan, or both, with me. Peter Wallin, a great friend since we started the computer engineering program in 2001.


I would also like to thank all the people taking part in coffee break discussions; you are many, and you have all taken part in making my time at the university a pleasure.

A big thank you to my parents, Sonja and Tommy, for supporting me during my life and pushing me to follow my heart; you mean everything to me. To my brother Johan, thank you for being an inspiration in life, for the times we have shared discussing things and life over a whiskey by the whiskey-holk or elsewhere. Thank you my sister Linda for being supportive and a great sister. Thank you Sandra and Robert for believing in me and giving me the privilege to become “godfather” to your three lovely boys, Albin, Hampus and Viktor.

Teuvo and Raili, thank you for all your support.

And last but not least, thank you Pia for always supporting me, you are the love of my life!

Thank you all!!

Jörgen Lidholm
Göteborg, September 2011

List of Publications

Papers Included in the Licentiate Thesis

Paper A: Two Camera System for Robot Applications; Navigation, Jörgen Lidholm, Fredrik Ekstrand and Lars Asplund. In Proceedings of the 13th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA'08), IEEE Industrial Electronics Society, Hamburg, Germany, 2008.

Paper B: Validation of Stereo Matching for Robot Navigation, Jörgen Lidholm, Giacomo Spampinato and Lars Asplund. 14th IEEE International Conference on Emerging Technology and Factory Automation, Palma de Mallorca, Spain, September 2009.

Paper C: Hardware support for image processing in robot applications, Jörgen Lidholm, Lars Asplund, Mikael Ekström and Giacomo Spampinato. In submission.


Additional Papers by the Author

Robotics for SMEs - 3D Vision in Real-Time for Navigation and Object Recognition, Fredrik Ekstrand, Jörgen Lidholm and Lars Asplund. 39th International Symposium on Robotics (ISR 2008), Seoul, Korea.

Stereo Vision Based Navigation for Automated Vehicles in Industry, Giacomo Spampinato, Jörgen Lidholm, Fredrik Ekstrand and Lars Asplund. 14th IEEE International Conference on Emerging Technology and Factory Automation, Palma de Mallorca, Spain, September 2009.

Navigation in a Box: Stereovision for Industry Automation, Giacomo Spampinato, Jörgen Lidholm, Fredrik Ekstrand, Carl Ahlberg, Lars Asplund and Mikael Ekström. In Advances in Theory and Applications of Stereo Vision, edited by Asim Bhatti, ISBN: 978-953-307-516-7, InTech, January 2011.

An Embedded Stereo Vision Module for 6D Pose Estimation and Mapping, Giacomo Spampinato, Jörgen Lidholm, Carl Ahlberg, Fredrik Ekstrand, Mikael Ekström and Lars Asplund. 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems. In submission.

Additional Book Chapters by the Author

Navigation in a Box: Stereovision for Industry Automation, Giacomo Spampinato, Jörgen Lidholm, Fredrik Ekstrand, Carl Ahlberg, Lars Asplund and Mikael Ekström (2011). In Advances in Theory and Applications of Stereo Vision, Asim Bhatti (Ed.), ISBN: 978-953-307-516-7, InTech. Available from: http://www.intechopen.com/articles/show/title/navigation-in-a-box-stereovision-for-industry-automation


Contents

I Thesis

1 Introduction
1.1 Thesis outline

2 Background and Motivation
2.1 The Robot: Sense, plan and act
2.1.1 Types of robots
2.1.2 Robotics tasks
2.1.3 Vision in robotics application
2.1.4 Robotics for SME
2.2 Vision and image processing
2.2.1 A camera system/Image sensor
2.2.2 Colour domains
2.2.3 Distortion and correction
2.2.4 Image features
2.2.5 Stereo vision
2.3 Hardware support for heavy computation tasks
2.3.1 Digital Signal Processors
2.3.2 Vector processors
2.3.3 Systolic array processors
2.3.4 Asymmetric multicore processors
2.3.5 General Purpose GPUs (GPGPU)
2.3.6 Field Programmable Gate Arrays
2.4 FPGA for (stereo-) vision applications
2.4.1 Designing FPGA architectures (What they need)
2.5 Embedded FPGA based vision system


2.6 Component based software development for hybrid FPGA systems
2.7 Simultaneous Localization And Mapping (SLAM)
2.8 Summary

3 Summary of papers and their contribution
3.1 Paper A
3.1.1 Contribution
3.2 Paper B
3.2.1 Contribution
3.3 Paper C
3.3.1 Contribution

4 Conclusions and Future work
4.1 Conclusions
4.2 Future work

Bibliography

II Included Papers

5 Paper A: Two Camera System for Robot Applications; Navigation
5.1 Introduction
5.2 Related work
5.3 Experimental platform
5.3.1 Image sensors
5.3.2 FPGA board
5.3.3 Carrier board
5.4 Feature detectors
5.4.1 Stephen and Harris combined corner and edge detector
5.4.2 FPGA implementation of Harris corner detector
5.5 Interest point location
5.5.1 Image sequence feature tracking
5.5.2 Spurious matching and landmark evaluation
5.5.3 Experiments
5.6 Results
5.7 Future work
Bibliography

6 Paper B: Validation of Stereo Matching for Robot Navigation
6.1 Introduction
6.2 Theory
6.2.1 Definitions
6.2.2 Navigation process overview
6.2.3 Stereo triangulation
6.2.4 Back projection of landmarks onto the image sensor
6.2.5 Planar egomotion estimation
6.3 Experiments
6.3.1 Experimental Platform
6.3.2 Stereo Camera Calibration and rectification
6.3.3 Experimental setup
6.4 Results
6.4.1 Stereo matching
6.4.2 Landmark location
6.4.3 Egomotion estimation
6.5 Conclusions
6.6 Future work
Bibliography

7 Paper C: Hardware support for image processing in robot applications
7.1 Introduction
7.2 Image Processing; Hardware and Software support
7.2.1 The image processing sequence
7.2.2 Hardware for Image Processing
7.2.3 Image processing in FPGAs
7.3 Developing Software in FPGA
7.4 Lens distortion correction
7.4.1 Method 1: For cases when the tangential distortion is negligible
7.4.2 Method 2: For cases when the tangential distortion must be corrected
7.5 Implementation details
7.5.1 Proposed hardware design for component based embedded image processing
7.5.2 Fixed point arithmetic
7.5.3 Look-up tables (LUTs)


7.5.4 Method 1: Implementation details
7.5.5 Method 2: Implementation details
7.6 Results
7.6.1 Execution time
7.6.2 Resource requirements
7.6.3 Precision
7.7 Discussion
7.8 Future work
Bibliography

Acronyms

ADC Analog-to-Digital Converter

AGV Autonomous Guided Vehicle

AUV Autonomous Underwater Vehicle

ASIC Application Specific Integrated Circuit

CBSE Component Based Software Engineering

CCD Charge Coupled Device

CMOS Complementary Metal-Oxide Semiconductor

CPU Central Processing Unit

DSP Digital Signal Processor

FPGA Field Programmable Gate Array

GPS Global Positioning System

GPU Graphics Processing Unit

HDL Hardware Description Language

IC Integrated Circuit

ROV Remotely Operated Vehicle

SIMD Single Instruction Multiple Data

MIMD Multiple Instructions Multiple Data


SLAM Simultaneous Localisation And Mapping

SME Small and Medium Enterprises

SoC System-on-Chip

UAV Unmanned Aerial Vehicle

VLIW Very Long Instruction Words

I

Thesis


Chapter 1

Introduction

In industry, robots have existed for decades. The main purpose of an industrial robot is to carry out heavy, monotonous tasks that often require high precision, e.g. car manufacturing or machine tending. However, to maintain high precision over time and to be able to lift heavy objects, the robot is often bolted to the ground. For a machine tending robot that would also mean that the robot is occupying a machine.

A mobile industrial robot that can be utilised when needed, or can tend several machines at the same time, would enable smaller (small to medium size) enterprises to invest in a robot. The robot can easily be moved to free a machine, enabling an operator to run a short series manually. Making an industrial mobile robot flexible, to meet the requirements of small and medium enterprises, requires a navigation system for the robot to know its location and move between different machines or to fetch material from the warehouse when needed. A navigation system should be flexible and easy to operate for a small company, enabling easy changing of the route the robot should move along. One possibility is to let a person guide the mobile platform using a well known interface like a joystick. Basically two different sensors would provide sufficient precision and flexibility for such a system, i.e. laser range finders and cameras. A stereo camera provides higher flexibility and could still be less expensive than a laser range finder. An image must be corrected for distortion and salient information extracted, which can be utilised to create landmarks through multiple camera views. Image analysis requires calculations in several steps on large amounts of data and is the foundation for advanced robot tasks, e.g. navigation. Computational performance is still a problem in image processing, mainly due to the large amount of data.
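As a rough illustration of why a calibrated stereo camera can serve as a range sensor, the depth of a matched image point follows directly from its disparity between the left and right images. The following is a minimal sketch in Python, assuming a rectified camera pair and example values for focal length and baseline; it is not the implementation used in the included papers, where the corresponding computation is carried out in FPGA hardware.

    import numpy as np

    def depth_from_disparity(disparity_px, focal_px=700.0, baseline_m=0.10):
        # Ideal rectified stereo geometry: Z = f * B / d.
        # focal_px and baseline_m are assumed example values, not calibration data.
        d = np.asarray(disparity_px, dtype=np.float64)
        return np.where(d > 0, focal_px * baseline_m / d, np.inf)

    # A feature matched with a 35 pixel disparity lies at roughly 2 metres.
    print(depth_from_disparity(35.0))

The same relation also shows why depth resolution degrades with distance: at long range a single pixel of disparity corresponds to a large step in estimated depth.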


Robots are not only for industrial applications; there are several areas in our everyday life where a robot can assist. For many applications the robot needs information on its location and also knowledge of objects in the environment that either can be obstacles or are supposed to be handled in some way. In this thesis I will describe the task at hand when building a system utilising a Field Programmable Gate Array (FPGA), what is required and what limitations there are in such a system. In parallel with this work, two stereo camera systems have been designed and used as validation platforms.

1.1 Thesis outline

The outline of this thesis is divided into two parts. Part I presents the background and motivation for the thesis and puts my papers into context. Chapter 2 presents robots, image processing, hardware for image processing and implementation considerations for FPGAs. Chapter 3 presents my papers and my contributions in each paper. Chapter 4 gives a summary, and I summarise relevant future work. Part II presents the technical contribution of the thesis in the form of three papers.

Chapter 2

Background and Motivation

This chapter gives an overview of the research area and a motivation for my work. I will start with a presentation of my view of what a robot is, continue with an introduction to the processes involved in image analysis, and finally present hardware support for image processing, with focus on the FPGA and the development process involved. Some ideas that emerged during the course of my research, such as the Component Based Software Engineering (CBSE) paradigm as an influence on FPGA architecture development, will also be discussed.

2.1 The Robot: Sense, plan and act

A robot is, in my opinion, defined by three abilities: sense, plan and act. A robot collects information about the environment and uses that information to plan and perform a physical act. The abilities to sense and plan are what differentiate a robot from a mechanical machine. In this section I will discuss the three parts and draw some parallels to humans.

Sense

A sensor is a device that converts a physical phenomenon into an electric signal which, when converted to a digital value, can be used in a software program. A wide range of sensors exists for sensing most physical properties and, in different constructions, they can sense anything from the rotation of a wheel to the distance to a wall.
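As a small, purely illustrative sketch of that conversion chain, the Python function below scales a raw reading from a hypothetical 10-bit analog-to-digital converter into a voltage and then into a distance. The sensor model and all constants are assumptions chosen for the example, not values from any hardware used in this work.

    def adc_to_distance(raw_count, vref=3.3, bits=10, volts_per_metre=1.5):
        # Convert an ADC count into metres: count -> voltage -> physical quantity.
        max_count = (1 << bits) - 1          # 1023 for a 10-bit converter
        voltage = (raw_count / max_count) * vref
        return voltage / volts_per_metre     # assumed linear sensor response

    # A mid-range reading of 512 corresponds to roughly 1.1 metres in this model.
    print(adc_to_distance(512))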


For humans, vision is very important; it helps us to see objects in our vicinity, and we are able to identify or at least categorise them. With our eyes we can estimate depth and the distance between objects to acquire a good knowledge of the environment, comparable to a map. Vision also assists other abilities such as balance.

Because vision is so important for humans, it can also be a powerful source of information for a robot. Modern cameras are fairly low cost but “dumb”; a camera must be complemented with an image analysis system. An image consists of a dense colour matrix, from which salient regions are extracted with a feature detector that can detect corners, edges or blobs, or be matched using custom templates. A human vision system can be mimicked to some extent once the features have been acquired.
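To make the role of such a feature detector concrete, the sketch below computes the Harris corner response, the measure used by the detector in Paper A, in Python with numpy and scipy. It is only an algorithmic illustration: the thesis work implements the detector as a streaming FPGA design, and the smoothing, k and threshold values here are assumptions rather than the parameters used in the papers.

    import numpy as np
    from scipy.ndimage import gaussian_filter, sobel

    def harris_response(gray, sigma=1.0, k=0.04):
        # Corner response R = det(M) - k * trace(M)^2, where M is the
        # structure tensor built from smoothed products of image gradients.
        gray = gray.astype(np.float64)
        ix = sobel(gray, axis=1)             # horizontal gradient
        iy = sobel(gray, axis=0)             # vertical gradient
        ixx = gaussian_filter(ix * ix, sigma)
        iyy = gaussian_filter(iy * iy, sigma)
        ixy = gaussian_filter(ix * iy, sigma)
        det = ixx * iyy - ixy * ixy
        trace = ixx + iyy
        return det - k * trace * trace

    def corner_points(gray, threshold_ratio=0.01):
        # Keep pixels whose response exceeds a fraction of the strongest response.
        r = harris_response(gray)
        return np.argwhere(r > threshold_ratio * r.max())

Each returned coordinate is a candidate interest point; in the stereo setting such points are matched between the left and right images and triangulated into landmarks.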

There is a good reason why the visual cortex takes up such a large part of the human brain, both due to the enormous amount of information that has to be handled for us to understand what it is we see, and due to the complex understanding humans can gain from the rays of light projected on the retinas of our eyes. A human can easily extract interesting information from what we see, and our understanding is not disrupted by artefacts like shadows, texture, occlusion, light temperature and colour. In computer vision applications, artificial lighting is often used to remove unwanted shadows.

A large amount of research has been performed in the area of image processing and related topics for many years, and the field has matured a great deal; however, computational performance is still an obstacle.

Plan

When a robot is able to understand what is in the environment and its own location, both in relation to interesting objects and on a higher level, it can start to plan an act. Planning is often implemented with artificial intelligence (AI) algorithms. In the AI field there are different types of algorithms that try to mimic intelligence through different concepts, e.g. genetic algorithms, reinforcement learning, neural networks, fuzzy logic and case based reasoning.

A robot should be able to handle a set of cases without knowing exactly what will happen. For example, an automatic door is not a robot; it is too simple and its behaviour is predictable. AI tries to mimic the human way of thinking, which requires abstraction of information. For instance, a location could be described as “in front of the refrigerator in the kitchen” as opposed to (32.3, 123.4, 0.0). In other words, my understanding of my location is only relevant with respect to what I want to accomplish.


Act

A robot acts through a physical action; it can be anything from moving to a different location to pushing a button or grabbing an object. To do this, motors, pneumatics, hydraulics, gearboxes and other mechanical constructions are used. This will not be discussed any further in this thesis, however interesting it may be.

2.1.1 Types of robots

Designing a robot is a matter of finding a good design for what the robot is supposed to do. For simpler tasks a simple machine is all that is required, but some tasks demand intelligence and advanced mechanical functionality. Robots that will help humans in their daily life at home should probably mimic some properties of humans, since the environments created by humans are also created for humans. Other tasks may not be so simple to carry out by robots inspired by humans, and it is often a good solution to take inspiration from animals, be it a snake, spider, dolphin or something else.

Robots exist in many different shapes and designs. Robots in production industry from the middle of the twentieth century looked more or less the same as they do today; the dexterity has increased, as have precision, speed and reliability. The industrial robot is the most common kind and is meant to perform tasks that humans used to do in production industry. Some robots imitate snakes, some are built to climb walls or trees, others to travel along an electric power line. There are also other kinds of robots with wheels, one or more legs, caterpillar tracks, arms or a head. A wheeled robot may have two, three, four or even more wheels; some have wheels based on a design by an inventor named Bengt Ilon, enabling omnidirectional motion. Robots with legs can be inspired by humans, having two legs, while some have six or even eight legs, inspired by insects. There are also autonomous robots designed to fly, Unmanned Aerial Vehicles (UAV), that come in different forms: standard model planes, helicopters and quadrocopters. Of course there are also robots designed for underwater mobility, Autonomous Underwater Vehicles (AUV), where some look like torpedoes.

A robot can be fully autonomous, semi-autonomous or teleoperated (in which case it probably should not be called a robot any more).

Mobile robots are powered by batteries, which are getting increasingly powerful but still pose a limitation. Power constraints enforce power efficient computing devices and efficient algorithms that are advanced enough to accomplish a certain task.

Page 27: Stereo vision algorithms in reconfigurable hardware for robotics applications

8 Chapter 2. Background and Motivation

For humans, vision is very important, it helps us to see objects in our vicin-ity and we are able to identify or at least categories them. With our eyes we canestimate depth and distance between objects to acquire a good knowledge ofthe environment, comparable to a map. Vision also assist other abilities suchas balance.

Because vision is so important for humans, it can also be a powerful sourceof information for a robot. Modern cameras are fairly low cost, however“dumb”, a camera must be complemented with an image analysis system. Animage consists of a dense colour matrix, where salient regions are extractedwith a feature detector which can detect corners, edges, blobs or be matchedusing custom templates. A human vision system can be mimicked to someextend, once the features have been acquired.

There is a good reason why visual cortex takes up such a large part of thehuman brain. Both due to the enormous amount of information that has tobe handled for us to understand what it is we see, and also by the complexunderstanding humans can gain from the rays of light projected on the retina ofour eyes. A human can easily extract interesting information from what we see,and our knowledge is not disrupted by artefacts like shades, texture, occlusion,light temperature and colour. In computer vision applications, artificial lightingis often used to remove any unwanted shadows.

A large amount of research has been performed in the area of image pro-cessing and related topics for many years and have matured a great deal, how-ever computational performance is still an obstacle.

Plan

When a robot is able to understand what is in the environment, its own loca-tion, both in relation to interesting objects as well as on a higher level, it canstart to plan an act. Planning is often implemented with artificial intelligence(AI) algorithms. In the AI field there are different types of algorithms thattries to mimic intelligence through different concepts, i.e. genetic algorithms,reinforcement learning, neural networks, fuzzy logic, case based reasoning.

A robot should be able to handle a set of cases without knowing exactlywhat will happen. For example an automatic door is not a robot, which istoo simple and the behaviour is predictable. AI tries to mimic the human wayof thinking, which require abstraction of information. For instance a locationcould be described as in front of the refrigerator in the kitchen as opposed to(32.3, 123.4, 0.0). In other words, my understanding of my location is onlyrelevant with respect to what I want to accomplish.

2.1 The Robot: Sense, plan and act 9

Act

A robot acts through a physical action, it can be anything from moving to a dif-ferent location, to push a button or grab an object. To do this, motors, pneumat-ics, hydraulics, gearboxes and other mechanical constructions are used. Thiswill not be discussed any further in this thesis however interesting it may be.

2.1.1 Types of robots

Designing a robot is a matter of finding a good design for what the robot issupposed to do. For simpler tasks a simple machine is all that is required butfor some tasks intelligence and advanced mechanical functionality is a require-ment. Robots that will help humans in their daily life at home should probablymimic some properties of humans, since the environment created by humansare also created for humans. Other tasks may not be so simple to carry out byrobots inspired by humans, but it is often a good solution to take inspirationfrom animals, be it a snake, spider, dolphin or something else.

Robots exists in many different shapes and designs. Robots in productionindustry from the middle of the twentieth century looked more or less the sameas they do today. The dexterity has increased as well as precision, speed andreliability. The industrial robot is the most common kind and is also meantto perform tasks that humans used to do in production industry. Some robotsimitates snakes, some are built to climb walls or trees others to travel alongan electric power line. But there are also other kinds of robots with wheels,one or more legs, caterpillar tracks, arms, head. A wheeled robot may havetwo, three, four or even more wheels, some have wheels based on a design byan inventor named Bengt Ilon, enabling omnidirectional motion. Robots withlegs can be inspired by humans, having two legs, some have six or even eightlegs, inspired by insects. There are also autonomous robots designed to fly,Unmanned Aerial Vehicle (UAV), that also come in different forms, standardmodel planes, helicopters and quadrocopters. Of course there are also robotsdesigned for under water mobility, Autonomous Underwater Vehicle (AUV)where some look like torpedoes.

A robot can be fully autonomous, semi-autonomous or teleoperated (in which case it probably should not be called a robot any more).

Mobile robots are powered by batteries, which are getting increasingly powerful but still pose a limitation. Power constraints enforce power-efficient computing devices and efficient algorithms that are still advanced enough to accomplish a certain task.

Mobile robots interact with the environment and thus must also be able to acquire information from it. In the environment there can be obstacles that need to be avoided and interesting objects that the robot should interact with, e.g. a human, a machine, a can of beer. This requires sensory equipment, and the more advanced the sensors are, the more information there is to process, further increasing the load on computing devices and battery consumption.

Early mobile robots were sometimes designed with a traditional desktop computer, whereas modern robots often use laptops (with their own power source) or embedded computer systems.

On the market today there is a large number of power-efficient devices, driven by the mobile market, where performance is becoming increasingly important with smart phones and tablets. A cell phone today can incorporate a System-on-Chip (SoC) with a multi-core processor, Digital Signal Processors (DSPs) and even a Graphics Processing Unit (GPU).

2.1.2 Robotics tasks

Of course, different kinds of robots are meant to execute different tasks, and the design is chosen to best suit the task at hand. For example, there are robots for transportation of beds, material and documents in hospitals, and for military purposes such as precision bombing, surveillance and exploration, both as airborne and ground vehicles. Civilian robots exist for search and rescue, surveillance, pool cleaning, lawn mowing, vacuum cleaning, planetary exploration (Mars rovers), various transportation operations, teleoperated surgery and also “semi-autonomous” and fully autonomous cars (still at the research stage). With the vast amount of research in robotics and related areas, robots are becoming increasingly intelligent and humanoids more expressive through speech synthesis and human-like facial expressions, which allows for a broad application area and a large variety of physical abilities.

Robotics in medical appliances

Teleoperated surgical robot systems are clever for many reasons: they allow skilled surgeons to perform an operation on a patient at a remote location. A Swedish surgeon could perform surgery on a patient in America without having to travel. Such a robotic system also increases precision, by allowing the surgeon to set how much an instrument should move for a given hand motion.

Figure 2.1: Teleoperated robot Giraff. (Photo: Giraff Technologies AB/Robotdalen)

Another type of robot is for elderly care: a teleoperated robot with mobility, a video camera and a screen, see figure 2.1. The robot is operated by a care giver or relatives and allows the operator to virtually "visit" a care taker without physically going there. The robot enables the operator to virtually move around in the house or apartment and see what is going on. A robot like this requires acceptance from both care giver and care taker, but will probably in time become semi-autonomous to simplify the task of moving between different rooms. The increased autonomy will require more advanced sensory equipment and software that allows navigation, obstacle detection and avoidance. This is important, especially in a dynamic environment such as a home, where furniture is moved around and people are moving as well.

Persons with physical handicaps are often bound to a wheelchair; an exoskeleton could be an alternative that could also work as a rehabilitation tool by gradually decreasing the amplification. An exoskeleton enhances a motion using actuators of varying kinds, which can also be utilised to allow a person to lift heavier things than he or she normally would be capable of lifting. Modern technology has reached a level where it is actually feasible to construct an exoskeleton which is light, durable and powerful enough.

Robots for personal use

A robot is meant to simplify our daily life and, to some extent, to provide joy. Vacuum cleaning and lawn mower robots are popular, probably due to a relatively affordable price, but also because those tasks are quite tedious. The robots are simple but perfect examples of use cases for robots; the tasks they perform are non-critical and we (at least I) are just happy if we do not have to do them ourselves, see figure 2.2 for examples.


(a) A vacuum cleaning robot (courtesy of iRobot)

(b) A lawn mowing robot (courtesy of Husqvarna)

Figure 2.2: Robots for your home.

Semi-autonomous cars

The human factor is almost always the main cause of car accidents. This has been recognised by car manufacturers, who introduce more technology to help the driver notice potential hazards and changes of speed limits, and also to act in critical situations, e.g. emergency braking. Comfort systems are also available, e.g. adaptive speed control. Sensors used today are based on both vision and radar technologies.

There are also systems in active development that focus more on the driver's attention, e.g. detecting drowsiness of the driver.

Robots in production industry

The first robots were made for industrial production, where the robot is a solution for improving production quality and lowering production cost through increased production speed. Industrial robots are extremely robust and built for constant operation while maintaining high precision. Industrial robots exist in many different designs to meet different demands, see figure 2.3.

(a) Robots for spot welding (b) High-speed robot for pick and place operations

Figure 2.3: Industrial robots exist in a wide range of configurations. (Photos: Courtesy of ABB)

UAV

Unmanned aerial vehicles have mainly been of interest in military applications, where a UAV allows close and relatively cost-efficient surveillance missions without putting human lives in harm's way. Figure 2.4 shows two examples, but there are other kinds and other applications.

(a) The Predator UAV (U.S. Air Force photo/Tech. Sgt. Erik Gudmundson)

(b) The Reaper UAV (U.S. Air Force photo/Tech. Sgt. Erik Gudmundson)

Figure 2.4: US military UAVs in active use, for different types of missions.


AUV

The most common type of underwater vehicle is the Remotely Operated Vehicle (ROV). ROVs are often used in deep water exploration and have been used for locating drowned persons after boat accidents. The sensory equipment is mainly based on sonar technology. Complementary vision systems allow an operator to see what exists in the near vicinity of the ROV.

An ROV can measure its own location using sonars that sense distinctive formations on the sea floor. Together with localisation relative to a support surface ship used as reference, maps can be created. The step from an ROV to an AUV is not so far; what is needed is intelligence. An AUV can be utilised for surveillance of off-shore installations, e.g. oil rigs, or intruder surveillance of sea ports.

Figure 2.5: The autonomous underwater vehicle, designed by students at Mälardalen University.

Rovers

A rover is a vehicle, manned or unmanned, which is used for terrain exploration. The unmanned Mars rover Spirit landed on Mars at the beginning of 2004 to start its geological research, looking for water and analysing minerals. The rover contains several cameras and tools to drill into rocks and collect samples of the surface for analysis using a wide range of instruments. Collected information was sent to Earth with low-speed, long-range radio.

An autonomous rover must plan paths to travel and be rugged and mobile enough to handle the rough terrain that it will encounter.

Figure 2.6: A concept drawing of Spirit, one of the Mars rovers, designed to withstand the hostile environment on Mars. (Courtesy of NASA)

2.1.3 Vision in robotics application

The old saying “an image says more than a thousand words” is very true. A lot of information can be collected from an image, which can be used for obstacle detection, object detection/recognition, self-localisation and mapping, face recognition and much more.

On top of the obvious ability to see things, a camera system on a robot can be utilised for several purposes, e.g. navigation, object recognition and obstacle avoidance. Modern camera and image processing performance allows video-rate processing and understanding, which is vital for high-speed applications. A robot is only useful if it can perform a given task in reasonable time, and for time-critical systems, e.g. obstacle avoidance and vehicle accident avoidance, the image processing must produce a result with sufficient quality and speed so that a critical situation can be avoided.

Speed is not always the main problem. It is vital for mobile robots, which are battery powered, that an image processing system consumes little power and has low weight. Consider a robot that harvests trees by climbing the trunk and disassembling the tree while it is still standing; carrying a heavy computer for image processing is not an option.

There is a need for power-efficient and compact vision systems that can perform advanced image analysis at video rate (30 Hz).


2.1.4 Robotics for SMEs

Figure 2.7: Opiflex, a mobile industrial platform. The platform is targeted at SMEs. (Photo: Anders Thunell)

In a development project called “Robotics for SMEs”, a mobile platform for an industrial robot was designed. The task was to design a robotic system that would help Small and Medium Enterprises (SMEs) to automate production by introducing a new tool.

The largest obstacle in automation for SMEs is the cost, both the investment in new equipment and the maintenance and setting a robot up for a new product. Typically an SME performs contract manufacturing of products in short series, and manual production is in many cases more cost efficient. The initial idea for the project was to design a robot that could be moved by hand between different machines, mainly to perform machine tending operations. The idea grew into a desire for an autonomous platform that could by itself move around to different machines and tend them. This grew even further into a concept of one or several autonomous platforms that could handle the complete production: fetching raw material from a warehouse, tending all machines used in the production, carrying the material between machines and finally putting the manufactured piece in a box.

An autonomous platform like this would not lock up a machine; normally an industrial robot is bolted to the floor in front of a machine. Thus the machine could still be manually operated if so needed. Furthermore, the factory owner would have increased freedom in positioning the machines.

Realising this kind of autonomous robot platform would require a vision system for navigation that does not need guide lines painted on the floor, magnetic tapes or any other kind of fixed guide system that is otherwise used for Autonomous Guided Vehicle (AGV) systems in industry today. With a vision-based system natural landmarks can be used, and the robot could be guided by an operator, for example using a joystick, to learn a new path. People would probably be moving around in the environment and things would be moved around; this could also be handled by a vision system with obstacle detection and avoidance.

The required infrastructure would be docking stations in front of each machine that is tended. The docking station should incorporate a charging plug, communication channels to the machine, compressed air if needed and, most importantly, position stability and calibration.

Together with the features described, a simplified instruction system is desired. Normally a robot system is programmed with a teach pad that is used to “jog” the robot and set points in space, and by traditional programming. If the robot could be instructed by, for example, voice and gestures, anyone could teach the robot how to work at each machine/station with little prior knowledge.

2.2 Vision and image processing

Image processing can be described as the process of transforming raw image data into an understanding of the image content. This involves improving the original image by correcting distortion, enhancing colours and luminance, as well as converting between different colour domains.

Humans can easily distinguish objects in the environment. An object is often separated from other objects by colour, pattern and/or texture, and the lines separating one object from another are in most cases easily detectable.

In computer vision, the pattern and colour of a single object can appear as different objects. Consider a white wall that is partly shaded, resulting in lower intensity in the shaded region. A human understands that the shaded part is not a new wall or an object on the wall. However, trying to implement that understanding in software running on a computer is tough.

Going from raw image data to an understanding of the environment consists of several steps, where the initial steps typically are referred to as low-level image processing, characterised by a large amount of data and fairly simple algorithms.


After the initial image enhancement algorithms, of low-level complexity, the task of extracting interesting information from the image starts: so-called salient regions that contain information of a certain kind that can be of interest, e.g. lines, corners and colour patterns (blobs). There are a large number of detectors with different properties, e.g. the Harris and Stephens combined corner and edge detector, Sobel edges, the Hough transform, the Moravec detector, SUSAN, and others. A detector still involves fairly simple operations on local regions of a few pixels; 3×3 to 11×11 is normal.

Salient regions can be used for navigation, using a Simultaneous Localisation And Mapping (SLAM) algorithm, or for object recognition/detection. Either way it is yet another processing step that is computationally costly.

In the following sections I will discuss the required steps from the raw image input to an understanding of the image content in more depth.

2.2.1 A camera system/Image sensor

Figure 2.8: Image sensor Bayer pattern

A digital camera consists primarily of a sensor and a lens. The image sensor is either of Charge Coupled Device (CCD) or Complementary Metal-Oxide Semiconductor (CMOS) type; the CCD has traditionally had a higher image quality while the CMOS sensor has been cheaper to manufacture. The CMOS sensors of today are almost as good as the CCD sensors regarding quality but excel in power efficiency.

The sensor consists of a Bayer pattern pixel matrix, as seen in Figure 2.8, where each pixel is primarily sensitive to light in one of three colours: red, green or blue. Effectively half of the sensor consists of green pixels, while red and blue allocate one quarter each. A pixel cell is an element similar to a solar cell which is charged by light, and the charge level is converted to a digital value with an Analog-to-Digital Converter (ADC).

A modern sensor can have millions of pixels and handles speeds of about 100 million pixels per second. For a 5-megapixel sensor, 100 Mpixel/s corresponds to a theoretical 20 frames per second, but synchronisation overhead reduces this to about 15 frames per second. The lens projects rays of light from a certain angle onto each image cell (pixel); without the lens each image cell would be affected by “all” ambient light. A lens is necessary but also affects the image in an undesired way by introducing distortion, which can be described by a radial and a tangential component and will be discussed further in section 2.2.3. The barrel effect is easily visible with a fish-eye lens, cf. Figure 2.9.

Modern camera chips have a multitude of settings that can be modified, such as skipping and binning, which allow for choosing the field of view; skipping reduces resolution but increases the field of view. The binning option combines the values of the output pixels with the skipped lines and/or rows. It is also possible to set which part of the image to use if not the whole image is desired.
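
As a software analogue of the binning option (in the camera the combination is done on the sensor itself), 2×2 binning of an image can be sketched as follows in Python/NumPy; the block size of two and the function name are only illustrative:

import numpy as np

def bin_2x2(img):
    # Average each non-overlapping 2x2 block into one output pixel,
    # halving the resolution while keeping the full field of view.
    h, w = img.shape[0] & ~1, img.shape[1] & ~1   # trim to an even size
    blocks = img[:h, :w].astype(float).reshape(h // 2, 2, w // 2, 2)
    return blocks.mean(axis=(1, 3))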

Figure 2.9: Example of fish–eye effect

The raw sensor data is in most cases pre-processed for white balance and colour correction to ensure proper colour fidelity. Auto white balance is in most cases an option available in the camera chip, but one may want to use a different algorithm than the built-in one.


2.2.2 Colour domains

If the colour information is read in Bayer format, it must be converted to a format appropriate for each use case. A camera of 5 megapixels produces 5 million pixels, each of one colour: red, green or blue. To attain 5 million pixels in, for example, RGB format, the missing colour values must be interpolated for each pixel.

There are three different cases in which the red component must be interpolated from neighbouring red pixels, on green or blue pixels, see figure 2.10. The three cases for interpolation of the blue component are similar.

Figure 2.10: Three different cases for interpolation of the colour red from a Bayer image. The red component for the centre pixel is calculated from either four neighbouring red pixels, two vertical red neighbours or two horizontal red neighbours.

For interpolation of the green component there are two cases, on red or blue pixels, and in both cases the four neighbouring green pixels can be used. A slightly better method for interpolation of the green component is a selective method that uses either the two horizontal or the two vertical neighbours. This method requires evaluation of neighbouring colour components, thus adding computational overhead that has to be weighed against the image quality improvement.

For selective interpolation of the green component on a red pixel, equation 2.1 should be used. Figure 2.11 describes the parameters of the equation.

G(R) =
  \begin{cases}
    (G_1 + G_2)/2 & \text{if } |R_1 - R_3| < |R_2 - R_4| \\
    (G_3 + G_4)/2 & \text{if } |R_1 - R_3| > |R_2 - R_4| \\
    (G_1 + G_2 + G_3 + G_4)/4 & \text{if } |R_1 - R_3| = |R_2 - R_4|
  \end{cases}
  \quad (2.1)
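
To make the selective method concrete, the following Python/NumPy sketch interpolates the green value at a red pixel of a raw Bayer image. It is only an illustration, not the implementation developed in this thesis: it follows the general idea of averaging the two green neighbours along the direction with the smaller red gradient, and the function name, array layout and (absent) border handling are assumptions made for the example.

import numpy as np

def green_at_red(raw, y, x):
    # Green neighbours directly above/below/left/right of the red pixel.
    g_up, g_down = int(raw[y - 1, x]), int(raw[y + 1, x])
    g_left, g_right = int(raw[y, x - 1]), int(raw[y, x + 1])
    # Red neighbours two pixels away, used to estimate the local gradients.
    grad_v = abs(int(raw[y - 2, x]) - int(raw[y + 2, x]))   # vertical, |R1 - R3|
    grad_h = abs(int(raw[y, x - 2]) - int(raw[y, x + 2]))   # horizontal, |R2 - R4|
    if grad_v < grad_h:                      # smoother vertically
        return (g_up + g_down) / 2
    if grad_h < grad_v:                      # smoother horizontally
        return (g_left + g_right) / 2
    return (g_up + g_down + g_left + g_right) / 4

For the red and blue components the corresponding interpolation follows the three cases of figure 2.10 in the same spirit.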

Technical advances in camera sensors are rapid; a modern, low-price compact camera contains an image sensor of approximately 14 megapixels. In raw format that is 14 megabytes of information per photo, given that the colour depth is 8 bits.

Figure 2.11: Selective interpolation of the green component on a red pixel. The centre red pixel R has green neighbours G1, G2, G3 and G4 directly above, to the right, below and to the left, and red neighbours R1, R2, R3 and R4 two pixels away in the same four directions.

Other colour domains such as HSI, RGB, YUV or grey-scale/monochrome can be acquired through binning or through interpolation and transformation. One of these colour domains is usually selected to get a continuous stream of the same kind of data; which one depends on the usage.
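
For example, a grey-scale (or YUV luminance) stream can be computed from interpolated RGB pixels with the common BT.601 luma weights; these particular weights are a standard choice, not one prescribed by this thesis:

import numpy as np

def rgb_to_grey(rgb):
    # Y = 0.299 R + 0.587 G + 0.114 B applied to an (H, W, 3) array.
    weights = np.array([0.299, 0.587, 0.114])
    return rgb.astype(float) @ weights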

2.2.3 Distortion and correction

Few, if any, lenses are perfect, and the imperfection of the lens causes distortion in the image. Thus the first procedure needed to start working with a vision system is to perform a calibration in order to identify the intrinsic and, for multiple-view cameras, the extrinsic parameters. Distortion correction is an important step after the camera calibration, since the correction allows the use of the pin-hole camera model and projective geometry to retrieve 3D information from 2D images.

A popular tool used to model and estimate the camera distortion is the “Camera Calibration Toolbox for Matlab” by Bouguet [6]. The intrinsic parameters identified include the lens distortion map, the principal point coordinates (CC) for the two sensors and the focal lengths (f) in pixel units [7], see figure 2.12. The lens distortion model consists of two parts: the radial model, the first term in equation 2.4, which is symmetric around the principal point, and the tangential model (also known as decentering distortion), shown in equation 2.5, which models the distortion effects caused by a leaning detector surface or a non-constant refraction index in the lens [8].

The radial distortion is modelled by a sixth-order polynomial with coefficients (k_i) that contains only even-order terms.


Figure 2.12: The focal length (f) is the distance from the optical centre of the lens to the principal focus. The principal point (CC) is the point on the image plane where the principal axis is projected. A small misalignment between the lens and the image sensor will make the principal point and the centre of the image plane not coincide, which usually is the case.

r = |P_n| \quad (2.2)

P_n = \begin{bmatrix} P_x - CC_x \\ P_y - CC_y \end{bmatrix} \quad (2.3)

P_d = P_n \, (1 + k_1 r^2 + k_2 r^4 + k_3 r^6) + dp + CC \quad (2.4)

dp = \begin{bmatrix} 2 k_3 P_x P_y + k_4 (r^2 + 2 P_x^2) \\ k_3 (r^2 + 2 P_y^2) + 2 k_4 P_x P_y \end{bmatrix} \quad (2.5)

Typically, one or two coefficients in the radial term are enough to compensate for the lens radial distortion, but in the case of a fish-eye lens all three coefficients may be required.
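
As an illustration of equations 2.2 through 2.5, the sketch below maps an undistorted pixel location to its distorted location. It is a sketch of the model as described above, not code from this thesis; the coefficient names (k1, k2, k3 for the radial part, p1 and p2 for the tangential part) are chosen here for readability and may not match the numbering used by the calibration toolbox.

import numpy as np

def distort_point(p, cc, k1, k2, k3, p1, p2):
    # Shift to coordinates relative to the principal point (eq. 2.3).
    p = np.asarray(p, dtype=float)
    cc = np.asarray(cc, dtype=float)
    pn = p - cc
    x, y = pn
    r2 = x * x + y * y                                     # r^2 with r = |Pn| (eq. 2.2)
    radial = 1.0 + k1 * r2 + k2 * r2 ** 2 + k3 * r2 ** 3   # radial term of eq. 2.4
    # Tangential (decentering) term of eq. 2.5.
    dp = np.array([2.0 * p1 * x * y + p2 * (r2 + 2.0 * x * x),
                   p1 * (r2 + 2.0 * y * y) + 2.0 * p2 * x * y])
    return pn * radial + dp + cc                           # eq. 2.4

Iterating such a function over every pixel of the corrected output image and sampling the input image at the returned location is the inverse mapping described next.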

The presented model is used to calculate the distorted location of a pixel, which is useful when creating a corrected image by iterating over the output image and picking the colour values from the distorted input image. I call this inverse mapping, see figure 2.13-b. The resulting image will be smaller than the input, effectively cropping the image.

On the other hand, an input pixel's location can be corrected using an iterative method that approximates the correct output location. I call this forward mapping, see figure 2.13-a. The resulting image will be larger than the input, thus creating an image with undefined pixels. If the forward mapping method is used to create a new image, the undefined pixels must be interpolated for a complete output image.

Figure 2.13: To the left (a) an illustration of forward mapping and to the right (b) an illustration of inverse mapping.

2.2.4 Image features

It is easy for humans to find distinctive features in an image and the corresponding features in the room. However, for a computer the image content is just a matrix of colour values, so further processing is required to find salient regions that can be distinguished from other regions in an image. Some popular and simple features are corners and edges. Methods for detecting corners and edges were developed already in the eighties, one of which is the “Harris and Stephens combined corner and edge detector”, which in turn was an improvement of the Moravec operator developed in 1977 [9, 10]. A feature is detected by looking at each pixel and its neighbours and applying a method which gives a value of how well the neighbourhood corresponds to a feature of a certain kind; for example, at a corner the intensity shifts in two directions, while an edge has a shift in one direction.
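
As a rough sketch of how such a detector can be expressed (not the implementation presented later in this thesis), the Harris and Stephens corner response can be computed from image gradients as below; the constant k = 0.04 and the uniform 3×3 window are common textbook choices, not values taken from this work.

import numpy as np
from scipy.ndimage import sobel, uniform_filter

def harris_response(gray, k=0.04, window=3):
    # Image gradients.
    gray = gray.astype(float)
    ix = sobel(gray, axis=1)
    iy = sobel(gray, axis=0)
    # Structure tensor elements, accumulated over the local window.
    sxx = uniform_filter(ix * ix, window)
    syy = uniform_filter(iy * iy, window)
    sxy = uniform_filter(ix * iy, window)
    # Corner response R = det(M) - k * trace(M)^2; large positive values
    # indicate corners, large negative values indicate edges.
    return sxx * syy - sxy * sxy - k * (sxx + syy) ** 2

Thresholding this response and keeping local maxima then yields the corner features and their cornerness values discussed below.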

Every feature has an associated value of how strongly it matches the feature type; for a corner this is called the cornerness value. This is also the only thing defining a feature, which makes it quite hard to associate features from two different images taken in sequence or from different points of view. This is a fundamental problem in stereo vision, where the correspondence between the features from two cameras has to be found. A lot of work has been put into this area and several local descriptors have been developed to increase feature uniqueness. A well-known local descriptor is SIFT (Scale Invariant Feature Transform) [11], which describes a feature by looking at its environment while transforming the image to increase the robustness to changes in scale and rotation. There are also other well-known local descriptors: SURF [12], variants of SIFT, Spin-Images [13], and others, each trying to improve the most important properties of these methods: speed, repeatability and descriptor uniqueness [14].


Figure 2.14: An example of the Harris and Stephens combined corner and edge detector output from a stereo camera, overlaid on the source images.


2.2.5 Stereo vision

In computer vision, disparity is used as the only cue to determine depth, unlike humans, who also use knowledge of the scale of an object, the position of the muscles that move our eyes, as well as knowledge of focus, i.e. how the lenses are shaped to attain focus.

Photographing a view from two different perspectives allows for estimation of the distance to an object, either by area-based correlation and depth maps or by feature-based correlation and landmark triangulation. The relationship between a point in a stereo image pair and the environment can, by considering a pin-hole camera model (a perfect projection of the world onto an image), be described with equations 2.6 through 2.8.

d = \frac{f b}{Z} \quad (2.6)

p_x = \frac{f X}{Z} \quad (2.7)

p_y = \frac{f Y}{Z} \quad (2.8)

Here d is the disparity (separation on the image plane), described by the distance between Il and Ir in figure 2.15, b is the baseline (the separation between the two cameras), f is the focal length of the camera lenses, and X, Y and Z are the 3D space coordinates relative to the camera. px and py are the image coordinates in the reference camera, often the left one. This requires that the cameras are separated only by the baseline b, either vertically or horizontally.
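
Rearranging equations 2.6 through 2.8 gives the 3D position of a landmark directly from its image coordinates and disparity. The following sketch assumes rectified, horizontally separated cameras; the function and the example numbers are purely illustrative.

def triangulate(px, py, d, f, b):
    # Z = f*b/d (eq. 2.6), then X and Y from eqs. 2.7 and 2.8.
    if d <= 0:
        raise ValueError("disparity must be positive")
    z = f * b / d
    x = px * z / f
    y = py * z / f
    return x, y, z

With, for example, f = 500 pixels, b = 0.12 m and a feature seen at disparity d = 10 pixels, the landmark lies at Z = 6 m in front of the reference camera.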

Figure 2.15: An illustration of the relationship between a landmark and a stereo camera.

The depth resolution is high at close range and degrades rapidly (roughly quadratically) with distance. The accuracy degradation is illustrated in figure 2.16. An image feature appears at a discrete location on the image sensor; the modelled error in the figure corresponds to a difference of plus/minus half a pixel. The error at farther distances can be several metres, while at shorter range it is a matter of centimetres.
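
The effect of the plus/minus half-pixel error can be illustrated with a small calculation based on equation 2.6, again with assumed example values f = 500 pixels and b = 0.12 m rather than any configuration used later in this thesis:

def depth_interval(d, f=500.0, b=0.12, err=0.5):
    # Depth computed for disparities d + err, d and d - err (eq. 2.6).
    return f * b / (d + err), f * b / d, f * b / (d - err)

# At d = 10 pixels the nominal depth of 6.0 m lies in roughly [5.7, 6.3] m,
# while at d = 2 pixels the nominal 30 m lies in [24, 40] m.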


Figure 2.16: The degradation of the distance accuracy, illustrated as vertical error bars on a plot of distance (Z) in metres against disparity in pixels. It is obvious that the accuracy degrades greatly with the distance to the object.

2.3 Hardware support for heavy computation tasks

Moore's law suggests that every two years, double the number of transistors can be placed on an Integrated Circuit (IC) inexpensively.

Computational power is a never-ending story: the more we get, the more we need. We are constantly coming up with new ideas for things we could do with computers if there were enough computing power. One area is the mobile phone market, where the trend is smarter phones with a multitude of applications and high-performance games that not so many years ago required a quite fast desktop computer with a high-end graphics card. This is something we carry around in our pockets that originally was designed to make phone calls and later to send simple text messages. Today we navigate using the Global Positioning System (GPS), check what our friends are doing, take photographs and instantly publish them online, play advanced games and read the newspaper on them.

As described in the previous sections, image processing is a computationally demanding task and it still requires fairly hefty hardware to be able to process the increasing amount of data at increasingly higher frame rates.

All processing platforms are designed to make as many calculations as possible under varying demands on power efficiency. Today the trend is of course mobility, but also green computing, both of which strive to increase the number of computations per unit of power.

There are a number of different architectures, e.g. DSPs, Single Instruction Multiple Data (SIMD) and Multiple Instructions Multiple Data (MIMD) CPUs, GPUs, FPGAs, heterogeneous processors and historically interesting architectures like systolic arrays.

2.3.1 Digital Signal Processors

The DSP is designed for (as the name suggests) processing digital signals at high speed. DSPs have historically been the natural choice for image processing tasks.

The DSP is designed for signal processing utilising Very Long Instruction Words (VLIW), 256 bits wide, to supply 32-bit instructions to up to eight function units every clock cycle (TMS320). Two of the function units are for data addressing, two execute multiplications, and four execute arithmetic, logic and branch instructions. The popular TMS320 DSP family was introduced in 1983 and has since been further developed; it has also been incorporated in system-on-chip devices used in embedded devices as well as cell phones. As an example, the TMS320C6416T supports up to 8000 MIPS at a rate of eight 32-bit instructions per clock cycle and 28 operations per clock cycle. The core power requirement is around 1 W for a 1 GHz version, not considering additional I/O power. The VLIW architecture and the power efficiency make the DSP a good choice for embedded systems that perform signal processing tasks.

2.3.2 Vector processors

Vector or array processors are sometimes used as coprocessors, offloading the main computer, which in these cases works as a control device rather than a data cruncher [15]. Vector processors operate on vectors of different sizes as opposed to scalar numbers. This architecture has to some extent been adopted in newer CPUs with the introduction of SIMD instructions such as Intel AVX, MMX and SSEn, PowerPC AltiVec and ARM NEON.



2.3.3 Systolic array processors

Systolic arrays are another architecture family, designed as a matrix of processing elements (PEs) with data communication registers for communication between neighbouring PEs. Data flows through the matrix, entering from two sides, and the result flows out at the two remaining sides. The name of this architecture family comes from the fact that data flows through the processor similarly to how blood flows in the human body. This is a very efficient architecture for vector and matrix operations.
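
The data flow of a systolic array can be mimicked in software. The sketch below simulates an N x N array computing a matrix product: each PE holds an accumulator, multiplies the operands arriving from its left and upper neighbours, and passes them on. This is a textbook formulation written only to illustrate the flow of data through the PE grid, not an implementation used in this work.

    #define N 3

    /* Software simulation of an N x N systolic array computing C = A * B.
     * Each PE multiplies the operand arriving from the left (a) with the one
     * arriving from above (b), accumulates, and forwards the operands to its
     * right and lower neighbours. Meant to illustrate data flow, not speed. */
    void systolic_matmul(const int A[N][N], const int B[N][N], int C[N][N])
    {
        int acc[N][N]  = {{0}};   /* accumulator inside each PE      */
        int a_in[N][N] = {{0}};   /* operand register, flows right   */
        int b_in[N][N] = {{0}};   /* operand register, flows down    */

        for (int t = 0; t < 3 * N - 2; ++t) {          /* global clock ticks */
            /* shift operands one step right / down (back to front) */
            for (int i = N - 1; i >= 0; --i)
                for (int j = N - 1; j >= 0; --j) {
                    a_in[i][j] = (j == 0) ? 0 : a_in[i][j - 1];
                    b_in[i][j] = (i == 0) ? 0 : b_in[i - 1][j];
                }
            /* feed skewed input at the left and top edges */
            for (int i = 0; i < N; ++i) {
                int k = t - i;
                a_in[i][0] = (k >= 0 && k < N) ? A[i][k] : 0;
                b_in[0][i] = (k >= 0 && k < N) ? B[k][i] : 0;
            }
            /* every PE multiplies and accumulates "in parallel" */
            for (int i = 0; i < N; ++i)
                for (int j = 0; j < N; ++j)
                    acc[i][j] += a_in[i][j] * b_in[i][j];
        }
        for (int i = 0; i < N; ++i)
            for (int j = 0; j < N; ++j)
                C[i][j] = acc[i][j];
    }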

2.3.4 Asymmetric multicore processors

The Cell processor is an asymmetric multicore processor jointly developed by Sony, IBM and Toshiba. It consists of a main core, a Power Architecture core called the Power Processing Element (PPE), and eight additional coprocessors called Synergistic Processing Elements (SPEs); the main task of the PPE is to distribute the workload over the SPEs, which are optimized for data processing [16]. The performance is about 256 GFLOPS, single precision. Figure 2.17 depicts a Cell processor die.

The Sony PlayStation 3 uses such a processor.

Figure 2.17: A photo of a Cell processor die; the eight SPEs are located to the right and the left part contains the Power Processor and control logic. (Courtesy of International Business Machines Corporation, © International Business Machines Corporation.)


2.3.5 General Purpose GPUs (GPGPU)

A modern GPU is capable of processing a few thousand GFLOPS (billion floating-point operations per second) at the cost of two to three hundred watts of power.

The GPU has developed from being a very specialised device for processing pixels and vertices to a generic architecture, the GPGPU. It is constructed as a number of multiprocessors, which in turn consist of a number of processing elements. A modern GPU contains hundreds of processing elements. The design of the GPU makes it well suited for running the same computation on large amounts of data, which is the case in image processing applications. However, the programmer may run into problems concerning data sharing and execution synchronisation, which can drastically reduce performance.

Since the GPU is merely a coprocessor, a CPU is required both for feeding data and instructions to the GPU and for reading back the result. The programmer must take care to utilise the processing elements efficiently and to hide the relatively high GPU memory latency.

2.3.6 Field Programmable Gate Arrays

An FPGA is a cell grid of logic cells, memories, multipliers/DSP elements and a programmable switching network. Both the logic and the switching network are reconfigurable. An FPGA device can be configured in a way that allows for concurrent computation on data. The resources are, per device, fixed and limited.

FPGA manufacturers offer a wide range of devices with different sizes, characteristics and properties. FPGA architectures (configurations) can be described using a Hardware Description Language (HDL) (e.g. VHDL, Verilog), a high-level language (e.g. Handel-C, SA-C), as well as graphical drag-and-drop programming methods.

Modern FPGAs also incorporate DSP blocks, which are beneficial for signal processing applications, e.g. image processing. A typical low-cost FPGA contains 2-3 million system gate equivalents and approximately 100 DSP elements. Each DSP element supports one multiply-and-add operation per clock cycle. The FPGA itself runs at clock frequencies orders of magnitude lower than a computer, but can perform orders of magnitude faster due to the high level of parallelisation that can be achieved, both temporal (pipelining) and spatial. Field programmable gate arrays have been around for some time but are still under active development, with improving power efficiency, size/price ratio and improved technology.



Figure 2.18: An illustration of an FPGA, showing the configurable logic cells, the configurable routing net and I/O buffers.


2.4 FPGA for (stereo-) vision applications

There are a few reasons why using an FPGA can be advantageous: an FPGA can be orders of magnitude faster than a high-end PC, it is almost as flexible as a PC, and its power consumption is very low compared to its performance. As Sirowy et al. [17] write, a typical CPU must fetch an instruction from memory, execute the instruction, and store the result in memory. For an FPGA, on the other hand, the configuration is the instructions. This is one major reason why an FPGA is so fast; another major reason is the possibility of extreme parallelism.

The flexibility of the FPGA comes from the ease of reconfiguration and the wide range of tools available for describing the configuration. There are languages designed specifically for this purpose, but also tools based on more general languages commonly used for writing PC programs, e.g. variants of C, Python and Haskell. An FPGA is also very power efficient; Bonato et al. [18] make a comparison between an FPGA, a Pentium M and an ARM processor. The FPGA uses 1.3% and 12.3% of the power required by the Pentium M and the ARM, respectively, per data unit. There are of course drawbacks, a few being: using an FPGA efficiently requires different skills than software programming, resources are strictly limited, and resources are device dependent, so the developer must consider the device in the design.

Developing algorithms for vision applications is probably a lot easier in software, and modern maths tools often have vision libraries that improve development speed so that the developer can focus on developing new algorithms or systems. However, in production equipment there are a number of factors that need to be considered, such as computational power, component cost and power consumption, which also affect system cost.

Given the rapid development pace of electronic equipment today, the FPGA suits both industrial equipment and consumer products very well. Since the FPGA can be reconfigured, a device can be upgraded with new or improved functionality by altering its FPGA configuration, which in consumer devices is often referred to as firmware.

The strongest aspect of the FPGA is, however, the possibility to implement virtually unlimited instruction parallelism, together with the fact that the data path is the instructions, which in software must be fetched from memory. This is further described by Sirowy et al. [17].

FPGAs are not always the best platform; for high-level image algorithms, which are typically more complex and operate at fairly low data rates, a microprocessor or computer is probably a better platform. A modern (laptop or desktop) computer has large data caches and an unmatched clock frequency, approximately one order of magnitude higher than on an FPGA.

An FPGA does not natively support division, which requires the developer to incorporate division components or utilise shift operators to realise division with a power-of-two denominator. This is probably one of the more common problems that a developer constantly has to deal with. The incorporation of a divisor component means a sacrifice of hardware resources and/or a number of clock cycles of delay. This is especially important when designing for small FPGAs and CPLDs.
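
As an illustration of the shift-based alternative, division by a power of two can be written as a right shift, optionally with a rounding term, which is exactly the trick an FPGA design uses to avoid instantiating a divider. The C sketch below shows the idea for unsigned operands, where the shift semantics are straightforward:

    /* Division by 2^k for unsigned values, as it would typically be realised
     * in hardware: a plain right shift, optionally with rounding to nearest
     * by first adding half of the divisor. */
    unsigned div_pow2_trunc(unsigned x, unsigned k)
    {
        return x >> k;                        /* floor(x / 2^k) */
    }

    unsigned div_pow2_round(unsigned x, unsigned k)
    {
        return (x + (1u << (k - 1))) >> k;    /* round(x / 2^k), k >= 1 */
    }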

2.4.1 Designing FPGA architectures (What they need)

Designing an FPGA architecture is similar to designing software. The architecture itself is, however, very different from software. The design is often data centric, where data flows through a number of components that operate on the data, possibly with some local storage for performing operations on a group of data, e.g. window operators.
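
To make the notion of a window operator with local storage concrete, the sketch below applies a 3x3 mean filter to a greyscale image held in a plain array. In an FPGA realisation the two previous image rows would sit in line buffers (block RAM) so that the window can be formed while pixels stream through, but the arithmetic is the same. The function and image layout are illustrative assumptions, not code from the thesis system.

    /* 3x3 mean filter over an 8-bit greyscale image (row-major, width w, height h).
     * Border pixels are simply copied. In hardware the division by 9 would
     * typically be replaced by a multiply-and-shift approximation. */
    void mean3x3(const unsigned char *in, unsigned char *out, int w, int h)
    {
        for (int y = 0; y < h; ++y)
            for (int x = 0; x < w; ++x) {
                if (x == 0 || y == 0 || x == w - 1 || y == h - 1) {
                    out[y * w + x] = in[y * w + x];
                    continue;
                }
                int sum = 0;
                for (int dy = -1; dy <= 1; ++dy)
                    for (int dx = -1; dx <= 1; ++dx)
                        sum += in[(y + dy) * w + (x + dx)];
                out[y * w + x] = (unsigned char)(sum / 9);
            }
    }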

Usually the design work starts at a conceptual level, where an algorithm is developed in a scripting tool like Matlab or similar that is suitable for the task.



Then the algorithm is adapted to run on an FPGA, often with simplifications in the calculations to save resources on the FPGA. The FPGA natively supports addition (subtraction), multiplication and bitwise logic operations. Division and square root, for example, require a larger amount of logic to execute and are often replaced with simplifications: a division with a divisor that is a power of two can be implemented as a shift operation, and some constant divisions can be altered to the closest power of two.
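
Square root is a similar case: rather than instantiating a full square-root core, a bit-serial (shift-and-subtract) integer square root can be used, since it needs only additions, subtractions, comparisons and shifts. The C sketch below shows the classic algorithm; it is only meant to illustrate the kind of simplification referred to, not any particular implementation from the appended papers.

    /* Bit-serial integer square root (shift-and-subtract), returning
     * floor(sqrt(x)). Uses only add, subtract, compare and shift, which is
     * why this style maps well onto FPGA logic. */
    unsigned isqrt(unsigned x)
    {
        unsigned res = 0;
        unsigned bit = 1u << 30;   /* highest power of four that fits in 32 bits */

        while (bit > x)
            bit >>= 2;
        while (bit != 0) {
            if (x >= res + bit) {
                x   -= res + bit;
                res  = (res >> 1) + bit;
            } else {
                res >>= 1;
            }
            bit >>= 2;
        }
        return res;
    }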

When an adapted algorithm has been implemented in software and the results are judged sufficiently good, it can be transferred to the target platform. Rewriting a sequential algorithm for a parallel architecture is an error-prone process. The problem has been acknowledged and new languages have evolved, based on the C language with extensions supporting hardware-specific properties, e.g. the bit length of signals ("variables"). Two such examples are Handel-C [19] and SA-C [20].

2.5 Embedded FPGA based vision system

For embedded vision systems a key factor is power consumption, and for real-time vision applications of course also the performance, with respect to the algorithms that have to be run.

There are a number of embedded vision systems designed with different main priorities, e.g. cost, power usage, power efficiency or performance.

Rowe et al. [21] describe a low-cost monocular vision system based on a single CMOS colour camera and an SX52 RISC microcontroller. The system supports 352×288 pixel resolution with a maximum frame rate of 50 fps.

Khaleghi et al. [22] describe a small form factor (5×5 cm) stereo camera system based on two tiny CMOS camera sensors and a state-of-the-art embedded processor (dual Blackfin cores at 600 MHz). The system operates at 20 fps and consumes only 2.3 W.

Sawasaki et al. [23] describe a six-camera vision system for robot navigation based on a combined DSP and FPGA board; a systolic array of 64 processing elements was implemented on the FPGA, which worked as a coprocessor for the DSP. The data from two of the six cameras are selectively input to the image processor via a camera switch, effectively providing alternative baselines. The system operates at 30 fps and consumes 10 W.

As of today, many image processing implementations rely on powerful computers, in some cases in combination with an FPGA that is used foremost for lower-level processing or as a coprocessor. But there are also some standalone vision systems based on FPGAs.

We suggest a system that has multifaceted application areas. A block diagram that gives a system overview can be seen in figure 2.19.

Figure 2.19: Two-Camera system block diagram

The basis is two CMOS camera chips that are directly connected to the FPGA, giving the FPGA full control of the cameras. One of the cameras is located on a break-off board that can be removed for a slimmer size (see figure 2.20).

The FPGA is in turn connected to an on-board Ethernet switch through a PHY-layer chip, so that the system can, in its simplest form, be used as a stereo or mono camera that dumps images to a host system over Ethernet. It can also be a standalone system with an on-board Q7 module that can take part in processing the images at a higher level, or utilise results produced by the FPGA to perform higher-level cognitive tasks, control a robot, provide information to a truck or car driver, and more. The board also incorporates an SD-card reader, USB and LVDS connectivity.



2.6 Component based software development for hybrid FPGA systems

In the software engineering community a new paradigm has evolved in recent years: CBSE. According to Crnkovic and Larsson [24], the major goals of CBSE are:

• To provide support for the development of systems as assemblies of components.

• To support the development of components as reusable entities.

• To facilitate the maintenance and upgrading of systems by customizing and replacing their components.

These are fundamental ideas that, I believe, can be applied in any system, whether it is software, FPGA architectures, or electrical components. To manage component based development in FPGA architectures, we believe that a number of rules need to be applied in component design.

Figure 2.20: A photo of the Two-Camera system developed in our research group


• Minimal signal interfaces

• Self contained components

• Provider / Consumer hierarchy

• Proxies for type conversion as component "glue" (a sketch of such a proxy follows the list), for example:

– RGB to HSI

– RGB to CMYK

– RGB to bin
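
As a concrete, hedged example of such a conversion proxy, the snippet below turns an RGB pixel into a greyscale value using integer weights and then thresholds it to a binary value (roughly the "RGB to bin" case). The weights and the threshold are common textbook choices, not values taken from this work; in an FPGA the same arithmetic would form a small self-contained component with pixel-wide input and output interfaces.

    /* Conversion proxy: RGB -> greyscale -> binary.
     * Integer approximation of the usual luminance weights (0.299, 0.587,
     * 0.114) scaled by 256, so only multiplies, adds and a shift are needed,
     * no division. The threshold is an illustrative value. */
    unsigned char rgb_to_grey(unsigned char r, unsigned char g, unsigned char b)
    {
        return (unsigned char)((77 * r + 150 * g + 29 * b) >> 8);
    }

    unsigned char rgb_to_bin(unsigned char r, unsigned char g, unsigned char b)
    {
        const unsigned char threshold = 128;   /* assumed threshold */
        return rgb_to_grey(r, g, b) >= threshold ? 1 : 0;
    }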

Developing FPGA architectures using an HDL is done through entity specifications, where each entity is a small component (not to be confused with a system component as in CBSE). Designing in an HDL has been compared to writing software in assembly languages [25]; it takes time and it is easy to make mistakes with the traditional 'data-flow' design process, which is further supported by Gaisler in [26]. Gaisler promotes a two-process design methodology that is supposed to improve:

• Development time (writing code, debugging)

• Simulation and synthesis speed

• Readability

• Maintenance and re–use

As in any software design project, regardless of the language used, the importance of consistency, structure and documentation cannot be stressed enough.

2.7 Simultaneous Localization And Mapping (SLAM)

Navigation is a fundamental functionality in mobile robotics. This is an area in robotics that has been thoroughly researched, with a number of developed algorithms. In the beginning there were two categories of SLAM algorithms, based on either the Kalman filter or particle filters.

An important step in most vision systems is the correspondence problem, and the same goes for SLAM systems: landmarks seen at any given time must be matched with landmarks in the map to perform an update of the ego-motion.



This poses a huge problem in kidnapping situations, when the robot is blindfolded and moved to a new location, where basically the whole map must be searched to find the best possible match.

Encoder based odometry is probably the most common way to determine the motion of a robot. On omnidirectional and differential drive robots, slip is a problem that affects the estimation of the robot motion. Over time, encoder based odometry is therefore not a reliable method for determining the robot motion. For short motions it is, however, relatively reliable, which is why encoders are still commonly used as a complementary sensor.
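
For reference, the dead-reckoning update for a differential drive robot is a few lines of arithmetic: the distance travelled by each wheel is obtained from the encoder ticks, and the pose is integrated from the average travel and the travel difference over the wheel base. The sketch below is a textbook formulation with illustrative names, not odometry code from any robot used in the papers.

    #include <math.h>

    /* Differential drive odometry: integrate the pose (x, y, theta) from the
     * distances travelled by the left and right wheels since the last update.
     * wheel_base is the distance between the wheels. Textbook formulation. */
    typedef struct { double x, y, theta; } pose_t;

    void odometry_update(pose_t *p, double d_left, double d_right, double wheel_base)
    {
        double d_center = 0.5 * (d_left + d_right);         /* forward travel */
        double d_theta  = (d_right - d_left) / wheel_base;   /* heading change */

        /* integrate using the heading halfway through the motion */
        p->x     += d_center * cos(p->theta + 0.5 * d_theta);
        p->y     += d_center * sin(p->theta + 0.5 * d_theta);
        p->theta += d_theta;
    }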

In robot navigation, vision is a commonly used sensor for detecting objects or features in the environment and their position relative to the robot's position. Features in the environment can be used as landmarks to build a map and then recover the current position with respect to the map. This is known as SLAM, simultaneous localization and mapping.

A vision system has both the drawback and the benefit of being information dense. The information density provides the opportunity to select different properties from an image and extract those most beneficial for the task at hand. Some commonly used properties of images are corners, edges and intensity regions.

The negative aspect of vision is that it requires substantial processing of the image before the data can be used. In many applications where vision is used, the desired information from the image is depth and direction. To be able to extract the depth of a feature, at least two images are required and the correspondence between the image data has to be found, further increasing the amount of processing required.
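
One simple way to establish such a correspondence for a rectified stereo pair is block matching with a sum-of-absolute-differences (SAD) cost: for a point at (x, y) in the left image, slide a small window along the same row of the right image and keep the horizontal offset with the lowest cost. The sketch below is a generic formulation of that idea, not the matching strategy used in the appended papers.

    #include <stdlib.h>

    /* SAD between a (2*half+1)^2 window centred at (x, y) in the left image
     * and at (x - d, y) in the right image. Images are 8-bit greyscale,
     * row-major, width w. No border checking is done in this sketch. */
    static int sad_cost(const unsigned char *left, const unsigned char *right,
                        int w, int x, int y, int d, int half)
    {
        int cost = 0;
        for (int dy = -half; dy <= half; ++dy)
            for (int dx = -half; dx <= half; ++dx)
                cost += abs(left[(y + dy) * w + (x + dx)] -
                            right[(y + dy) * w + (x + dx - d)]);
        return cost;
    }

    /* Return the disparity (0..max_d) with the smallest SAD cost for a point
     * at (x, y): generic block matching for rectified images. */
    int best_disparity(const unsigned char *left, const unsigned char *right,
                       int w, int x, int y, int max_d, int half)
    {
        int best_d = 0;
        int best_cost = sad_cost(left, right, w, x, y, 0, half);
        for (int d = 1; d <= max_d; ++d) {
            int c = sad_cost(left, right, w, x, y, d, half);
            if (c < best_cost) { best_cost = c; best_d = d; }
        }
        return best_d;
    }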

To do real-time vision at high frame rates, it is necessary to use efficient algorithms and/or one or more high-performance computing devices.

2.8 Summary

Robotics is the science of autonomous machines that interact physically with the environment after sensing it and carefully planning the physical actions. Sensors are central for a robot; vision is very important for humans and can be equally important for a robot. Vision does, however, pose a problem due to the large amount of data and the advanced algorithms required to extract essential information from the images.

Navigation is vital for many mobile robot applications, and vision based navigation would probably involve tasks such as image capture, image colour enhancement, distortion correction, feature extraction, (descriptor calculation,) stereo matching, triangulation, pose estimation and map update. And all this at as high a rate as possible, to increase precision and to enable faster motion.

Matching the amount of calculations to the demanded update frequency can be a problem; sometimes a combination of computing units is used, e.g. DSP, ASIC, FPGA, CPU or GPU. For example, an FPGA can be used for the low-level image processing, where the algorithms are still fairly simple and the amount of data is large, and a CPU/GPU combination for the higher-level tasks, which involve more advanced algorithms.

So there are a number of tasks that need to be performed, which can be translated into software/hardware components. I suggest that each algorithm should be implemented as a self-contained component that receives input of a well-defined type and produces a result of a well-defined type with deterministic performance. The actual implementation is not important. This allows a system to be designed at system level, and components can be exchanged as long as they conform to the input/output types and can match the required performance.
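
In software such a contract could look like the hypothetical interface below: the component advertises its input and output types and a worst-case execution time, and the rest of the system interacts with it only through that interface. All names are illustrative; this thesis does not define such an API.

    /* Hypothetical component contract: well-defined input/output types and a
     * declared worst-case execution time, independent of how the component is
     * implemented (software, FPGA, GPU, ...). All names are illustrative. */
    typedef enum { DATA_IMAGE_GREY, DATA_CORNER_LIST, DATA_LANDMARK_3D } data_type_t;

    typedef struct {
        data_type_t input_type;    /* type of data the component consumes     */
        data_type_t output_type;   /* type of data the component produces     */
        unsigned    wcet_us;       /* declared worst-case execution time [us] */
        /* process one unit of input into one unit of output */
        int (*process)(const void *input, void *output);
    } component_t;

    /* Two components may be connected only if their types match. */
    static int can_connect(const component_t *producer, const component_t *consumer)
    {
        return producer->output_type == consumer->input_type;
    }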

As seen in figure 2.21, there are a number of tasks to be executed, and each task should be implemented as a component as described above. Each task implementation is mapped to a computing device, but could be exchanged for an alternative implementation, either for the same device or for a different one.

Communication overhead should be minimized since it can be a bottleneck, and communicating data back and forth between devices should be avoided for large data sets.

Figure 2.21: An illustration of the tasks involved in autonomous navigation.



Chapter 3

Summary of papers and their contribution

3.1 Paper A

This paper describes a new approach to exclude image features from a feature detector while stereo matching them. The basic principle is that features are extracted from a stereo camera, and every feature from the left image is matched to every feature in the right image that could possibly be the correct match.

Every stereo pair is then triangulated to form a landmark in 3D space. The camera system, mounted on a robot, is then moved and the distance moved is measured using a secondary sensor, e.g. wheel encoders. At the new position the same procedure is performed with the features, and finally each 3D landmark that does not support the motion is excluded, or rather, the landmarks that support the motion are stored. The method creates a number of spurious matches, hence the name.

The algorithm is designed with an FPGA implementation in mind, where deterministic execution is vital.

3.1.1 Contribution

The contribution in this paper is a new method for evaluating the stereo matching. Traditional methods involve statistical evaluation, e.g. least squared distance and similar. Our method supports real-time performance by allowing suspension of the exclusion/inclusion algorithm, and thus allows a maximum execution time to be enforced.



The landmarks that support the motion are later used for ego-motion estimation.

I was the main author of this paper; the concept was developed by me and Fredrik Ekstrand. I developed software for interfacing the FPGA system and performed the experiments.

3.2 Paper B

This paper describes controlled experiments based on the algorithm presented in paper A. A stereo camera is positioned vertically with the cameras facing the ceiling. Features are extracted, the camera is moved different distances by hand, with unknown uncertainty, and the spurious matching is executed on the data sets. The landmarks that are classified as valid are then used for ego-motion estimation, which demonstrates the sufficiency of our classification.

3.2.1 Contribution

The contribution in this paper is the experiments that validate the algorithm described in paper A.

I was the main author of this paper. The development was a proof of concept implementation in software to verify the feasibility of the concept. I developed all software for the experiments and performed them myself.

3.3 Paper C

Lens distortion correction is important in most image processing applications; it can be used both to improve the image before further processing is performed and, more importantly, to correct the mapping between image space and the environment. This paper describes two simplified methods that target field programmable gate array (FPGA) implementation. The methods enable stereo image feature processing to be performed in the FPGA without premature communication to a host system.


3.3.1 Contribution

The contribution in this paper is two novel methods for feature location correction designed for FPGA implementation, with different requirements on resources and precision.

I was the main author of this paper. I took part in designing the radial-only distortion correction algorithm and designed the other algorithm myself. I implemented the algorithms and performed all experiments myself. The radial-only algorithm was mainly designed and tested in software by two master's students, Robert Håkman Grann and Kenneth Ericsson.



Chapter 4

Conclusions and Future work

4.1 Conclusions

Robots are extremely interesting, especially as a concept. The idea of a mechanical device that can think and act similarly to a human, or at least an animal, is staggering. Even though a modern robot cannot be completely disguised as a human, research has come a long way and there is, as described earlier in this thesis, already a wide variety of applications where robots are used.

Most robots still use simple sensors for navigation and other complex tasks; one reason is of course price, but another is the complexity of more versatile sensors like cameras. An important step towards enabling vision systems for more robot applications is to find better hardware for image processing tasks. FPGAs have the potential to be that hardware, but more work in the area is needed. There need to be development models that can shorten development time through increased quality. There will always be a demand for resource-efficient solutions so that more algorithms can be put on a device.

Already today it is almost as simple to work with FPGAs as it is to work with embedded computer systems. This thesis discusses solutions to important vision algorithms and how they can be implemented in an FPGA, e.g. distortion correction. In the future there will be design methods and automated tools that do the bulk of the work. Vision is the future for robots, and the future is now.



4.2 Future work

My strong belief is that the CBSE paradigm can be adopted to improve FPGA systems development. The CBSE paradigm is very interesting, with a lot of ongoing research, and it has already been adopted in industry. It would be interesting to do research on component based systems engineering for heterogeneous systems, with different kinds of computing devices.

More urgent future work involves hardware abstraction layers: a layer that hides system details such as physical component communication, e.g. Ethernet PHY-component configuration and communication, from the main system.

More algorithms need to be implemented for our system. Interesting work in progress is a feature detector that combines a few standard feature detectors running in parallel. By combining the results from the feature detectors, features may be defined better.

Another goal is to implement a producer-subscriber system where the camera system provides a number of different data types, e.g. Harris corners, Sobel edges and 3D landmarks. A computer may then subscribe to one or more of these data streams as it pleases. A producer-subscriber design allows multiple subscribers to the same information; for example, one system could subscribe to the landmark stream to run a navigation algorithm, while another system could subscribe to the Harris corner stream to run some optical flow algorithm.
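
A minimal sketch of how such subscriptions could be represented on the host side is given below. The stream identifiers, the callback style and the fixed-size registry are assumptions made for illustration, not a specification of the planned system.

    /* Hypothetical producer-subscriber registration on the host side.
     * Stream identifiers and the callback signature are illustrative only. */
    typedef enum { STREAM_HARRIS_CORNERS, STREAM_SOBEL_EDGES, STREAM_LANDMARKS_3D } stream_id_t;

    typedef void (*stream_callback_t)(const void *data, unsigned length, void *user);

    typedef struct {
        stream_id_t       id;        /* which data stream to receive        */
        stream_callback_t callback;  /* invoked for every produced data set */
        void             *user;      /* opaque pointer passed to callback   */
    } subscription_t;

    #define MAX_SUBSCRIPTIONS 16

    static subscription_t registry[MAX_SUBSCRIPTIONS];
    static int n_subs = 0;

    /* Register a subscription; returns 0 on success, -1 if the table is full.
     * Several subscribers may register for the same stream. */
    int subscribe(const subscription_t *sub)
    {
        if (n_subs >= MAX_SUBSCRIPTIONS)
            return -1;
        registry[n_subs++] = *sub;
        return 0;
    }

    /* Called on the producer side when new data for a stream is available. */
    void publish(stream_id_t id, const void *data, unsigned length)
    {
        for (int i = 0; i < n_subs; ++i)
            if (registry[i].id == id)
                registry[i].callback(data, length, registry[i].user);
    }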

Bibliography

[1] Jörgen Lidholm, Fredrik Ekstrand, and Lars Asplund. Two camera system for robot applications; navigation. In Emerging Technologies and Factory Automation, 2008. ETFA 2008. IEEE International Conference on, pages 345–352, September 2008.

[2] Jörgen Lidholm, Giacomo Spampinato, and Lars Asplund. Validation of stereo matching for robot navigation. In 14th IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2009, September 2009.

[3] Fredrik Ekstrand, Jörgen Lidholm, and Lars Asplund. Robotics for SMEs - 3D vision in real-time for navigation and object recognition. In 39th International Symposium on Robotics (ISR 2008), pages 70–75, October 2008.

[4] Giacomo Spampinato, Jörgen Lidholm, Fredrik Ekstrand, and Lars Asplund. Stereo vision based navigation for automated vehicles in industry. In 14th IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2009, September 2009.

[5] Giacomo Spampinato, Jörgen Lidholm, Carl Ahlberg, Fredrik Ekstrand, Mikael Ekström, and Lars Asplund. An embedded stereo vision module for 6D pose estimation and mapping. In 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, September 2011.

[6] Jean-Yves Bouguet. Camera calibration toolbox for Matlab. Technical report, http://www.vision.caltech.edu/bouguetj/calib_doc, 2009 [online].

[7] Janne Heikkila and Olli Silven. A four-step camera calibration procedure with implicit image correction. In Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), pages 1106–1112, Washington, DC, USA, 1997. IEEE Computer Society.


Computer Vision and Pattern Recognition (CVPR ’97), CVPR ’97, pages1106–1112, Washington, DC, USA, 1997. IEEE Computer Society.

[8] Duane C. Brown. Close-range camera calibration. PHOTOGRAMMET-

RIC ENGINEERING, 37(8):855–866, 1971.

[9] C. Harris and M. Stephens. A combined corner and edge detection.In Proceedings of The Fourth Alvey Vision Conference, pages 147–151,1988.

[10] Hans Moravec. Towards automatic visual obstacle avoidance. In Pro-

ceedings of the 5th International Joint Conference on Artificial Intelli-

gence, page 584, August 1977.

[11] David G. Lowe. Distinctive image features from scale-invariant key-points. Int. J. Comput. Vision, 60(2):91–110, 2004.

[12] Herbert Bay, Tinne Tuytelaars, and Luc J. Van Gool. Surf: Speeded uprobust features. In Ales Leonardis, Horst Bischof, and Axel Pinz, editors,ECCV (1), volume 3951 of Lecture Notes in Computer Science, pages404–417. Springer, 2006.

[13] Andrew Johnson. Spin-Images: A Representation for 3-D Surface Match-

ing. PhD thesis, Robotics Institute, Carnegie Mellon University, Pitts-burgh, PA, August 1997.

[14] Krystian Mikolajczyk and Cordelia Schmid. A performance evaluationof local descriptors. IEEE Transactions on Pattern Analysis & Machine

Intelligence, 27(10):1615–1630, 2005.

[15] S. Ciricescu, R. Essick, B. Lucas, P. May, K. Moat, J. Norris,M. Schuette, and A. Saidi. The reconfigurable streaming vector processor(RSVPTM). In Microarchitecture, 2003. MICRO-36. Proceedings. 36th

Annual IEEE/ACM International Symposium on, pages 141 – 150, 2003.

[16] M. Gschwind, H.P. Hofstee, B. Flachs, M. Hopkin, Y. Watanabe, andT. Yamazaki. Synergistic processing in cell’s multicore architecture. Mi-

cro, IEEE, 26(2):10 –24, march-april 2006.

[17] Scott Sirowy and Alessandro Forin. Where’s the beef? why fpgas are sofast,. Technical report, Microsoft Research, 2008.

[18] Vanderlei Bonato, Eduardo Marques, and George A. Constantinides.A floating-point extended kalman filter implementation for autonomousmobile robots. J. Signal Process. Syst., 56(1):41–50, 2009.

[19] Celoxica. Handel-C Language Reference Manual, 2004.

[20] Bruce A. Draper, A. P. Willem BÃuhm, Jeff Hammes, Walid Najjar,J. Ross Beveridge, J. Ross, Charlie Ross, Monica Chawathe, Mitesh De-sai, JosÃl’ Bins, and Faculdade De InformÃatica. Compiling sa-c pro-grams to fpgas: Performance results. In In Proc. of the International

Conference on Vision Systems, pages 220–235. Springer-Verlag, 2001.

[21] A. Rowe, C. Rosenberg, and I. Nourbakhsh. A second generation low costembedded color vision system. In Computer Vision and Pattern Recogni-

tion - Workshops, 2005. CVPR Workshops. IEEE Computer Society Con-

ference on, page 136, june 2005.

[22] B. Khaleghi, S. Ahuja, and Q. Wu. An improved real-time miniaturizedembedded stereo vision system (mesvs-ii). In Computer Vision and Pat-

tern Recognition Workshops, 2008. CVPRW ’08. IEEE Computer Society

Conference on, pages 1 –8, june 2008.

[23] N. Sawasaki, M. Nakao, Y. Yamamoto, and K. Okabayashi. Embeddedvision system for mobile robot navigation. In Robotics and Automation,

2006. ICRA 2006. Proceedings 2006 IEEE International Conference on,pages 2693 –2698, may 2006.

[24] Ivica Crnkovic and Magnus Larsson. Building Reliable Component-

Based Software Systems. Artech House, Inc., Norwood, MA, USA, 2002.

[25] C. T. Johnston, K. T. Gribbon, and D. G. Bailey. Implementing ImageProcessing Algorithms on FPGAs. In Proceedings of the Eleventh Elec-

tronics New Zealand Conference, ENZCon’04, Palmerston North, pages118–123, 2004.

[26] Jiri Gaisler. A structured vhdl design method, 2004.
