learning to order objects using haptic and proprioceptive...

47
Learning to Order Objects using Haptic and Proprioceptive Exploratory Behaviors Jivko Sinapov, Priyanka Khante, Maxwell Svetlik, and Peter Stone Department of Computer Science University of Texas at Austin, Austin TX 78712, USA {jsinapov,pkhante,maxwell, pstone}@cs.utexas.edu

Upload: others

Post on 26-Feb-2021

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

Learning to Order Objects using Haptic and Proprioceptive Exploratory Behaviors

Jivko Sinapov, Priyanka Khante, Maxwell Svetlik, and Peter Stone

Department of Computer ScienceUniversity of Texas at Austin, Austin TX 78712, USA{jsinapov,pkhante,maxwell, pstone}@cs.utexas.edu

Page 2: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

2

Building-Wide Intelligence Project:http://www.cs.utexas.edu/~larg/bwi_web/

Page 3: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

3

Building-Wide Intelligence Project:http://www.cs.utexas.edu/~larg/bwi_web/

Page 4: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

4

Building-Wide Intelligence Project:http://www.cs.utexas.edu/~larg/bwi_web/

Page 5: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

5

Motivation: Grounded Language Learning

Robot, fetch me the green empty bottle

Page 6: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

6

Object Category Recognition in Robotics

Sridharan et al. 2008

Lai et al. 2011Rusu et al. 2009

Collet et al. 2009

Page 7: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

7

Object Category Learning in Robotics

Thomason, J., Sinapov, J., Svetlik, M., Stone, P., and Mooney, R. (2016).Learning Multi-Modal Grounded Linguistic Semantics by Playing I, Spy

Robotics and Vision 3 Session

Page 8: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

8

Now, when and where does this fail...

Consider the word, “weight” - how should it be grounded?

Page 9: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

9

How do humans ground such words?

Sample Montessori toys designed to teach children about the ordinal properties of object weight, height, and size

Page 10: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

10

Object Ordering in Psychology

Page 11: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

11

Object Orderings in Human Environments

Page 12: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

12

Problem Formulation

● Order Recognition: what property is a given series of objects ordered by?

“height”

“width”

Page 13: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

13

Problem Formulation (2)

● Order Insertion: given an object series, insert a new object into the correct position

series test object

Page 14: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

14

Three-Stage Approach

Stage 1: Object Exploration Stage 2: Unsupervised Order Discovery

. . . .

Stage 3: Semantic Grounding

weight width height

. . . .

Page 15: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

16

Stage 1: Object Exploration

32 common household and office items

The objects vary along three ordinal properties:

1) Weight2) Width3) Height

Page 16: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

17

Exploratory Behaviors

Page 17: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

18

Video

Page 18: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

19

Video

Page 19: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

20

Video

Page 20: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

21

Haptic and Proprioceptive Feature Extraction

Time

Join

t P

ositi

ons

(Pro

rioce

ptio

n)Jo

int

Effo

rts

(Hap

tics)

. . . . . .

Page 21: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

22

Haptic and Proprioceptive Feature Extraction

Time

Join

t P

ositi

ons

(Pro

rioce

ptio

n)Jo

int

Effo

rts

(Hap

tics)

. . . . . .

Page 22: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

23

Haptic and Proprioceptive Feature Extraction

Time

Join

t P

ositi

ons

(Pro

rioce

ptio

n)Jo

int

Effo

rts

(Hap

tics)

. . . . . .

Page 23: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

24

Stage 2: Unsupervised Order Discovery

grasp

lift

hold

lower

drop

proprioception

Beh

avio

rsSensory Modalities

push

press

haptics

Page 24: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

26

Unsupervised Order Discovery Example with Synthetic Data

Object order with highest likelihood using the method of [Kemp and Tennenbam, 2008]

Input Relational Count Matrix

Page 25: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

27

Example Relational Count Matrix with the Press action and Haptic features

Similarity between objects i and j in the press-haptic sensorimotor context

Page 26: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

28

Resulting Order (Press behavior and Haptic modality)

The number corresponds to the object's height in millimeters

Page 27: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

29

Stage 2: Unsupervised Order Discovery

grasp

lift

hold

lower

drop

proprioception

Beh

avi

ors

Sensory Modalities

push

press

haptics

Page 28: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

30

Stage 3: Order Grounding Stage

Page 29: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

31

Order Grounding Example: “height”

Positive Examples: Negative Examples:

. . .

. . .

Page 30: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

32

Object Order Representation

Training Example:

Object Orders Discovered During Stage 2

. . . .

Page 31: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

33

Object Order Representation

Training Example:

. . . .

Object Orders Discovered During Stage 2

Page 32: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

34

Object Order Representation

Training Example:

. . . .

Object Orders Discovered During Stage 2

Page 33: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

35

Object Order Representation

Training Example:

x1

. . . .

Object Orders Discovered During Stage 2

Page 34: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

36

Object Order Representation

Training Example:

x1

. . . .

Object Orders Discovered During Stage 2

Page 35: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

37

Object Order Representation

. . . .

Training Example:

x1

x2

Object Orders Discovered During Stage 2

Page 36: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

38

Object Order Representation

. . . .

Training Example:

x1

x2 . . . . x

n

Object Orders Discovered During Stage 2

Page 37: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

39

Results: Order Recognition

Page 38: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

40

Sample Learned Decision Trees

Hold Haptics

Lower Haptics

Lift Haptics

Press Proprioception

Press Haptics

Grasp Proprioception

weight width height

Page 39: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

41

When does the robot make mistakes?

Page 40: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

42

When does the robot make mistakes?

difficult easy

Page 41: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

43

When does the robot make mistakes?

difficult easy

Page 42: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

44

Object Order Insertion Results

Page 43: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

45

Object Order Insertion Results

Page 44: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

46

Object Order Insertion Results

Page 45: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

47

Object Order Insertion Results

Page 46: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

48

Conclusion

● A behavior-grounded framework for learning object ordering concepts

● The robot grounded three ordering concepts, “weight”, “height”, and “width”

● Future Work:– Active action selection

– Learn object ordering concepts in conjunction with object categories, pairwise object relations, etc.

– Learn from humans (for a preview, see our next talk at Robotics and Vision III)

Page 47: Learning to Order Objects using Haptic and Proprioceptive ...jsinapov/slides/jsinapov_slides_IJCAI_2016.pdf48 Conclusion A behavior-grounded framework for learning object ordering

49

Thank you!

Jivko Sinapov Maxwell Svetlik Peter Stone

http://www.cs.utexas.edu/~larg/bwi_web/

Priyanka Khante