analysing microarray expression data through effective ...web.cs.ucla.edu/~zaniolo/papers/analysing...

Analysing microarray expression data through effective clustering

Information Sciences, Volume […], March […], Pages […]

E. Masciari, G.M. Mazzeo, C. Zaniolo

DOI: […]/j.ins.[…]

Abstract

The recent advances in genomic technologies and the availability of large-scale microarray datasets call for the development of advanced data analysis techniques, such as data mining and statistical analysis, to cite a few. Among the mining techniques proposed so far, cluster analysis has become a standard method for the analysis of microarray expression data. It can be used both for initial screening of patients and for extraction of disease molecular signatures. Moreover, clustering can be profitably exploited to characterize genes of unknown function and to uncover patterns that can be interpreted as indications of the status of cellular processes. Finally, clustering biological data is useful not only for exploring the data but also for discovering implicit links between the objects. To this end, several clustering approaches have been proposed in order to obtain a good trade-off between accuracy and efficiency of the clustering process. In particular, great attention has been devoted to hierarchical clustering algorithms for their accuracy in unsupervised identification and stratification of groups of similar genes or patients, while partition-based approaches are exploited when fast computations are required. Indeed, it is well known that no existing clustering algorithm completely satisfies both accuracy and efficiency requirements; thus a good clustering algorithm has to be evaluated with respect to some external criteria that are independent of the metric used to compute the clusters. In this paper, we propose a clustering algorithm called M-CLUBS (for Microarray data CLustering Using Binary Splitting) that exhibits higher accuracy than the hierarchical algorithms proposed so far, while allowing a faster computation than partition-based approaches. Indeed, M-CLUBS is faster and more accurate than other algorithms, including k-means and its recently proposed refinements, as we will show in the experimental section. The algorithm consists of a divisive phase and an agglomerative phase; during these two phases, the samples are repartitioned using a least quadratic distance criterion possessing unique analytical properties that we exploit to achieve a very fast computation. M-CLUBS derives good clusters without requiring input from users, and it is robust and impervious to noise, while providing better speed and accuracy than methods, such as BIRCH, that are endowed with the same critical properties. Since microarray data are represented as arrays of numeric values, M-CLUBS is suitable for analyzing them, because it is designed to perform well for Euclidean distances. To strengthen the obtained results, the clusters were interpreted by a domain expert and evaluated with quality measures specifically tailored for biological validity assessment.

Keywords

Bioinformatics; Clustering; Biological data analysis

1. Introduction

Nowadays, microarray experiments allow the exploration of huge amounts of gene expressions using a single chip. Moreover, the relatively moderate cost of a chip and the short sample preparation times enable the analysis of a large number of different experimental conditions, such as points of time-series experiments or disease progression in a cohort of patients […].


This huge amount of data poses many challenges to the bioinformatics community, such as finding the behavior of sets of related genes in different conditions. This goal is often achieved by means of cluster analysis, i.e., the identification of similar patterns in different conditions […]. Indeed, the ability to gather genome-wide expression data has far outstripped the ability of human brains to process the raw data; thus cluster analysis can help scientists to distill the data down to a more comprehensible level by subdividing the genes into a smaller number of categories and then analyzing those […], […] and […].

Further motivation for the exploitation of cluster analysis for biological data lies in the fact that similar patterns found by clustering may correspond to co-regulation of genes […]. Moreover, cluster analysis represents a fundamental and widely used method of knowledge discovery […] due to the valuable information it can provide. In particular, the use of cluster analysis has become a standard method in the literature for the analysis of microarray expression data, used both for initial screening of patients and for extraction of molecular signatures of disease […] or feature selection […] and […]. By cluster analysis, microarray data researchers can focus on finding groups of genes that exhibit similar and coherent evolutionary patterns in a set of patients or time points. For instance, Bayesian approaches have been largely used for data analysis, but their limited scalability and efficiency prevent their use on large-scale microarray datasets […], […] and […]. Analogously, a large number of existing algorithms have been applied to microarray data, starting from well-known approaches; among those we mention here partition-based clustering (e.g., k-means […]) and its variants (e.g., fuzzy c-means […]), density-based clustering (e.g., DBScan […]), hierarchical methods (e.g., BIRCH […] and […]) and grid-based methods (e.g., STING […] and […]). In particular, agglomerative hierarchical clustering has been used to partition sets of patients into smaller groups, characterized by exploiting information on sets of genes exhibiting similar evolution with respect to a set of similar conditions (e.g., clinical conditions, time evolution or drug responses) […].

Nevertheless, the logical and algorithmic complexities of this many-faceted problem make this research activity quite intriguing. Indeed, in spite of the progress achieved in recent years (e.g., agglomerative clustering […], biclustering […], genetic algorithm based clustering […], non-metric clustering […]), significant progress should be expected in the future. In particular, it is well known that no clustering algorithm completely satisfies both accuracy and efficiency requirements; thus a good clustering algorithm has to be evaluated with respect to some external criteria that are independent of the metric used to compute the clusters. As an example, bootstrapping techniques have often been used to calculate the significance of the obtained dendrogram […].

In this paper we propose M-CLUBS, a novel algorithm that exhibits quite good performance in terms of speed, repeatability, accuracy and robustness to noise. The performance of M-CLUBS has been evaluated using widely accepted clustering validity metrics that are method independent, and thus quite reliable. The excellent performance of M-CLUBS arises from some key features of our algorithm, in particular:

• M-CLUBS is not tied to a fixed grid, differently from grid-based methods (e.g., STING […]);

• it can backtrack on previous wrong calculations, since it first performs a top-down splitting of the data and then, if needed, a bottom-up refinement of the obtained results;

• it also performs well on non-globular clusters (i.e., clusters that are not spherical in shape), differently from k-means […] and BIRCH […].

In the following, after a presentation of our method, we show the properties of M-CLUBS and finally, as proof of principle, we discuss the performance of our algorithm using some publicly available datasets.

2. Approach

In this paper we propose a new hierarchical algorithm called M-CLUBS (for Microarray data CLustering Using Binary Splitting), whose speed is better than that of k-means and whose accuracy exceeds that of previous hierarchical algorithms, while operating in a completely unsupervised fashion. The first phase of the algorithm is divisive, as the original data set is split recursively into mini-clusters through successive binary splits; the second phase is agglomerative, since these mini-clusters are recombined into the final result. Due to its features, our algorithm can also be used for refining the performance of other approaches. As an example, it can be used to overcome the initial assignment problem of k-means, since its low complexity will not affect the overall complexity, while the accuracy of our results will guarantee an excellent initial assignment of cluster centroids. Further, our approach induces during execution a dynamic hierarchical grid that fits the dataset better than classical grid approaches, which exploit a fixed grid instead. Finally, the algorithm exploits the analytical properties of the Sum of Squares (SSQ in the following) function to minimize the cost of merge and split operations, and indeed the approach turns out to be really fast. One may argue that many different measures could be used for cluster computation, but the accuracy of SSQ is as good as that of other cluster distance measures (such as Single Link, Complete Link and Average; see Section […]) for real-case scenarios, and its computation can be made faster than that of other measures.

Main difference of M-CLUBS with respect to other approaches. M-CLUBS works in a completely unsupervised way and overcomes the main limitations that beset other algorithms. In particular, we have that: (1) M-CLUBS is not tied to a fixed grid; (2) it can backtrack on previous wrong calculations; and (3) it also performs well on non-globular clusters, i.e., clusters that are not spherical in shape (this feature will be intuitively understood once the partitioning and recombination strategy is detailed in the next section). BIRCH does not perform as well, because it uses the notion of radius or diameter to control the boundary of a cluster (and the same drawback also affects k-means-like algorithms). Moreover, we have that: (4) M-CLUBS can detect the natural clusters present in the data, while in BIRCH each node of the auxiliary tree (called CF-tree) can hold only a limited number of entries due to its size, so a CF-tree node does not always correspond to what a user may consider a natural cluster. Finally, (5) density-based algorithms like DBSCAN are very sensitive to clustering parameters such as Minimum Neighborhood Points, and they fail to identify clusters if the density varies or if the data set is too sparse, and different sampling affects the density measures; for this reason we compared M-CLUBS against OPTICS, which is able to detect clusters with different densities. As the experimental evaluation will make clear, M-CLUBS does not suffer from these limitations, thanks to the unique features of SSQ and to the two-phase algorithm.

Another relevant parameter to take into account is the computational cost of the algorithms. In general, hierarchical algorithms are slower than partition-based algorithms like k-means. Indeed, k-means is really fast, but the accuracy of its results may be unsatisfactory; moreover, k-means depends on the choice of the number of clusters k and on the initial assignment of the cluster centers. In particular, a wrong choice of the initial cluster centers leads to an incorrect clustering. M-CLUBS, instead, offers good accuracy while performing a really fast computation.

3. Methods

We first recall some basic notions exploited by our algorithm, then we discuss our binary partitioning strategy and the cluster quality measures we used to evaluate the obtained results. Throughout the paper, for each dataset a $d$-dimensional data distribution $D$ is assumed. $D$ will be treated as a multi-dimensional array of integers with volume $n^d$ (without loss of generality, we assume that all dimensions of $D$ have the same size). The number of non-zero elements of $D$ will be denoted as $N$. A range $\rho_i$ on the $i$-th dimension of $D$ is an interval $[l \ldots u]$ such that $1 \le l \le u \le n$. Boundaries $l$ and $u$ of $\rho_i$ are denoted by $lb(\rho_i)$ (lower bound) and $ub(\rho_i)$ (upper bound), respectively. The size of $\rho_i$ will be denoted as $size(\rho_i) = ub(\rho_i) - lb(\rho_i) + 1$. A block $b$ (of $D$) is a $d$-tuple $\langle \rho_1, \ldots, \rho_d \rangle$ where $\rho_i$ is a range on the dimension $i$, for each $1 \le i \le d$. Informally, a block represents a "hyper-rectangular" region of $D$. A block $b$ of $D$ with all zero elements is said to be a null block. The volume of a block $b = \langle \rho_1, \ldots, \rho_d \rangle$ is given by $size(\rho_1) \times \ldots \times size(\rho_d)$ and will be denoted as $vol(b)$. Given a point in the multi-dimensional space $x = \langle x_1, \ldots, x_d \rangle$, we say that $x$ belongs to the block $b$ (written $x \in b$) if $lb(\rho_i) \le x_i \le ub(\rho_i)$ for each $i \in [1 \ldots d]$.

Given a block $b = \langle \rho_1, \ldots, \rho_d \rangle$, let $x$ be a coordinate on the $i$-th dimension of $b$ such that $lb(\rho_i) \le x < ub(\rho_i)$. Coordinate $x$ divides the range $\rho_i$ of $b$ into $\rho_i^{low} = [lb(\rho_i) \ldots x]$ and $\rho_i^{high} = [x+1 \ldots ub(\rho_i)]$, thus partitioning $b$ into $b^{low} = \langle \rho_1, \ldots, \rho_i^{low}, \ldots, \rho_d \rangle$ and $b^{high} = \langle \rho_1, \ldots, \rho_i^{high}, \ldots, \rho_d \rangle$. The pair $\langle b^{low}, b^{high} \rangle$ is said to be the binary split of $b$ along the dimension $i$ at the position $x$; dimension $i$ and coordinate $x$ are said to be the splitting dimension and the splitting position, respectively.

Informally, a binary partition can be obtained by performing a binary split on $D$ (thus generating the two sub-blocks $D^{low}$ and $D^{high}$) and then recursively partitioning these two sub-blocks with the same binary hierarchical scheme.

Definition 1.

Given a $d$-dimensional data distribution $D$ with volume $n^d$, a binary partition $BP(D)$ of $D$ is a binary tree such that the root of $BP(D)$ is the block $\langle [1 \ldots n], \ldots, [1 \ldots n] \rangle$ and, for each internal node $p$ of $BP(D)$, the pair of children of $p$ is a binary split of $p$. □

Given a dataset $DS$, cluster analysis aims at producing a clustering $C = \{C_1, \ldots, C_n\}$, that is, a subset of the set of all subsets of $DS$ such that $C$ contains disjoint (non-overlapping) subsets covering the whole object set (in this paper we refer exclusively to the hard clustering problem, where every data point belongs to one and only one cluster). Consequently, every point $x \in DS$ is contained in exactly one set $C_i$. These sets $C_i$ are called clusters.
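The block, volume, membership and binary-split notions defined above can be made concrete with a short sketch. It is only an illustration under stated assumptions: the names `size`, `vol`, `contains` and `binary_split` are hypothetical helpers (not taken from the paper), ranges are stored as inclusive `(lb, ub)` pairs, and the splitting dimension is given as a 0-based index.

```python
from typing import List, Sequence, Tuple

Range = Tuple[int, int]   # inclusive (lb, ub) bounds of one dimension
Block = List[Range]       # one range per dimension


def size(r: Range) -> int:
    """size(rho) = ub(rho) - lb(rho) + 1."""
    lb, ub = r
    return ub - lb + 1


def vol(b: Block) -> int:
    """Volume of a block: product of the sizes of its ranges."""
    v = 1
    for r in b:
        v *= size(r)
    return v


def contains(b: Block, x: Sequence[int]) -> bool:
    """x belongs to b if lb(rho_i) <= x_i <= ub(rho_i) on every dimension."""
    return all(lb <= xi <= ub for (lb, ub), xi in zip(b, x))


def binary_split(b: Block, i: int, x: int) -> Tuple[Block, Block]:
    """Split b along dimension i (0-based here) at position x, lb <= x < ub."""
    lb, ub = b[i]
    if not (lb <= x < ub):
        raise ValueError("splitting position must satisfy lb <= x < ub")
    low, high = list(b), list(b)
    low[i] = (lb, x)        # low range keeps [lb .. x]
    high[i] = (x + 1, ub)   # high range keeps [x+1 .. ub]
    return low, high


root = [(1, 8), (1, 8)]                 # the whole domain for n = 8, d = 2
low, high = binary_split(root, 0, 3)    # cut the first dimension at x = 3
```

Applying `binary_split` recursively to the resulting sub-blocks yields a binary partition in the sense of Definition 1; the two halves never overlap and their volumes add up to the volume of the parent block.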


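Definition 2.

Let $C_s$ be a cluster (set) of $N$ $d$-dimensional points. Let $S = (S_1, \ldots, S_d) = \sum_{p \in C_s} p$ be the vector representing the sum of the points in $C_s$. The center of $C_s$ is $S/N$. Let $Q = (Q_1, \ldots, Q_d)$, where $Q_i = \sum_{p \in C_s} p_i^2$, be the vector whose $i$-th coordinate is the sum of the squared $i$-th coordinates of the points in $C_s$. The SSQ (Sum of Squares) of $C_s$ is defined as:

$$SSQ(C_s) = \sum_{p \in C_s} \sum_{i=1}^{d} \left( p_i - \frac{S_i}{N} \right)^2$$

We recall that $N$ is the number of points in $C_s$ and that $\sum_{p \in C_s} p_i = S_i$; thus we obtain by substituting:

$$SSQ(C_s) = \sum_{i=1}^{d} \left( \sum_{p \in C_s} p_i^2 - 2 \frac{S_i}{N} S_i + N \frac{S_i^2}{N^2} \right)$$

Finally, by definition of $Q_i$ and $S_i$ we obtain:

$$SSQ(C_s) = \sum_{i=1}^{d} \left( Q_i - \frac{S_i^2}{N} \right)$$

From the latter it is clear that, in order to quickly compute the SSQ of a cluster, we need only to store $S$, $Q$ and $N$. In the next section we will show how this information can be used effectively and efficiently to optimize the divisive and agglomerative steps of the M-CLUBS algorithm.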
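As a quick check of the closed form, the sketch below computes the SSQ of a small cluster twice: directly from the distances to the center, and from the summary triple (N, S, Q) alone. The function names and the three sample points are illustrative assumptions, not part of the paper.

```python
def summarize(points):
    """Return (N, S, Q): count, per-dimension sums, per-dimension sums of squares."""
    d = len(points[0])
    n = len(points)
    s = [sum(p[i] for p in points) for i in range(d)]
    q = [sum(p[i] ** 2 for p in points) for i in range(d)]
    return n, s, q


def ssq_from_summary(n, s, q):
    """Closed form: sum over dimensions of Q_i - S_i^2 / N."""
    return sum(qi - si * si / n for si, qi in zip(s, q))


def ssq_direct(points):
    """Definition: sum of squared distances of the points from the center S/N."""
    n, s, _ = summarize(points)
    center = [si / n for si in s]
    return sum((p[i] - center[i]) ** 2
               for p in points for i in range(len(center)))


cluster = [(1, 2), (3, 4), (5, 0)]       # toy data, chosen only for illustration
n, s, q = summarize(cluster)             # N = 3, S = [9, 6], Q = [35, 20]
fast = ssq_from_summary(n, s, q)         # (35 - 81/3) + (20 - 36/3) = 16
slow = ssq_direct(cluster)               # same value, computed point by point
```

Because S, Q and N are plain sums, the summary of a union of clusters is the element-wise sum of their summaries, which is what makes both split and merge evaluations cheap.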

3.1. Our clustering approach

In order to obtain a good trade-off between accuracy and efficiency, we exploit in this paper a new fast hierarchical approach. Among hierarchical algorithms, bottom-up approaches tend to be more accurate, but have a higher computational cost than top-down approaches […]. The higher cost is due to the higher number of candidate clusters to be taken into account. To overcome this limitation, in our approach the agglomerative step is only used on mini-clusters generated by a first divisive process; this results in a remarkable efficiency increase. Top-down partitioning exploiting greedy algorithms has been widely used in multi-dimensional data compression due to its efficiency. Here, we use a similar divisive approach to minimize the SSQ among the data belonging to the clusters (we recall that many measures have been proposed in the literature, e.g., the error of estimates […], which works in a similar way to SSQ, but we chose SSQ since it offers a really fast computation while maintaining a high accuracy in cluster model evaluation). Thus, our clustering algorithm consists of two steps: in the first step we use binary hierarchical partitioning to produce a set of mini-clusters, and in the second step we pairwise merge the mini-clusters so obtained in a bottom-up fashion. In both steps, the clusters are defined by a hierarchical partition of the multi-dimensional space. The partition can be compactly represented by a binary tree, where: (1) each node is associated with a range of the multi-dimensional domain; (2) the root is associated with the whole data domain; (3) for each inner node n, its children are associated with a pair of ranges representing a (rectangular) partition of n.

Each node also maintains summary information about the points inside its range, to expedite the clustering computation. The top-down splitting works as follows. As auxiliary structure, we maintain a priority queue of clusters whose elements are ordered on the basis of the SSQ of each cluster. At each iteration, the algorithm performs the following two steps: (1) select the cluster $C_s$ that exhibits the highest SSQ (i.e., the one on top of the priority queue), and then (2) partition this $C_s$ in such a way that the SSQ reduction, denoted $\Delta SSQ$, is maximized. For step 2, we use formula (…) (reported next) to compute $\Delta SSQ(i, j)$ for each dimension $i$ and for each cutting position $j$; then we choose the position $j$ that guarantees the maximum $\Delta SSQ$. This computation can be done very efficiently, since we precompute $Q$ and $S$ and therefore need a single scan of the data. We repeat these two steps (1 and 2 above) while $\Delta SSQ$ is greater than the average SSQ.

We recall that the partition (i.e., the cluster tree) is built by exploiting a greedy strategy. To this end, the tree is constructed top-down, by means of leaf-node splitting. At each step, the leaf with the largest SSQ is chosen, and it is split so as to maximize the SSQ reduction, denoted $\Delta SSQ$. Since SSQ is a measure of range skewness, we perform splits as long as $\Delta SSQ$ remains "significant". After the early splits, which yield large SSQ reductions, the values of $\Delta SSQ$ become smaller and smaller until, after $n - 1$ splits, both SSQ and $\Delta SSQ$ become 0 (since each point has become its own cluster). Thus the average SSQ reduction per split is $SSQ_0 / n$, and we compare this value against the current $\Delta SSQ$ to decide when we should stop splitting. The rationale for this criterion is clearly illustrated by Fig. 1, where the typical $\Delta SSQ$ slope is displayed against the average SSQ: there is no gain in splitting beyond the turning point (marked with a solid circle), since the SSQ reduction is less than the average $\Delta SSQ$ and is thus imputable to random distributions rather than to cluster-like ones.

Fig. 1. Average SSQ and $\Delta SSQ$ example plots.
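The search for the best cut described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes that the SSQ reduction of a cut is the SSQ of the parent minus the SSQ of the two halves, it obtains the running sums by sorting the points along each dimension (the paper speaks of precomputed marginal sums on the grid), and the names `best_split` and `ssq` are hypothetical.

```python
def ssq(n, s, q):
    """SSQ from the summary (N, S, Q); an empty set has SSQ 0."""
    if n == 0:
        return 0.0
    return sum(qi - si * si / n for si, qi in zip(s, q))


def best_split(points):
    """Return (delta_ssq, dimension, position) of the best axis-parallel cut."""
    d = len(points[0])
    n = len(points)
    total_s = [sum(p[k] for p in points) for k in range(d)]
    total_q = [sum(p[k] ** 2 for p in points) for k in range(d)]
    parent = ssq(n, total_s, total_q)
    best = (0.0, None, None)
    for i in range(d):
        ordered = sorted(points, key=lambda p: p[i])
        low_n, low_s, low_q = 0, [0.0] * d, [0.0] * d
        for idx in range(n - 1):
            p = ordered[idx]
            # Move p from the high side to the low side by updating running sums.
            low_n += 1
            for k in range(d):
                low_s[k] += p[k]
                low_q[k] += p[k] ** 2
            if ordered[idx + 1][i] == p[i]:
                continue  # a cut cannot separate points with equal coordinates
            high_s = [t - l for t, l in zip(total_s, low_s)]
            high_q = [t - l for t, l in zip(total_q, low_q)]
            delta = (parent - ssq(low_n, low_s, low_q)
                     - ssq(n - low_n, high_s, high_q))
            if delta > best[0]:
                best = (delta, i, p[i])
    return best


# Toy data: two tight groups far apart along the first dimension.
data = [(0, 0), (0, 1), (10, 0), (10, 1)]
delta, dim, pos = best_split(data)
initial_ssq = ssq(len(data),
                  [sum(p[k] for p in data) for k in range(2)],
                  [sum(p[k] ** 2 for p in data) for k in range(2)])
keep_splitting = delta > initial_ssq / len(data)   # stopping rule from the text
```

On this toy input the best cut falls between the two groups on the first dimension, and its reduction is well above the average reduction per split, so the split would be accepted.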

The splitting process just described is tied to the grid partitioning and thus may cause a non-optimal splitting of some clusters. The subsequent phase overcomes this limitation, since the merging is performed by considering all the possible pairs of adjacent mini-clusters and recombining those that offer the best SSQ reduction. This agglomerative process offers significant advantages. The first advantage is that it merges clusters in different grid partitions. This backtracking step overcomes non-optimal splits obtained in the first phase, as is easy to see in Fig. […](b) and (c). The second critical advantage is that the computational complexity of this bottom-up step is very low, since the number of merging steps is related to the number of clusters, which is very low compared to usual dataset sizes. The final advantage is that this phase also halts automatically, producing an algorithm that does not require any seeding or other parameters from the user: a really nice feature that is not shared by all clustering algorithms.
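The merge selection of the agglomerative phase can be illustrated with the same (N, S, Q) summaries. The sketch below is an assumption-laden simplification: it scores every pair of mini-clusters by the SSQ increase their union would cause and picks the cheapest pair, ignoring the adjacency restriction mentioned in the text; `merge_summary`, `ssq_increase` and `select_best_pair` are hypothetical names.

```python
from itertools import combinations


def ssq(summary):
    """SSQ of a cluster given its summary (N, S, Q)."""
    n, s, q = summary
    return sum(qi - si * si / n for si, qi in zip(s, q))


def merge_summary(a, b):
    """Summary of the union of two disjoint clusters: element-wise sums."""
    (na, sa, qa), (nb, sb, qb) = a, b
    return (na + nb,
            [x + y for x, y in zip(sa, sb)],
            [x + y for x, y in zip(qa, qb)])


def ssq_increase(a, b):
    """How much the total SSQ grows if clusters a and b are merged."""
    return ssq(merge_summary(a, b)) - ssq(a) - ssq(b)


def select_best_pair(summaries):
    """Indices (i, j) of the pair whose merge yields the least SSQ increase."""
    return min(combinations(range(len(summaries)), 2),
               key=lambda ij: ssq_increase(summaries[ij[0]], summaries[ij[1]]))


# Summaries of three toy mini-clusters of two points each:
# {(0,0),(0,2)}, {(1,0),(1,2)} and {(10,0),(10,2)}.
a = (2, [0, 2], [0, 4])
b = (2, [2, 2], [2, 4])
c = (2, [20, 2], [200, 4])
pair = select_best_pair([a, b, c])
```

Here the first two mini-clusters are one unit apart while the third is far away, so the pair of the first two is selected. No point-level data is touched during the merge, which is consistent with the low cost claimed for this phase.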

Remarks about the unsupervised features of M-CLUBS. Every clustering algorithm proposed so far, despite the unsupervised nature of clustering, requires some user interaction: k-means requires an initial assignment of the centroids, and thus the expected number of clusters, which should instead be unknown a priori; hierarchical clustering algorithms require a termination condition, e.g., the desired number of clusters or the diameter of each cluster, but this information cannot be set with perfect confidence. Similar problems occur for the definition of grid cells or the exploitation of density information. M-CLUBS overcomes these limitations by working in a fully unsupervised way, which greatly decreases the complexity of the algorithm while keeping a high accuracy. Moreover, traditional approaches can never undo what was done in previous computation steps, and they are really sensitive to cluster distance measures. Finally, BIRCH performs an agglomerative step on micro-clusters, but at the later macro-clustering stage it exploits different clustering methods, such as iterative partitioning, thus mixing different strategies.

3.2. M-CLUBS: a new clustering algorithm for microarray data

Fig. 2 provides a more formal description of the M-CLUBS algorithm. Note that in the declaration steps, Vars denotes the variables used in the corresponding subroutines. We use the notation $x \cdot y$ to denote step $y$ in the subroutine invoked at step $x$ of the main. We use initializeTree (Step …) to load the dataset into the root of the auxiliary tree structure BT(D) exploited for partitioning. Once the tree structure has been initialized, topDownSplitting starts (Step …). In particular, the root of BT(D) is added to a priority queue whose ordering criterion is based on the SSQ values of the clusters stored in the queue. The initial cluster assignment, performed by initializeClusters, is composed of the root r of BT(D), and the initial SSQ is the one computed on r (Steps …). The function computeAverageDeltaSSQ averages the actual SSQ over all the points in the cluster (Step …). The function computeWeightedDeltaSSQ is (iteratively) applied to the cluster $C_s$ that is currently on top of the priority queue (Step …). The $weighted\Delta SSQ$ is computed as the average gain of SSQ obtained by splitting $C_s$, as explained above for $\Delta SSQ$, i.e., we precompute the marginal sums (S and Q) for a given splitting point (with respect to the coordinate ordering) and reassign the splitting point based on these partial sums. In order to improve the effectiveness of splits, the value of $\Delta SSQ$ is raised to a power $p$, $p < 1$, thus obtaining the $weighted\Delta SSQ$ value. If $weighted\Delta SSQ$ is greater than avgDeltaSSQ (computed by computeAverageDeltaSSQ), then we proceed with the split (Step …); otherwise we do not. We use values of $p$ that are less than 1, since for $p \ge 1$ we would end up splitting clusters where the gain does not exceed the average $\Delta SSQ$ associated with a random distribution. This would result in a large number of small clusters where both intra-cluster and inter-cluster distances are small. We instead seek values of $p$ that reduce the former while magnifying the latter. We determined that the best value is $p = 0.8$ regardless of the dataset features; thus the user is not required to set any parameter.

When no more top-down splits are possible, topDownSplitting ends and we begin bottomUpMerging (Step …). In order to obtain more compact clusters, we select (by running selectBestPair at Step …) the pair of clusters that, if merged, yields the least SSQ increase (which is assigned to minInc by the function computeSSQIncrease, run at Step …). This merging step is repeated until minInc becomes larger than avgDeltaSSQ (Steps …).

Fig. 3 shows the algorithm in action. After three steps, the initial samples in Fig. 3(a) are partitioned according to the grid shown in Fig. 3(b). The algorithm takes seven more splitting steps, producing the partition of Fig. 3(c). The merging phase produces the final five clusters that a human will instinctively recognize at a glance (Fig. 3(d)).
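The split test with the exponent p can be sketched in a few lines. This is only an illustration of the rule as stated in the text (raise the SSQ reduction to the power p = 0.8 and compare it with the average reduction); the function names and the numeric values are assumptions chosen for the example.

```python
P = 0.8  # exponent reported in the text as the best value


def weighted_delta_ssq(delta_ssq, p=P):
    """Raise the SSQ reduction of a candidate split to the power p."""
    return delta_ssq ** p


def should_split(delta_ssq, avg_delta_ssq, p=P):
    """Accept the split only if the weighted reduction exceeds the average."""
    return weighted_delta_ssq(delta_ssq, p) > avg_delta_ssq


avg = 25.25                                   # illustrative average reduction
clear_gain = should_split(100.0, avg)         # large reduction: split accepted
marginal_gain = should_split(30.0, avg)       # barely above average: rejected
unweighted = should_split(30.0, avg, p=1.0)   # with p = 1 it would be accepted
```

For reductions larger than 1, an exponent below 1 shrinks the value being compared, so only splits whose gain is clearly above the average pass the test. This matches the stated aim of avoiding many small clusters produced by marginal gains.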

Fig. 2. The M-CLUBS clustering algorithm.

Fig. 3. Execution steps of M-CLUBS.

We point out that producing axis-parallel cuts is not a limitation: our approach could also produce non-parallel cuts, but this would not improve the performance of the algorithm. Furthermore, grid-based approaches are also tied to parallel cuts, since these allow a more efficient computation without any loss of accuracy.

Due to its grid-based divisive-agglomerative approach, M-CLUBS produces cluster results of superior quality. Furthermore, the algorithm can be executed with superior time and accuracy performance by using a simple and fast formula that allows us to estimate the SSQ reduction produced by a split.

The approach described above is really efficient; in this respect, we now discuss its computational complexity. We point out that algorithms exhibiting low computational complexity (while preserving result accuracy) are particularly suited for biological data analysis, due to the large size and high dimensionality of the available datasets. The following proposition states our worst-case complexity.

)GDEDH>I>DC���)GDEDH>I>DC���

�A<DG>I=B�&�%.�,�LDG@H�>C�� L=:G:�C�>H�I=:�CJB7:G�D;�ED>CIH�9�>H�I=:�CJB7:G�D;�9>B:CH>DCH�A�>HI=:�CJB7:G�D;�HEA>II>C<�EDH>I>DCH�;DG�:68=�9>B:CH>DC�6C9�H�>H�I=:�CJB7:G�D;�HEA>IH�

-D�EGDK:�I=:�EGDEDH>I>DC�L:�G:86AA�I=6I�>C�DG9:G�ID�E:G;DGB�HEA>IH�L:�=6K:�ID�8DBEJI:�I=:�,,*�;DG�:68=9>B:CH>DC�6C9�;DG�:68=�HEA>II>C<�ED>CI��-=JH�:68=�HEA>I�=6H�6�8DBEA:M>IN� �6C9�L:�E:G;DGB�H���HEA>IH�-=:�7DIIDBJE�HI:E�8DCIG>7JI:H� ID� I=:�DK:G6AA�8DBEA:M>IN�L>I=�6� I:GB� � L=:G:�@� �� >H� I=:�CJB7:G�D;8AJHI:GH�H>C8:�;DG�:68=�8AJHI:G�L:�=6K:�ID�8DCH>9:G�DCAN�I=:�EDHH>7AN�69?68:CI�8AJHI:GH�;DG�B:G<>C<��7JI�H>C8:

� �DI=:GL>H:� 8AJHI:G� 6HH><CB:CI� L>AA� 7:� B:6C>C<A:HH� 4��5�� L:� 86C� 9>HG:<6G9� I=>H� I:GB� A:69>C<� ID�8DBEA:M>IN�;DG�I=:�LDGHI�86H:�H8:C6G>D�

����>H8JHH>DC"C�DG9:G�ID�H=DL�I=:�8=6G68I:G>HI>8H�D;�&�%.�,�L:�JH:9�ILD�EJ7A>8AN�6K6>A67A:�96I6H:I�DC� :C:��MEG:HH>DC(BC>7JH��6I676H:��6�96I6H:I�EGDK>9:9�7N�4��5��6I6H:I���=:G:6;I:G�6C9�6�96I6H:I�EGDK>9:9�7N�4��5��6I6H:I��=:G:6;I:G���JGI=:GBDG:�L:�I:HI:9�DJG�6A<DG>I=BH�DC�96I6H:I���� � � �4��5�6C9�96I6H:I�2:6HI�,EDGJA6I>DC4��5���C6AD<DJHAN�ID�4��5� L:�8DBE6G:9�H:K:G6A�8AJHI:G>C<�6A<DG>I=BH�>C�DG9:G�ID�6HH:HH�I=:�K6A>9>IN�D;�DJG6EEGD68=�>C�I=:�7>DAD<>86A�96I6�H8:C6G>D��"C�E6GI>8JA6G�L:�8DBE6G:9�DJG�B:I=D9�L>I=��"+�!�4��5�$B:6CH��4�5��L:�G:;:G�ID�>I�6H�$&����@�B:6CH�4� 5��L:�G:;:G�ID�>I�6H�,&�+-��()-"�,�4�5�6C9��"�'��4��5���DG�I=:�@B:6CH�76H:9�6A<DG>I=BH�L:�E:G;DGB:9�� �GJCH��H6B:�6H�4�5��6C9�L:�G:EDGI�I=:�6K:G6<:�K6AJ:H�;DG�I=:H:�GJCH�&DG:DK:G�H>C8:�DJG�6A<DG>I=B�>H�=>:G6G8=>86A�L:�8DBE6G:9�>I�L>I=�G:HE:8I�ID�,>C<A:�%>C@�4��5��JHJ6AAN�G:;:GG:96H�':6G:HI�':><=7DJG��AJHI:G>C<�L:�G:;:G�ID�>I�6H�''�>C�I=:�;DAADL>C<���DBEA:I:�%>C@�4��5��JHJ6AAN�G:;:GG:9�6H�6GI=:HI�':><=7DJG��AJHI:G>C<�L:� G:;:G� ID� >I� 6H� �'� >C� I=:� ;DAADL>C<�� �K:G6<:� 6EEGD68=:H� 4��5� �JHJ6AANG:;:GG:9�6H�.CL:><=I:9�)6>G� GDJE�&:I=D9�L>I=��G>I=B:I>8�&:6C�L:�G:;:G�ID�>I�6H�.) &��>C�I=:�;DAADL>C<��

�><JG:�DEI>DCH

O ( n · d · l · s )

O ( n · d · l )O ( k 2 )

k ≪ nO ( n · d · l · s )

8/26/14 3:05 PMAnalysing microarray expression data through effective clustering

Page 9 of 25file:///Users/carlo/Desktop/papers14/Analysing%20microarray%20expression%20data%20through%20effective%20clustering.html

All the approaches mentioned above address the clustering problem from different viewpoints, thus strengthening our evaluation. Finally, for the sake of completeness, we also ran several experiments using an algorithm designed for biological data, SiMM-TS [6], which confirmed our superior performances, as will be shown below.

We started our analysis by considering these datasets, on which we used M-CLUBS and the other clustering algorithms for the sake of comparison. The obtained results are reported in Table 1 and Table 2.

Table 1.
Accuracy and time performances for Dataset 1 and Dataset 2.
Columns: Algorithm; SSQ and Time on Dataset 1; SSQ and Time on Dataset 2.
Rows: M-CLUBS, OPTICS, BIRCH, KM++, SMART, DIANA, UPGMA, NN, FN, SiMM-TS.
Values represent SSQ per dataset. Times are expressed in seconds.

Table 2.
Accuracy and time performances for AD400-10-10 and Yeast sporulation.
Columns: Algorithm; SSQ and Time on AD400-10-10; SSQ and Time on Yeast sporulation.
Rows: M-CLUBS, OPTICS, BIRCH, KM++, SMART, DIANA, UPGMA, NN, FN, SiMM-TS.
Values represent SSQ per dataset. Times are expressed in seconds.

The results obtained are quite convincing for both the accuracy and the execution times, where M-CLUBS exhibits the best performances (the best results for each table are reported in bold). In particular, our clustering method correctly detected the number of clusters in the data, as stated in detail in the next section. Indeed, M-CLUBS showed a nice feature when clustering the dataset whose HN group contains two subgroups (ER+ and ER−): during the splitting step, M-CLUBS identified these two subgroups, which were then collapsed into a single cluster after the merging step. To further assess the validity of the approach, we exploited several method-independent quality measures, which are reported in the following.

4.1. Quality of clustering results

Here we evaluate the quality of the results M-CLUBS produces and its reliability. The issue of finding method-independent measures for clustering results has been the source of much topical discussion, but over time sound measures have emerged that can be used reliably to compare the quality of the results produced by a wide range of clustering algorithms [8]. In particular, the following three measures have sound theoretical and practical bases: Variance Ratio (its range is $[0, \infty)$ and larger values indicate better clustering quality), Relative Margin (its range is $[0, 1)$ and lower values indicate a better clustering), and Weakest Link (its range is $[0, \infty)$ and lower values represent better clusterings).

The results obtained for the above-mentioned quality measures are given in Table 3 and Table 4. They show that M-CLUBS outperforms the other methods significantly, producing values for Relative Margin and Weakest Link (resp. Variance Ratio) that are significantly lower (resp. larger) than those of the other methods, i.e., clusters of much better quality.

Table 3.
Clustering quality measures evaluation.
Columns: Algorithm; #Clusters; Variance ratio; Relative margin; Weakest link.
Row groups: Dataset 1 and Dataset 2, each listing M-CLUBS, OPTICS, BIRCH, KM++, SMART, DIANA, UPGMA, NN, FN, SiMM-TS.

Table 4.
Clustering quality measures evaluation.
Columns: Algorithm; #Clusters; Variance ratio; Relative margin; Weakest link.
Row groups: AD400-10-10 and Yeast sporulation, each listing M-CLUBS, OPTICS, BIRCH, KM++, SMART, DIANA, UPGMA, NN, FN, SiMM-TS.

These results show that M-CLUBS always finds the exact number of clusters, and that the quality of the clusters found is markedly higher than that obtained by the other methods.

4.2. Additional quality measures

SSQ is a natural and widely used norm of similarity, but a devil's advocate can point out that other clustering algorithms might not measure their effectiveness in terms of SSQ, or even the compactness of each cluster around its centroid. Thus, we will attempt to measure the quality of the clusters produced by M-CLUBS using very different criteria, inspired by the nearest subclass classifiers that were previously used in a similar role in [??] and [??].

A first relevant evaluation measure in this approach is the error rate of a k-Nearest Neighbor classifier defined by the clustering results. This value provides relevant information about the ability of the clustering method under evaluation to minimize the errors due to incorrect assignment of points to the proper cluster. Indeed, this information is crucial for biological data analysis. Thus, for each point, we can check whether the dominant class of the k closest elements allows us to correctly predict the actual class of membership (there is no relationship between the value of k used here and that of k-means). Thus, the total number of points correctly classified measures the effectiveness of the clustering at hand. Formally, the error $e_k(D)$ of a k-NN classifier exploiting the distance matrix D among every pair of points can be defined as

$$e_k(D) = \frac{1}{N} \sum_{i=1}^{N} \gamma_k(i)$$

where N is the total number of points and $\gamma_k(i)$ is 0 if the predicted class of the i-th point, $x_i$, coincides with its actual class, and 1 otherwise. Low values of the $e_k(D)$ index denote high-quality clusters.

Following [??], we can go deeper in our evaluation by measuring the average number of elements, in a range of k elements (we recall again that we use the expected cluster size value), having the same class as the point under consideration. Practically, we define $q_k$ as the average percentage of points in the k-neighborhood of a generic point belonging to the same class as that point. In the formal definition, $Cl(i)$ represents the actual class associated with the i-th point in the dataset, $n_i = |Cl(i)|$, and $N_k(i)$ is the set of k points having the lowest distances from $x_i$, according to the distance used at hand.

This value provides really interesting information: it measures the purity of the clusters, since it takes into account the number of points wrongly assigned to a cluster. In principle, a Nearest Neighbor classifier exhibits a good performance when $q_k$ is high. Furthermore, $q_k$ provides a measure of the stability of a Nearest Neighbor classifier: high values of $q_k$ make a k-NN classifier less sensitive to increasing values (k) of neighbors considered. The sensitivity of the clustering can also be measured by considering, for a given group of points, the probability that x and y belong to the same class and z belongs to a different class, but z is more similar to x than y is. We denote this probability by $\varepsilon(D)$; in its estimate, $\delta_D$ is 1 if $D(i, j) < D(i, k)$ and 0 otherwise. This value gives information about the ambiguity in cluster assignments. Here too, low values of $\varepsilon(D)$ denote a good performance of the clustering under consideration.
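The three indices discussed in this section can be sketched in a few lines. The code below is an illustration built only from the verbal descriptions, so several details are assumptions: each point is excluded from its own neighbourhood, ties in the majority vote are broken by whichever label is encountered first, the neighbourhood purity is normalised by k (the original definition also involves the class size $n_i$, which is not used here), and the ambiguity index is estimated as a plain fraction over all admissible triples. The function names are invented for this example.

```python
from collections import Counter


def k_nearest(dist, i, k):
    """Indices of the k points closest to point i (i itself excluded)."""
    others = [j for j in range(len(dist)) if j != i]
    return sorted(others, key=lambda j: dist[i][j])[:k]


def knn_error(dist, labels, k):
    """Fraction of points whose k-NN majority label differs from their own."""
    errors = 0
    for i in range(len(labels)):
        votes = Counter(labels[j] for j in k_nearest(dist, i, k))
        predicted = votes.most_common(1)[0][0]
        errors += predicted != labels[i]
    return errors / len(labels)


def neighbourhood_purity(dist, labels, k):
    """Average share of the k nearest neighbours sharing the point's label."""
    total = 0.0
    for i in range(len(labels)):
        same = sum(labels[j] == labels[i] for j in k_nearest(dist, i, k))
        total += same / k
    return total / len(labels)


def ambiguity(dist, labels):
    """Share of triples (x, y, z), with y in x's class and z outside it,
    in which z is nevertheless closer to x than y is."""
    bad = total = 0
    n = len(labels)
    for x in range(n):
        for y in range(n):
            if y == x or labels[y] != labels[x]:
                continue
            for z in range(n):
                if labels[z] == labels[x]:
                    continue
                total += 1
                bad += dist[x][z] < dist[x][y]
    return bad / total if total else 0.0


if __name__ == "__main__":
    values = [0, 1, 2, 10, 11, 12]
    labels = [0, 0, 0, 1, 1, 1]
    dist = [[abs(a - b) for b in values] for a in values]
    print(knn_error(dist, labels, 2),
          neighbourhood_purity(dist, labels, 2),
          ambiguity(dist, labels))
```

For two well-separated groups, as in the usage example, the error and the ambiguity are zero and the purity is one, which is the pattern the text associates with high-quality clusters.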

The results reported in Table 5 and Table 6 show that M-CLUBS produces better results than the other algorithms.

Table 5.
Quality indices for Dataset 1 and Dataset 2.
Columns: Method; $\varepsilon$; $e_{k=10}$; $q_{k=10}$.
Row groups: Dataset 1 and Dataset 2, each listing M-CLUBS, OPTICS, BIRCH, KM++, SMART, DIANA, UPGMA, NN, FN, SiMM-TS.

Table 6.
Quality indices for AD400-10-10 and yeast sporulation.
Columns: Method; $\varepsilon$; $e_{k=40}$; $q_{k=40}$.
Row groups: AD400-10-10 and Yeast sporulation, each listing M-CLUBS, OPTICS, BIRCH, KM++, SMART, DIANA, UPGMA, NN, FN, SiMM-TS.

Table 5 and Table 6 show that M-CLUBS offers the best performance on all indices. In particular, the really high values of $q_k$ (it is practically 1, since M-CLUBS detects exactly the number of clusters for each dataset and the assignment of points to clusters is correct) allow us to assess that the clusters are well defined and that M-CLUBS outperforms the other methods. In measuring $e_k$ and $q_k$, we used neighborhoods of a size close to the actual cluster size made available by the dataset providers; thus, it is a good choice for testing the quality of clusters. The overall structure of the clusters and the distribution of points for all datasets (results in Table 5 and Table 6) produced superior performance for M-CLUBS on every index, with particularly low values of $\varepsilon$. This result suggests that M-CLUBS exhibits the highest effectiveness compared to the other approaches, even when SSQ is not the exploited metric.

4.3. Evaluating the biological relevance of clusters

In this section we report the experimental results regarding a further comparison we performed to assess the validity of our approach from a biological viewpoint. Indeed, clustering gene expression data is a valid support for functional annotation, tissue classification, regulatory motif identification and other applications, but choosing the right clustering may be rather difficult. To address this issue several proposals have been presented, such as [??] and [20]. In this paper we exploited the quality measure defined in [20] for biological data clustering evaluation, since it summarizes several evaluation metrics in a single measure. We ran the experiments in standard mode, using all Gene Ontology (GO) classes as input setting of the program, and report the obtained CQS (Clustering Quality Score) [20]. This analysis assures a stronger validation of the clustering results from a biological viewpoint. The results are reported in Table 7 and show that the biological relevance of M-CLUBS is quite high.

Table 7.
Clustering quality score for Dataset 1, Dataset 2, AD400-10-10 and yeast sporulation.
Columns: Method; CQS.
Row groups: Dataset 1, Dataset 2, AD400-10-10 and Yeast sporulation, each listing M-CLUBS, OPTICS, BIRCH, KM++, SMART, DIANA, UPGMA, NN, FN, SiMM-TS.

The high performance of M-CLUBS from a biological viewpoint as well can be understood by considering that it can backtrack on previously wrong computations made in the splitting phase. In more detail, through the merging step we can properly assign gene expressions to their group, and thus to the correct function when it is the target of the analysis, by updating previous wrong assignments. This is possible because we can also group together “siblings” gene expressions in our auxiliary tree structure.

Further discussion on biological relevance of clustering. In order to further assess the biological coherence of M-CLUBS clusters, we briefly discuss here Enrichment Analysis. Enrichment Analysis is intended to characterize biological attributes in a given gene set. In this respect, the GO dataset is a key resource. In particular, GO ontologies are split into cellular component, molecular function and biological process. Using these ontologies we can better characterize genes, thus improving the annotation process. Many tools exist for assessing the significance of enrichment within a group. They typically exploit hypergeometric testing, but can also be based on a Kolmogorov–Smirnov statistic. These tools usually require empirical estimations of p-values and multiple testing corrections. Due to our peculiar approach, according to [?] and [??], we need to compute for each cluster the GO annotations and the corresponding p-values, which evaluate the probability that a given cluster occurs. Indeed, we determine whether an observed level of annotation for a group of genes is significant within the context of annotation. We obtained a p-value for each of the datasets being analyzed (Dataset 1, Dataset 2, AD400-10-10 and Yeast Sporulation). As reported above, such satisfactory results are obtained because we group together “siblings” gene expressions when clustering data. As a matter of fact, these results further confirm the relevance of our clustering from a biological viewpoint.
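As a concrete illustration of the hypergeometric testing mentioned above, the sketch below computes the upper-tail probability of observing at least a given number of annotated genes in a cluster. It is a generic textbook computation, not the procedure of the tool used in the paper; the function name and its four parameters are assumptions made for this example, and no multiple-testing correction is applied.

```python
from math import comb


def hypergeom_enrichment_pvalue(population, annotated, cluster_size, hits):
    """P(X >= hits) for a hypergeometric draw.

    population   -- total number of genes considered
    annotated    -- genes in the population carrying the annotation
    cluster_size -- number of genes in the cluster (drawn without replacement)
    hits         -- annotated genes observed in the cluster
    """
    upper = min(annotated, cluster_size)
    tail = sum(
        comb(annotated, x) * comb(population - annotated, cluster_size - x)
        for x in range(hits, upper + 1)
    )
    return tail / comb(population, cluster_size)


if __name__ == "__main__":
    # 3 annotated genes out of 10; a cluster of 3 genes contains all of them.
    print(hypergeom_enrichment_pvalue(10, 3, 3, 3))
```

A small p-value indicates that the annotation is over-represented in the cluster relative to what random sampling of the same number of genes would produce.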

5. Conclusion

The naturalness of the hierarchical approach for clustering objects is widely recognized and is also supported by psychological studies of children's cognitive behaviors [38]. M-CLUBS provides the analytical and algorithmic advances that have turned this intuitive approach into a data mining method of superior accuracy, robustness and speed. The speed achieved by our approach is largely due to the ability of M-CLUBS to exploit the analytical properties of its quadratic distance functions to simplify the computation, thus making M-CLUBS well suited for large and high-dimensional datasets like the biological ones. We evaluated the effectiveness of our approach by using several method-independent quality measures, which confirmed the high quality of the retrieved clusters from a structural point of view. In particular, the experimental assessment clarified that M-CLUBS guarantees good clusterings for the datasets being analyzed, which represent a severe benchmark for the biological data scenario. Moreover, we provided a biological interpretation of the clustering solutions by a domain expert, and quality measures tailored for biological data that confirmed the high quality of the clusters retrieved by M-CLUBS. We conjecture that similar benefits might be at hand for situations where the samples are in data streams or in secondary store. These situations were not studied in this paper, but represent a promising topic for future research.

Acknowledgments

The authors would like to thank both the anonymous reviewers and the IS associate editor who assisted our submission for their invaluable suggestions and insightful comments, which helped us improve the paper significantly. We also thank Irit Gat-Viks and Susmita Datta for providing us the code of their projects and many useful details for using it properly.

Appendix A. Detailed discussion on SSQ reduction

Due to its grid-based divisive-agglomerative approach, M-CLUBS produces cluster results of superior quality. Furthermore, the algorithm can be executed with superior time and accuracy performances using a simple and fast formula that allows us to estimate the SSQ reduction produced by a split. The correctness of the approach, which guarantees its convergence, is discussed next.

In order to select an optimal binary partition, we need to estimate the SSQ reduction obtained by this split.

Suppose we split a cluster $C_s$, represented by $\langle Q, S, N \rangle$, at a given position j into two clusters $C_s'$ and $C_s''$, represented respectively by $\langle Q1, S1, N1 \rangle$ and $\langle Q2, S2, N2 \rangle$.

Thus the SSQ reduction is the nonnegative value:

$$\Delta SSQ(i, j) = SSQ(C_s) - SSQ(C_s') - SSQ(C_s'')$$

For dimension i (for the sake of clarity we denote $SSQ(C_s)$ as $SSQ$, $SSQ(C_s')$ as $SSQ1$ and $SSQ(C_s'')$ as $SSQ2$):

$$\Delta SSQ_i(j) = SSQ_i(j) - (SSQ1_i(j) + SSQ2_i(j))$$

Applying the formula for SSQ and recalling that

$$Q_i = Q1_i(j) + Q2_i(j)$$

we obtain, after simplification, a closed-form expression for $\Delta SSQ_i(j)$. The overall $\Delta SSQ$ can then be obtained by simply summing up these values over all dimensions.

This formula enables a quick computation of the SSQ difference achieved when a cluster is split and, equivalently, when two clusters are merged into one. It is straightforward to see that the computation is deterministic, thus guaranteeing the termination of the overall algorithm. In more detail, since it is performed at each cluster split (or merge), in the worst case it will be done $2 \cdot dim$ times (where dim is the number of points in the dataset; this corresponds to the case in which a singleton cluster is generated for each data point and then, iteratively, each singleton is merged to obtain the initial dataset again). As regards the correctness of the algorithm, we recall that cluster topology and assignment strongly rely on the chosen measure; in this respect, our approach is correct since it minimizes the intra-cluster distances and maximizes the inter-cluster distances, thus guaranteeing that the obtained partition conforms to the cluster definition. In the next section we will discuss the superior performances of our algorithm and compare it against several clustering approaches.
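As a hedged illustration of the kind of closed form this appendix refers to, the derivation below works under one explicit assumption: that the per-dimension SSQ of a cluster summarised by $\langle Q, S, N \rangle$ is $Q_i - S_i^2 / N$, with Q holding the sums of squares, S the sums and N the number of points. That reading of the summaries, and the additivity relations $S_i = S1_i + S2_i$ and $N = N1 + N2$, are assumptions of this sketch rather than statements recovered from the text.

```latex
\begin{align*}
SSQ_i &= Q_i - \frac{S_i^2}{N}
  && \text{(assumed form of the per-dimension SSQ)} \\
\Delta SSQ_i(j) &= SSQ_i(j) - \bigl(SSQ1_i(j) + SSQ2_i(j)\bigr) \\
  &= \frac{S1_i^2}{N1} + \frac{S2_i^2}{N2} - \frac{(S1_i + S2_i)^2}{N1 + N2}
  && \text{(using } Q_i = Q1_i(j) + Q2_i(j)\text{)} \\
  &= \frac{N1 \cdot N2}{N1 + N2}
     \left( \frac{S1_i}{N1} - \frac{S2_i}{N2} \right)^2 \\
\Delta SSQ &= \frac{N1 \cdot N2}{N1 + N2}
     \sum_i \left( \frac{S1_i}{N1} - \frac{S2_i}{N2} \right)^2
\end{align*}
```

Under these assumptions the reduction depends only on the sums and counts of the two parts, is nonnegative by construction, and can be evaluated in time linear in the number of dimensions, which is what makes scanning many candidate splits (or merges) inexpensive.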

4�5

4�5

4�5

4�5

4�5

4�5

4�5

4�5

4�5

4� 5

4��5

4��5

4��5

+:;:G:C8:H#���=C�2��2DDC�,��)6G@'D>H:GD7JHI� 6A<DG>I=B� ;DG� >9:CI>;N>C<� ;JC8I>DC6AAN� 6HHD8>6I:9� 7>8AJHI:GH� ;GDB� <:C:'D>H:GD7JHI� 6A<DG>I=B� ;DG� >9:CI>;N>C<� ;JC8I>DC6AAN� 6HHD8>6I:9� 7>8AJHI:GH� ;GDB� <:C::MEG:HH>DC�96I6:MEG:HH>DC�96I6"C;DGB��,8>������������ ����EE�����V���

&���C@:GHI�&�&���G:JC><�!�)��$G>:<:A�#��,6C9:G�(EI>8H��DG9:G>C<�ED>CIH�ID�>9:CI>;N�I=:�8AJHI:G>C<�HIGJ8IJG:>C����&WH�,E:8>6A�"CI:G:HI� GDJE�DC�&6C6<:B:CI�(;��6I6������EE����V� �

/���GC6J�,��&6GH�"��&6GTC"I:G6I>K:�8AJHI:G�6C6ANH>H�D;�EGDI:>C�>CI:G68I>DC�96I6"I:G6I>K:�8AJHI:G�6C6ANH>H�D;�EGDI:>C�>CI:G68I>DC�96I6�>D>C;DGB6I>8H���������� ���EE�����V���

����GI=JG�,��/6HH>AK>IH@>>�@&:6CH����I=:�69K6CI6<:H�D;�86G:;JA�H::9>C<�>C����&,"�&�,NBEDH>JB�DC�>H8G:I:��A<DG>I=BH�� ��EE��� ��V� ���

0!��J�$�������=6C���$����0DC<�2��06C<�IIG>7JI:�8AJHI:G>C<�;DG�<GDJE>C<�H:A:8I>DC�6C9�8A6HH>;>86I>DC�D;�<:C:�:MEG:HH>DC�96I6�IIG>7JI:�8AJHI:G>C<�;DG�<GDJE>C<�H:A:8I>DC�6C9�8A6HH>;>86I>DC�D;�<:C:�:MEG:HH>DC�96I6"������&�-G6CH���DBEJI���>DA���>D>C;DGB������ ���EE����V� �

,���6C9NDE69=N6N����&J@=DE69=N6N�.��&6JA>@�C�>BEGDK:9�6A<DG>I=B�;DG�8AJHI:G>C<�<:C:�:MEG:HH>DC�96I6�C�>BEGDK:9�6A<DG>I=B�;DG�8AJHI:G>C<�<:C:�:MEG:HH>DC�96I6�>D>C;DGB6I>8H����������� ���EE������V����

3���6G#DH:E=�������:B6>C:���$�� >;;DG9�'��,G:7GD���&��!6B:A�-��#66@@DA6$6GN�8AJHI:G>C<�L>I=�DEI>B6A�A:6;�DG9:G>C<�;DG�<:C:�:MEG:HH>DC�96I6$6GN�8AJHI:G>C<�L>I=�DEI>B6A�A:6;�DG9:G>C<�;DG�<:C:�:MEG:HH>DC�96I6�>D>C;DGB6I>8H���������� ���EE��� � V� ��

,���:C�6K>9�&���8@:GB6C�&:6HJG:H�D;�8AJHI:G>C<�FJ6A>IN��6�LDG@>C<�H:I�D;�6M>DBH�;DG�8AJHI:G>C<�>C��':JG6A"C;DGB6I>DC�)GD8:HH>C<�,NHI:BH�� ��EE�����V����

����:C�DG�+��,=6B>G�3��26@=>C>�AJHI:G>C<�<:C:�:MEG:HH>DC�E6II:GCH�AJHI:G>C<�<:C:�:MEG:HH>DC�E6II:GCH#���DBEJI���>DA������V����������EE�����V���

2�&���=:JC<@B>96HI&:6CH��6�C:L�<:C:G6A>O:9�@B:6CH�8AJHI:G>C<�6A<DG>I=B@B>96HI&:6CH��6�C:L�<:C:G6A>O:9�@B:6CH�8AJHI:G>C<�6A<DG>I=B)6II:GC�+:8D<C��%:II������������ ���EE������V����

,���=J�#���:+>H>�&���>H:C�#��&JA=DAA6C9�����DIHI:>C�)�(���GDLC�"��!:GH@DL>IO-=:�IG6CH8G>EI>DC6A�EGD<G6B�D;�HEDGJA6I>DC�>C�7J99>C<�N:6HI-=:�IG6CH8G>EI>DC6A�EGD<G6B�D;�HEDGJA6I>DC�>C�7J99>C<�N:6HI,8>:C8:�������������������EE�����V� �

,���6II6�,���6II6�K6AJ6I>DC�D;�8AJHI:G>C<�6A<DG>I=BH�;DG�<:C:�:MEG:HH>DC�96I6�K6AJ6I>DC�D;�8AJHI:G>C<�6A<DG>I=BH�;DG�<:C:�:MEG:HH>DC�96I6�&���>D>C;DGB�����,����� ��

����:;6NH

8/26/14 3:05 PMAnalysing microarray expression data through effective clustering

Page 18 of 25file:///Users/carlo/Desktop/papers14/Analysing%20microarray%20expression%20data%20through%20effective%20clustering.html

4��5

4��5

4��5

4��5

4��5

4��5

4� 5

4��5

4��5

4��5

4��5

�C�:;;>8>:CI�6A<DG>I=B�;DG�6�8DBEA:I:�A>C@�B:I=D9�C�:;;>8>:CI�6A<DG>I=B�;DG�6�8DBEA:I:�A>C@�B:I=D9�DBEJI��#��� ��������EE�����V���

����:B7-A-�)��$6HIC:G�JOON�8B:6CH�B:I=D9�;DG�8AJHI:G>C<�B>8GD6GG6N�96I6�JOON�8B:6CH�B:I=D9�;DG�8AJHI:G>C<�B>8GD6GG6N�96I6�>D>C;DGB6I>8H���������� ���EE�����V��

)���W=6:H:A::G!DL�9D:H�<:C:�:MEG:HH>DC�8AJHI:G>C<�LDG@�!DL�9D:H�<:C:�:MEG:HH>DC�8AJHI:G>C<�LDG@�'6I���>DI:8=CDA������������ ���EE������V�� �

%�,���>C7DC9�-��,J�!����0J�+���G>:9B6C�1��06C<����+6B>G:O����$GDC:C7:G<�"����0:>CHI:>C-=:� <GDLI=� >C=>7>IDGN� :;;:8I� D;� 68I>DC� DC� =JB6C�7G:6HI� 86C8:G� 8:AAH� >H� 6HHD8>6I:9�L>I=-=:� <GDLI=� >C=>7>IDGN� :;;:8I� D;� 68I>DC� DC� =JB6C�7G:6HI� 86C8:G� 8:AAH� >H� 6HHD8>6I:9�L>I=68I>K6I>DC�D;�HIG:HH�G:HEDCH:�E6I=L6NH68I>K6I>DC�D;�HIG:HH�G:HEDCH:�E6I=L6NH"CI��#���6C8:G����������� ���EE��� ��V� ��

&���HI:G�!�)��$G>:<:A�#��,6C9:G�1��1J���9:CH>IN76H:9�6A<DG>I=B�;DG�9>H8DK:G>C<�8AJHI:GH�>C�A6G<:�HE6I>6A96I676H:H�L>I=�CD>H:�>C��$CDLA:9<:��>H8DK:GN�6C9��6I6�&>C>C<������

,���A:H86� ��&6C8D����&6H8>6G>�%��)DCI>:G>����)J<A>:H:�6HI�9:I:8I>DC�D;�1&%�HIGJ8IJG6A�H>B>A6G>IN�6HI�9:I:8I>DC�D;�1&%�HIGJ8IJG6A�H>B>A6G>IN"����-G6CH��$CDLA���6I6��C<����������� ���EE���� V���

%�� 6AAJ88>D�(��&>8=:A�)���DBDC���(��!:GD�AJHI:G>C<�L>I=�6�C:L�9>HI6C8:�B:6HJG:�76H:9�DC�6�9J6AGDDI:9�IG::�AJHI:G>C<�L>I=�6�C:L�9>HI6C8:�B:6HJG:�76H:9�DC�6�9J6AGDDI:9�IG::"C;DGB��,8>�������� ����EE����V���

"�� 6I/>@H�+��,=6G6C�+��,=6B>G,8DG>C<�8AJHI:G>C<�HDAJI>DCH�7N�I=:>G�7>DAD<>86A�G:A:K6C8:,8DG>C<�8AJHI:G>C<�HDAJI>DCH�7N�I=:>G�7>DAD<>86A�G:A:K6C8:�>D>C;DGB6I>8H����������� ���EE������V����

#�� DAAJ7� ��,=:GAD8@�AJHI:G>C<�B>8GD6GG6N�96I6�AJHI:G>C<�B>8GD6GG6N�96I6&:I=D9H��CONBDA�������� ���EE�����V���

$�� G6=6B�����:�%6H�&DG:C6H����-G>E6I=>����$>C<�&��$6K6C6=�#��&:C9:O�&��,IDC:�#��,A6B6�&�&>AA:G� ���CID>C:�!��0>AA:GH�)��,:76HI>6C>���%��+DH:C7:G< :C:� :MEG:HH>DC� >C� =>HIDAD<>86AAN� CDGB6A� :E>I=:A>JB� ;GDB� 7G:6HI� 86C8:G� E6I>:CIH� 6C9 :C:� :MEG:HH>DC� >C� =>HIDAD<>86AAN� CDGB6A� :E>I=:A>JB� ;GDB� 7G:6HI� 86C8:G� E6I>:CIH� 6C9;GDB�86C8:G;G::�EGDE=NA68I>8�B6HI:8IDBN�E6I>:CIH�H=6G:H�6�H>B>A6G�EGD;>A:;GDB�86C8:G;G::�EGDE=NA68I>8�B6HI:8IDBN�E6I>:CIH�H=6G:H�6�H>B>A6G�EGD;>A:�G>I��#���6C8:G�� �������� � ��EE������V����

"�� GDC6J�,��&DG6C�(EI>B6A�"BEA:B:CI6I>DCH�D;�.) &��6C9�(I=:G��DBBDC��AJHI:G>C<��A<DG>I=BH-:8=C>86A�+:EDGI�� ��

)�!�� JOO>�&���6CC6I6GDBJ�,��6C�:MI:CH>DC�D;�I=:�-&��EA6I;DGB�ID�B6C6<:�6;;NB:IG>M�7>C6GN�96I6BJ�,��6C�:MI:CH>DC�D;�I=:�-&��EA6I;DGB�ID�B6C6<:�6;;NB:IG>M�7>C6GN�96I6�&���>D>C;DGB������� � ��E�����

8/26/14 3:05 PMAnalysing microarray expression data through effective clustering

Page 19 of 25file:///Users/carlo/Desktop/papers14/Analysing%20microarray%20expression%20data%20through%20effective%20clustering.html

4��5

4��5

4��5

4��5

4��5

4� 5

4��5

4��5

4��5

4��5

4��5

4��5

)�!�� JOO>�&�-���>�&6GI>CD� ��-G69><D�)��/:AIG>�)��-6HHDC:�)��-6<A>6;:GG>�&���6CC6I6GD�JIDB6I>8�HJBB6G>H6I>DC�6C9�6CCDI6I>DC�D;�B>8GD6GG6N�96I6�JIDB6I>8�HJBB6G>H6I>DC�6C9�6CCDI6I>DC�D;�B>8GD6GG6N�96I6,D;I��DBEJI����������� ����EE���� �V����

#��!6C�&��$6B7:G�6I6�&>C>C<���DC8:EIH�6C9�-:8=C>FJ:H�6I6�&>C>C<���DC8:EIH�6C9�-:8=C>FJ:H&DG<6C�$6J;B6CC��� �

'��!:6G9����!DAB:H����,I:E=:CH��FJ6CI>I6I>K:�HIJ9N�D;�<:C:�G:<JA6I>DC� >CKDAK:9� >C� I=:� >BBJC:�G:HEDCH:�D;�6CDE=:A>C:��FJ6CI>I6I>K:�HIJ9N�D;�<:C:�G:<JA6I>DC� >CKDAK:9� >C� I=:� >BBJC:�G:HEDCH:�D;�6CDE=:A>C:BDHFJ>ID:H��6C�6EEA>86I>DC�D;�76N:H>6C�=>:G6G8=>86A�8AJHI:G>C<�D;�8JGK:HBDHFJ>ID:H��6C�6EEA>86I>DC�D;�76N:H>6C�=>:G6G8=>86A�8AJHI:G>C<�D;�8JGK:H#���B��,I6I���HHD8��� ���������� ���E����

$����!:AA:G�3�� =6=G6B6C>�6N:H>6C�=>:G6G8=>86A�8AJHI:G>C<�6N:H>6C�=>:G6G8=>86A�8AJHI:G>C<"CI���DC;��&68=��%:6GC���� ���EE�����V� �

��$��#6>C�&�'��&JGIN�)�#���ANCC�6I6�8AJHI:G>C<��6�G:K>:L�6I6�8AJHI:G>C<��6�G:K>:L��&��DBEJI��,JGK�����������

+��#DGCHI:C����2J,>BJAI6C:DJH�<:C:�8AJHI:G>C<�6C9�HJ7H:I�H:A:8I>DC�;DG�H6BEA:�8A6HH>;>86I>DC�K>6�&�%,>BJAI6C:DJH�<:C:�8AJHI:G>C<�6C9�HJ7H:I�H:A:8I>DC�;DG�H6BEA:�8A6HH>;>86I>DC�K>6�&�%�>D>C;DGB6I>8H���������� ���EE���� V�� �

%��$6J;B6C�)�#��+DJHH::JL�>C9>C<� GDJEH�>C��6I6���C�"CIGD9J8I>DC�ID��AJHI:G��C6ANH>H�>C9>C<� GDJEH�>C��6I6���C�"CIGD9J8I>DC�ID��AJHI:G��C6ANH>H0>A:N��� ��

��$:GG�!�#��+JH@>C�&���G6C:�)���DDA6C-:8=C>FJ:H�;DG�8AJHI:G>C<�<:C:�:MEG:HH>DC�96I6-:8=C>FJ:H�;DG�8AJHI:G>C<�<:C:�:MEG:HH>DC�96I6�DBEJI���>DA��&:9����������� ���EE�����V���

���$DH8=B>:9:G�$��3>BB:GB6CC�,��-G>SA�-��,IDAIB6CC�.��%:H:G-DDAH�;DG�B6C6<>C<�6C9�6C6ANO>C<�B>8GD6GG6N�96I6-DDAH�;DG�B6C6<>C<�6C9�6C6ANO>C<�B>8GD6GG6N�96I6�G>:;>C<H��>D>C;DGB����������� ����EE����V�

#�3����%6>�-�#��!J6C<�C�6<<ADB:G6I>K:�8AJHI:G>C<�6A<DG>I=B�JH>C<�6�9NC6B>8�@C:6G:HIC:><=7DG�A >HI�C�6<<ADB:G6I>K:�8AJHI:G>C<�6A<DG>I=B�JH>C<�6�9NC6B>8�@C:6G:HIC:><=7DG�A >HI"C;DGB��,8>������������ ����EE������V����

+��%>J�%��#>6D�1��3=6C<�2��%> :C:�IG6CHEDHDC�76H:9�8ADC:�H:A:8I>DC�6A<DG>I=B�;DG�6JIDB6I>8�8AJHI:G>C< :C:�IG6CHEDHDC�76H:9�8ADC:�H:A:8I>DC�6A<DG>I=B�;DG�6JIDB6I>8�8AJHI:G>C<"C;DGB��,8>��� ���� ����EE���V��

#����&68*J::C�,DB:�B:I=D9H�;DG�8A6HH>;>86I>DC�6C9�6C6ANH>H�D;�BJAI>K6G>6I:�D7H:GK6I>DCH�>C���I=��:G@:A:N,NBEDH>JB�DC�&6I=:B6I>86A�,I6I>HI>8H�6C9�)GD767>A>IN������EE�����V����

8/26/14 3:05 PMAnalysing microarray expression data through effective clustering

Page 20 of 25file:///Users/carlo/Desktop/papers14/Analysing%20microarray%20expression%20data%20through%20effective%20clustering.html

4��5

4��5

4��5

4� 5

4��5

4��5

4��5

4��5

4��5

4��5

4��5

���)>OOJI>�,����+DB7D��8D8AJHI:G>C<�6EEGD68=�;DG�B>C>C<�A6G<:�EGDI:>CEGDI:>C�>CI:G68I>DC�C:ILDG@H��8D8AJHI:G>C<�6EEGD68=�;DG�B>C>C<�A6G<:�EGDI:>CEGDI:>C�>CI:G68I>DC�C:ILDG@H"������&�-G6CH���DBEJI���>DA���>D>C;DGB���������� ����EE�����V��

#�&��)AJB:GI�A:M>7>A >IN�>C�8=>A9G:CWH�JH:�D;�HE6I>6A�6C9�86I:<DG>86A�DG<6C>O6I>DC6A�HIG6I:<>:H�A:M>7>A >IN�>C�8=>A9G:CWH�JH:�D;�HE6I>6A�6C9�86I:<DG>86A�DG<6C>O6I>DC6A�HIG6I:<>:H+:86AA��:K:ADE��)HN8=DA��� ������������EE�����V���

���+6HBJHH:C�����:�%6��GJO�3�� =6=G6B6C>���%��0>A9&D9:A>C<�6C9�K>HJ6A>O>C<�JC8:GI6>CIN�>C�<:C:�:MEG:HH>DC�8AJHI:GH�JH>C<��>G>8=A:I�EGD8:HH&D9:A>C<�6C9�K>HJ6A>O>C<�JC8:GI6>CIN�>C�<:C:�:MEG:HH>DC�8AJHI:GH�JH>C<��>G>8=A:I�EGD8:HHB>MIJG:HB>MIJG:H"������&�-G6CH���DBEJI���>DA���>D>C;DGB���� ��

+��,6K6<:�$��!:AA:G�2��1J�3�� =6=G6B6C>�0��-GJB6C�&�� G6CI�$���:C7N����0>A9+��!���;6HI�76N:H>6C�=>:G6G8=>86A�8AJHI:G>C<�;DG�B>8GD6GG6N�96I6+��!���;6HI�76N:H>6C�=>:G6G8=>86A�8AJHI:G>C<�;DG�B>8GD6GG6N�96I6�&���>D>C;DGB��� ������� ���E�����

+��,>7HDC,A>C@��6C�DEI>B6AAN�:;;>8>:CI�6A<DG>I=B�;DG�I=:�H>C<A:A>C@�8AJHI:G�B:I=D9,A>C@��6C�DEI>B6AAN�:;;>8>:CI�6A<DG>I=B�;DG�I=:�H>C<A:A>C@�8AJHI:G�B:I=D9�DBEJI��#������������EE��� V��

R.R. Sokal, F.J. Rohlf, Biometry: the Principles and Practice of Statistics in Biological Research, Freeman W.H. & Company.

C.J. Veenman, M.J.T. Reinders, The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier, IEEE Trans. Pattern Anal. Mach. Intell.

W. Wang, J. Yang, R.R. Muntz, Sting: a statistical information grid approach to spatial data mining, in: Very Large Data Bases.

W. Wang, J. Yang, R.R. Muntz, An approach to active spatial data mining based on statistical information, IEEE Trans. Knowl. Data Eng.

K.Y. Yeung, D.R. Haynor, W.L. Ruzzo, Validating clustering for gene expression data, Bioinformatics.

T. Zhang, R. Ramakrishnan, M. Livny, Birch: a new data clustering algorithm and its applications, Data Min. Knowl. Discov.

Corresponding author.

We thank Irit Gat-Viks and Sushmita Datta for providing us the code of their projects and many useful details for using it properly.


The software is available at http://www.search.cpan.org/dist/GO-TermFinder/.

Copyright © Elsevier Inc. All rights reserved.
