Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl
GroupLens Research Group / Army HPC Research Center
Department of Computer Science and Engineering
University of Minnesota
Minneapolis, MN 55455
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.
EC'00, October 17-20, 2000, Minneapolis, Minnesota.
Copyright 2000 ACM 1-58113-272-7/00/0010 ..$5.00

1.1 Problem Statement
In this paper, we research these two challenges together, by studying new and existing algorithms that have the potential to improve both scalability and quality of recommender systems. There has been little work on experimental validation of recommender systems against a set of real-world datasets, with the notable exception of [?]. More experimental validation is needed against real-world datasets, and it is important that these datasets include E-commerce data as well as content data.

The focus of this paper is two-fold. First, we provide a systematic experimental evaluation of different techniques for recommender systems, and second, we present new algorithms that are particularly suited for sparse data sets, such as those that are common in E-commerce applications of recommender technology. These algorithms have characteristics that make them likely to be faster in online performance than many previously studied algorithms, and we seek to investigate how the quality of their recommendations compares to other algorithms under different practical circumstances.

In performing our experimental validation, we use two datasets. First, we use data from a large E-commerce company, Fingerhut Inc. Fingerhut sells a wide variety of heterogeneous products, ranging in price from around ten dollars to several hundred dollars. Second, we use data from our own recommender system research site, MovieLens (www.movielens.umn.edu). Though MovieLens is a content data site, the items it recommends are products that consumers are seeking to purchase, so we feel the MovieLens analysis is also relevant to an E-commerce audience.

1.2 Contributions
This paper has three primary research contributions:

1. An analysis of the effectiveness of recommender systems on actual customer data from an e-commerce site.
2. A comparison of the performance of several different recommender algorithms, including original collaborative filtering algorithms, algorithms based on dimensionality reduction, and classical data mining algorithms.
3. A new approach to forming recommendations that has online efficiency advantages versus previously studied algorithms, and that also has quality advantages in the presence of very sparse datasets, such as is common with E-commerce purchase data.

1.3 Organization
The rest of the paper is organized as follows. The next section provides a brief overview of some related research work. The section following that provides detailed analysis of different recommender system tasks and formulates some [...]

[...] of people from a close-knit community, such as an office workgroup. However, recommender systems for large communities cannot depend on each person knowing the others. Later on, several ratings-based automated recommender systems were developed. The GroupLens research system [?, ?] provides a pseudonymous collaborative filtering solution for Usenet news and movies. Ringo [?] and Video Recommender [?] are email and web-based systems that generate recommendations on music and movies respectively. A special issue of Communications of the ACM [?] presents a number of different recommender systems. Although these systems have been successful in the past, their widespread use has exposed some of their limitations, such as the problems of sparsity in the data set, problems associated with high dimensionality, and so on. The sparsity problem in recommender systems has been addressed in [?, ?]. The problems associated with high dimensionality in recommender systems have been discussed in [?], and application of dimensionality reduction techniques to address these issues has been investigated in [?].

Personalization in E-Commerce. In recent years, with the advent of E-Commerce, the need for personalized services has been emphasized. Business researchers have advocated the need for one-to-one marketing [?]. One-to-one marketing attempts to improve the nature of marketing by using technology to assist businesses in treating each customer individually. To be successful in an increasingly competitive Internet marketplace, researchers have stressed the need for capturing customer loyalty [?]. Recommender systems can help businesses achieve these goals. Schafer et al. [?] present a detailed taxonomy and examples of recommender systems used in E-commerce, and show how they can provide one-to-one personalization and at the same time capture customer loyalty.

Knowledge Discovery in Databases (KDD). KDD techniques [?], also known as data mining, usually refer to extraction of implicit but useful information from databases. Two main goals of these techniques are to save money by discovering the potential for efficiencies, or to make more money by discovering ways to sell more products to customers. For instance, companies are using data mining to discover which products sell well at which times of year, so they can manage their retail store inventory more efficiently, potentially saving millions of dollars a year [?]. Other companies are using KDD to discover which customers will be most interested in a special offer, reducing the costs of di- [...]
[Figure: diagram of the recommendation process. Recoverable labels: Hi-dimensional, Low-dimensional, Most frequent items (aggregate), Association rule, {x, y, z}, {a, b, c}. The caption is illegible.]

Neighborhood Formed in Low-dimensional Space. The fact that the low dimensional space is less sparse than its high dimensional counterpart led us to form the neighborhood in reduced space. We first use a dimensionality reduction technique (e.g., Singular Value Decomposition) to produce a low dimensional representation, then we use vector similarity (cosine) to compute the proximity between customers and hence to form the neighborhood.

In addition to the above movie data, we use historical e-commerce purchase data from Fingerhut Inc., a large e-commerce company. This data set contains purchase information of [?] customers on [?] catalog products. In total, this data set contains [?] purchase records. As before, we divided the data set into a train set and a test set by using the same train/test ratio. We also computed the sparsity level for this data set and found it to be [?]. We term this data set EC for the rest of the paper.
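The data preparation described above (computing a sparsity level and dividing purchase records into a train set and a test set) can be sketched as follows. This is a minimal illustration, not the authors' code: the definition of sparsity as the fraction of empty cells in the customer-product matrix, the function names, the toy records, and the 0.8 train fraction are all assumptions made for the example, since the actual ratio and counts are not legible in this copy.

```python
import random


def sparsity_level(purchases, n_customers, n_products):
    """Fraction of empty cells in the customer-product matrix.

    Assumed definition: 1 - (nonzero entries / total entries).
    """
    nonzero = len(set(purchases))
    return 1.0 - nonzero / (n_customers * n_products)


def train_test_split(purchases, train_fraction, seed=0):
    """Randomly divide distinct purchase records into train and test sets."""
    records = sorted(set(purchases))
    rng = random.Random(seed)  # fixed seed so the split is repeatable
    rng.shuffle(records)
    cut = int(round(train_fraction * len(records)))
    return records[:cut], records[cut:]


# Hypothetical toy data: (customer_id, product_id) purchase records.
purchases = [(0, 0), (0, 2), (1, 1), (2, 0), (2, 3)]

# 0.8 is an illustrative value only, not taken from the paper.
train, test = train_test_split(purchases, train_fraction=0.8)
level = sparsity_level(purchases, n_customers=3, n_products=4)
```

With real purchase data the sparsity level would be computed once over the full matrix, and the test records would be hidden from the recommender during training.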
[Figure: Neighborhood sensitivity study (entire data set); F1 metric versus neighborhood size. The sub-captions are illegible.]

4.2 Evaluation Metrics
To evaluate top-N recommendation we use two metrics widely used in the information retrieval (IR) community, namely recall and precision [?]. However, we slightly [...] Our main goal is to look into the test set (i.e., the hidden portion of the purchase data) and match products with our top-N set. Products that appear in both sets are members of a special set, which we call the hit set. We now define recall and precision in our context. [...] For instance, increasing the number N tends to increase recall but decreases precision. The fact that both are critical for the quality judgement leads us to use a combination of the two. In particular, we use the standard F1 metric [?] that gives equal weight to both and is computed as

\[ F_1 = \frac{2 \cdot \mathit{recall} \cdot \mathit{precision}}{\mathit{recall} + \mathit{precision}} \]

We compute F1 for each individual customer and calculate the average value to use as our metric.
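The evaluation described above can be sketched in a few lines. The definitions of recall and precision are not legible in this copy, so the sketch assumes the usual information retrieval forms applied to the hit set: recall is the size of the hit set divided by the size of the test set, and precision is the size of the hit set divided by the size of the top-N set. The function names and toy item lists are hypothetical.

```python
def top_n_metrics(test_items, top_n_items):
    """Return (recall, precision, F1) for one customer.

    Assumed definitions: the hit set is the intersection of the hidden
    test items and the recommended top-N items.
    """
    test_set = set(test_items)
    top_n = list(dict.fromkeys(top_n_items))  # drop duplicates, keep order
    hit_set = test_set.intersection(top_n)
    recall = len(hit_set) / len(test_set) if test_set else 0.0
    precision = len(hit_set) / len(top_n) if top_n else 0.0
    if recall + precision == 0:
        return recall, precision, 0.0
    f1 = 2 * recall * precision / (recall + precision)
    return recall, precision, f1


def average_f1(per_customer):
    """Average the per-customer F1 values, as the text describes."""
    scores = [top_n_metrics(test, top)[2] for test, top in per_customer]
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical example: 4 hidden purchases, 5 recommendations, 2 hits.
recall, precision, f1 = top_n_metrics(
    ["a", "b", "c", "d"], ["a", "b", "x", "y", "z"]
)
mean_f1 = average_f1([(["a"], ["a"]), (["a"], ["b"])])
```

Increasing N in this sketch can only add hits, so recall never falls, while precision is divided by a larger N; this is the tension that motivates combining the two into F1.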
4.3 Experimental Results
In this section we present a detailed experimental evaluation of the different algorithmic choices for the steps of the CF-based recommender systems and compare their performance to that achieved by traditional association-rule based approaches. Our main goal is to explore the possibilities of combining different subtasks to formulate an efficient recommendation algorithm. As the number of combinations of different parameters and tasks is enormous, we experimentally evaluate each parameter by making reasonable choices for the rest.

In all the CF-based experiments the proximity between customers was measured by using the cosine metric, and each customer vector was normalized to be of unit length. The cosine metric was selected because it is applicable in both the original and lower dimensional representations. The unit length normalization was performed so that customers that have purchased many items will not dominate the aggregate neighborhoods or the singular value decomposition. Finally, in all our experiments we fixed the number of recommendations at [?] (i.e., top-[?]).

4.3.1 Experiments with neighborhood size.
The size of the neighborhood has significant impact on the recommendation quality [?]. To determine the effect of neighborhood size, we performed an experiment where we varied the neighborhood size to determine the effectiveness of the recommendations by computing the F1 metric. We ran our tests on both datasets using both high dimensional and low dimensional representations. In case of the low dimensional representation of input data, we use a fixed number of dimensions. Our results are shown in Figure [?].

As we can see from Figure [?], the size of the neighborhood does affect the quality of top-N recommendation. In general, the quality increases as we increase the number of neighbors. However, after a certain point, the improvement gains diminish and the quality becomes worse. An interesting observation from Figure [?] is that the optimal number of neighbors is data set dependent. In case of ML it reaches its peak somewhere in the range of [?], whereas in case of EC the peak is reached in the range of [?].

Given that the optimal number of nearest neighbors is different for different data sets, it is important to see if we can accurately estimate the optimal number of neighbors using the training data set alone. One way of doing this is to fur- [...]
[Figure: Dimension sensitivity study (ML data set) and Dimension sensitivity study (EC data set); F1 metric versus dimension, k. Series: ML-train, ML-total, EC-train, EC-total. The captions are illegible.]
[...]sented in Figure [?]. Comparing Figure [?] and Figure [?] we see that the sensitivity on the number of neighbors is the same for both the cases even though due to increased level of [...]

[Table fragment; column headers are not recoverable.]
E-commerce data    High dimensional             0.16654    0.16654
                   Low dimensional (k = 300)    0.12158    0.13209
                   Low dimensional (k = 300)    0.12158    0.08579

[Figure: two plots with F1 metric on the vertical axis; titles and captions are not recoverable.]

[...] large and high quality neighborhoods. We are carefully investigating this result.
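Neighborhood formation as described in Section 4.3 (cosine proximity between customer vectors that have been normalized to unit length, followed by selection of the closest customers) can be sketched as below. This is an illustrative sketch under stated assumptions, not the authors' implementation: the function names, the tie-breaking rule, and the toy purchase vectors are hypothetical, and the vectors could equally be rows of a reduced-dimensional representation.

```python
import math


def normalize(vector):
    """Scale a vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(v * v for v in vector))
    return [v / norm for v in vector] if norm > 0 else list(vector)


def cosine(u, v):
    """Cosine similarity of two unit-length vectors (a plain dot product)."""
    return sum(a * b for a, b in zip(u, v))


def neighborhood(customers, target, k):
    """Return the k customers most similar to `target`.

    Ties are broken by customer name so the result is deterministic
    (an assumption made for this example).
    """
    unit = {name: normalize(vec) for name, vec in customers.items()}
    scores = [
        (cosine(unit[target], vec), name)
        for name, vec in unit.items()
        if name != target
    ]
    scores.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scores[:k]]


# Hypothetical binary purchase vectors over four products.
customers = {
    "a": [1, 1, 0, 0],
    "b": [1, 1, 1, 0],
    "c": [0, 0, 1, 1],
    "d": [1, 0, 0, 0],
}
neighbors = neighborhood(customers, "a", k=2)
```

Varying `k` in such a routine and recomputing the average F1 is one way to reproduce the kind of neighborhood-size sensitivity study discussed in Section 4.3.1.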
0 o<\!% cp18PK]-0AEAj Mf
8>-FSUT-04U=
g=>-023:P-\=>:0 .Po(9 "[l!!J "
Agrawal, R., Imielinski, T., Swami, A. [?]. Fast Algorithms for Mining Association Rules. In [...].
Bhattacharyya, S. [?]. Direct Marketing Response Models using Genetic Algorithms. In [...], pp. [?].
Berry, M. W., Dumais, S. T., and O'Brian, G. W. [?]. Using Linear Algebra for Intelligent Information Retrieval. [...], pp. [?].
Konstan, J., Miller, B., Maltz, D., Herlocker, J., Gordon, L., and Riedl, J. [?]. GroupLens: Applying Collaborative Filtering to Usenet News. [...], pp. [?].
Ling, C. X., and Li, C. [?]. Data Mining for Direct Marketing: Problems and Solutions. In [...], pp. [?].
Park, J. S., Chen, M., Yu, P. S. [?]. An Effective Hash-based Algorithm for Mining Association Rules. [...].