21.06.2013 Views

JAM: Java agents for Meta-Learning over Distributed Databases

(This is exactly as in step (iii) of approach 1, except the data set used for combining is T_i (a different one for each bank) rather than DB.) Each such mapping is F_i.

iv) Each bank uses its F_i as in step (iv) of approach 1.

This latter approach is depicted in figure 4. Note that now there is no issue of where DB comes from, or how it might be formed. However, there is a different issue: would one really expect that F_i is all that much better in predictive accuracy than a classifier simply trained on the entire set of available data, DB_i ∪ T_i? After all, the F_i are created by looking solely at bank i's local data; the f_j (j ≠ i) are in essence just new features bank i can use to look at its data. Formal studies are under way to answer this question. Some preliminary results are described next.

Figure 4: Sharing knowledge without sharing data

5 Credit Card Fraud Transaction Data

Here we provide a general view of the data schema for the labelled transaction data sets compiled by a bank and used by our system. For purposes of our research and development activity, several data sets are being acquired from several banks, each providing 0.5 million records spanning one year, sampling, on average, 42,000 per month, from November 1995 to October 1996.

The schema of the database was developed over years of experience and continuous analysis by bank personnel to capture important information for fraud detection. The general schema of this data is provided in such a way that important confidential and proprietary information is not disclosed here. (After all, we seek not to teach "wannabe thieves" important lessons on how to hone their skills.) The records have a fixed length of 137 bytes each and about 30 numeric attributes, including the binary classification (fraud/legitimate transaction). Some of the fields are arithmetic and the rest categorical, i.e. numbers were used to represent a few discrete categories. The information in each record includes:

- A (non-revealing) hashed credit card account number.
- Scores produced by a commercial authorization/detection system.
- The date and time of each transaction.
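The stated figures (137-byte fixed-length records, about 42,000 records per month over one year) can be checked with a short Python sketch. The field layout inside a record is not disclosed in the text, so the sketch only splits a byte stream into whole records and does not try to decode any attribute. The helper name `iter_records` and the in-memory sample stream are hypothetical and used only for illustration.

```python
import io

RECORD_LENGTH = 137  # bytes per record, as stated in the text


def iter_records(stream, record_length=RECORD_LENGTH):
    """Yield successive fixed-length records from a binary stream.

    Hypothetical helper: the real field layout is confidential, so each
    record is returned as an undecoded bytes object.
    """
    while True:
        chunk = stream.read(record_length)
        if not chunk:
            return
        if len(chunk) != record_length:
            raise ValueError(f"truncated record: got {len(chunk)} bytes")
        yield chunk


# Back-of-the-envelope size of one bank's data set.
records_per_year = 42_000 * 12                       # about 0.5 million
bytes_per_year = records_per_year * RECORD_LENGTH    # raw size in bytes

# Illustration only: three all-zero placeholder records, not real data.
sample = io.BytesIO(bytes(RECORD_LENGTH * 3))
record_count = sum(1 for _ in iter_records(sample))
```

Under these assumptions, one bank's year of data comes to 504,000 records, or roughly 69 MB of raw fixed-length records.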

[Figure 4 diagram labels: Local Classifier; Remote Classifier 1, Remote Classifier 2, ..., Remote Classifier n; Meta-level Training Data; Local Meta-classifier]
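The arrangement in Figure 4, in which a bank combines its local classifier with classifiers imported from other banks by training a meta-classifier F_i on its own data T_i, can be sketched in Python. This is a minimal illustration under stated assumptions, not the paper's implementation: the base classifiers are hand-written toy rules, the record fields `amount` and `hour` are invented, and the meta-learner is a simple lookup table from prediction vectors to the majority true label, with a plain majority vote as fallback.

```python
from collections import Counter, defaultdict


def build_meta_training_data(base_classifiers, records, labels):
    """Pair each record's vector of base predictions with its true label."""
    return [
        (tuple(clf(rec) for clf in base_classifiers), label)
        for rec, label in zip(records, labels)
    ]


def train_meta_classifier(meta_data):
    """Learn, for each observed prediction vector, the majority true label."""
    votes = defaultdict(Counter)
    for pred_vector, label in meta_data:
        votes[pred_vector][label] += 1
    table = {vec: counts.most_common(1)[0][0] for vec, counts in votes.items()}

    def meta_classify(pred_vector):
        if pred_vector in table:
            return table[pred_vector]
        # Unseen combination: fall back to a majority vote of the base predictions.
        return Counter(pred_vector).most_common(1)[0][0]

    return meta_classify


def make_F(base_classifiers, meta_classify):
    """Compose base classifiers and meta-classifier into one mapping F_i."""
    def F(record):
        return meta_classify(tuple(clf(record) for clf in base_classifiers))
    return F


# Toy base classifiers (assumptions): 1 = fraud, 0 = legitimate.
local_clf = lambda r: int(r["amount"] > 500)      # trained locally
remote_clf_1 = lambda r: int(r["hour"] < 6)       # imported from another bank
remote_clf_2 = lambda r: int(r["amount"] > 900)   # imported from another bank
base = [local_clf, remote_clf_1, remote_clf_2]

# Invented local combining set T_i with true labels.
T_i = [
    {"amount": 950, "hour": 3},
    {"amount": 600, "hour": 14},
    {"amount": 100, "hour": 2},
    {"amount": 980, "hour": 15},
]
labels = [1, 0, 0, 1]

meta_data = build_meta_training_data(base, T_i, labels)
F_i = make_F(base, train_meta_classifier(meta_data))
```

Only the classifiers cross bank boundaries in this sketch; the records in `T_i` never leave the bank that owns them, which is the point of the figure's caption.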
