資源描述:
《Cancer classification and prediction using logistic regression with Bayesian gene selection》由會員上傳分享,免費(fèi)在線閱讀,更多相關(guān)內(nèi)容在行業(yè)資料-天天文庫。
1、JournalofBiomedicalInformatics37(2004)249–259www.elsevier.com/locate/yjbinCancerclassi?cationandpredictionusinglogisticregressionwithBayesiangeneselectiona,ba,ba,b,*XiaoboZhou,Kuang-YuLiu,StephenT.C.WongaHarvardCenterforNeurodegenerationandRepair—CenterforBioinformatics,HarvardMedicalSchool,220Longw
2、oodAvenue,Boston,MA02115,USAbRadiologyDepartment,HarvardMedicalSchoolandBrighamandWomen?sHospital,77FrancisStreet,Boston,MA02115,USAReceived13June2004Availableonline11September2004AbstractInmicroarray-basedcancerclassi?cationandprediction,geneselectionisanimportantresearchproblemowingtothelargenum-b
3、erofgenesandthesmallnumberofexperimentalconditions.Inthispaper,weproposeaBayesianapproachtogeneselectionandclassi?cationusingthelogisticregressionmodel.Thebasicideaofourapproachisinconjunctionwithalogisticregressionmodeltorelatethegeneexpressionwiththeclasslabels.WeuseGibbssamplingandMarkovchainMont
4、eCarlo(MCMC)methodstodis-coverimportantgenes.ToimplementGibbsSamplerandMCMCsearch,wederiveaposteriordistributionofselectedgenesgiventheobserveddata.Aftertheimportantgenesareidenti?ed,thesamelogisticregressionmodelisthenusedforcancerclassi?cationandprediction.Issuesfore?cientimplementationforthepropo
5、sedmethodarediscussed.Theproposedmethodisevaluatedagainstseverallargemicroarraydatasets,includinghereditarybreastcancer,smallroundblue-celltumors,andacuteleukemia.Theresultsshowthatthemethodcane?ectivelyidentifyimportantgenesconsistentwiththeknownbiological?ndingswhiletheaccuracyoftheclassi?cationis
6、alsohigh.Finally,therobustnessandsensitivitypropertiesoftheproposedmethodarealsoinvestigated.ó2004ElsevierInc.Allrightsreserved.Keywords:Genemicroarray;Logisticregression;Bayesiangeneselection;Cancerclassi?cation1.Introductionformodelselection[12],andthelogisticregressionmod-el[3].Thelogisticregress
7、ionmodel,alsoknownaslogitCancerclassi?cationandpredictionhasbecomeoneintheliterature,isoneofthemostcommonmodelsforofthemostimportantapplicationsofDNAmicroarrayprediction,regression,andclassi?cationofd