資源描述:
《基于WEB日志挖掘系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)》由會(huì)員上傳分享,免費(fèi)在線閱讀,更多相關(guān)內(nèi)容在學(xué)術(shù)論文-天天文庫(kù)。
1、哈爾濱工程大學(xué)碩士學(xué)位論文基于Web日志挖掘系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)姓名:劉鑫申請(qǐng)學(xué)位級(jí)別:碩士專業(yè):計(jì)算機(jī)應(yīng)用技術(shù)指導(dǎo)教師:楊永田20060101哈爾濱1程人學(xué)碩十學(xué)位論文ABSTRACTAsthefastdevelopingandspreadingofInternet,Webusageinformationgrowsquickly.PeoplebegintopaycloseattentiontoIIliningva]uableinformationfrom1argeamountofdata.TheWorldWideweb(www)continuestogrowata
2、nastoundingrateinboththesheervolumeoftrafficandthesizeandcomplexityofWebsites.ThecomplexityoftaskssuckasWebsitedesign,Webserverdesign,andofsimplynavigatingthroughaWebsitehaveincreasedalongwiththisgrowth.AnimportantinputtothesedesigntasksistheanalysisofhowaWebsiteisbeingused.Loganalys
3、isincludesstraightforwardstatistics,suchaspageaccessfrequency,aswellasmoresophisticatedformsofanalysis,suchasfindingthecor啪ontraversalpathsthroughaWebsite.WebLogMiningistheapplicationofdataminingtechniquestoserver109soflargeWebdatarepositoriesinordertoproduceresultsthatcanbeusedinthe
4、designtasksmentionedabove.Inourresearch,weexplaintheconcept,researchworks,keytechnologiesofWeblogminingandrelatedresearchathomeandabroad,andthenusedataminingtechnologytoanalyzetheWebusageinformationofonedistrictgovernmentsoastofindouttheusagepatternandpreferenceofenterprisesandindiVi
5、dualsasthebetterdecision—makingaidforwebsiteexecutives.Thethesisachievesthef01lowingtasks:first,studyingthepreprocessingofrawWeb109,analyzingthedifficultiesanddescribingtheprocess,suchasdatacleaning,useridentification,sessionidentification,pathsupplement:second,onthebaseofanatomizing
6、classicalAprioriA190rithm,improvingtheperformancebyreducingthenumberofitemsetsanddevelopinganewalgorithm,calledM—Apriori:Lhird,studying哈爾濱工程大學(xué)碩士學(xué)位論文thePathPatternMiningtechn0109yandapplyingitintheminingofthewebsiteofthedistrictgovernment,suchasMFP,F(xiàn)P:finally,applyingtheM—AprioriAlgor
7、ithmtotheminingtoolWekatoimproVeitsperformance,thenminingthewebsiteofthedistrictgovernII】entbyimprovedminingtoolWeka,andgivingadVicetoimproVethewebsiteofthedistrictgovernment.Keywords:Weblogmining,assoeiationrule,pathpattern,AprioriA190rithm,Weka哈爾濱工程大學(xué)學(xué)位論文原創(chuàng)性聲明本人鄭重聲明:本論文的所有工作,是在導(dǎo)師的指
8、導(dǎo)下,由作者本人獨(dú)立完成