資源描述:
《計算機網(wǎng)絡(luò) 外文文獻 英文文獻 外文翻譯 探討搜索引擎爬蟲.doc》由會員上傳分享,免費在線閱讀,更多相關(guān)內(nèi)容在工程資料-天天文庫。
1、[13].S?Chakrabarti,K?Punera,M.Subramanyam,"Acceleratedfocusedcrawlingthroughonlinerelevancefeedback,VWW2002.pp.148-159.[14].M.Diligenti,F.Coetzee,S.Lawrence,C.L.Giles,M.Gori,"FocusedCrawlingUsingContextGraphs",VLDB20009pp.527-534.[15].C.Aggarual,F?Al-Garawi,P.Yu,''Intelligentcra
2、wlingontheWorldWideWebwitharbitrarypredicates",WWW2001.pp?96-105.[16[.C.Chung,C.Clarke,'Topic-orientedcollaborativecrawling;CIKM2002,pp.3M[17].Brin,SergeyandPageLawrence.<4Theanatomyofalarge-scalehypcrtcxfualWebsearchengine5'?ComputerNe^orksandISDNSystems,April1998[18].Grossan,B?
3、"SearchEngines:Whattheyare,howtheywork,andpracticalsuggestionsforgettingthemostoutofthem/'Februan1997.[19].?WcbicfQiencc.com[20].Chakrabcirti,Soumcn."MiningtheWeb:AnalysisofHypertextandSemiStructured2003.[21].JunHirai,SriramRaghavaii,IlectorGarcia-Molina,andAndreasPaepcke?WebBase
4、:ArepositoryofWebpages?InProceedingsoftheNinthInternationalWorldWideWebConference,pages277-293,May2000.[22].TheInternetArchive.hg://www?nichivc?oig/j23j?MartijnKoster.TheWebRobotsPages.h(tD:〃infb?¥cbcmwlcr?com/mak/iMoiccts/rob(Hs/roho(s?h(ml[24].OliverA.McBryan?GENVI,andWWWW:Too
5、lsforTamingtheWeb.InProceedingsoftheFirstIntemationcdWorldWideWebConference^pages79-90,1994.[25].BrianPinkerton.FindingWhatPeopleWant:ExperienceswiththeWebCrawler.InProceedingsoftheSecondInternationalWorldWideWebConference,1994.[26」.MikeBurner.CrawlingtowardsEternity:Buildinganar
6、chiveoftheWorldWideWeb.WebTechniquesMagazine,2(5),May1997.[27」.JunghooCho,HectorGarcia-Molina,andLawrencePage.EfficientcrawlingthroughURLordering.DiscussiononWebCrawlersofSearchEngineM.P.S.Bhatia*,DivyaGupta***NetajiSubhasInstituteofTechnology,UniversityofDellii,India,**GuruPremS
7、ukliMemorialCollegeofEngineering,GGSIPUniversity,DelhiAbstractWiththeprecipitousexpansionoftheWeb,extractingknowledgefronttheWebisbecominggraduallyimportantandpopular.ThisisduetotheWebsconvenienceandrichnessofinformation.Tofi)rdWebpages,onetypicallyusessearchenginesthatarebasedon
8、theWebcrawlingframework.Thispaperdescrib