nutch-0.8

Published 2007-07-18. File size: 445 KB
Download points: 1. Downloads: 32

Code description:

  nutch-0.8 is a recently released, very easy-to-use search engine tool.

File list:

META-INF
........\MANIFEST.MF
nutch-default.xml
nutch-site.xml
org
...\apache
...\......\nutch
...\......\.....\analysis
...\......\.....\........\AnalyzerFactory.class
...\......\.....\........\CharStream.class
...\......\.....\........\CommonGrams$ArrayTokens.class
...\......\.....\........\CommonGrams$Filter.class
...\......\.....\........\CommonGrams.class
...\......\.....\........\FastCharStream.class
...\......\.....\........\NutchAnalysis$1.class
...\......\.....\........\NutchAnalysis$JJCalls.class
...\......\.....\........\NutchAnalysis$LookaheadSuccess.class
...\......\.....\........\NutchAnalysis.class
...\......\.....\........\NutchAnalysisConstants.class
...\......\.....\........\NutchAnalysisTokenManager.class
...\......\.....\........\NutchAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$1.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorFilter.class
...\......\.....\........\NutchDocumentAnalyzer$ContentAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer.class
...\......\.....\........\NutchDocumentTokenizer.class
...\......\.....\........\ParseException.class
...\......\.....\........\Token.class
...\......\.....\........\TokenManager.class
...\......\.....\........\TokenMgrError.class
...\......\.....\clustering
...\......\.....\..........\HitsCluster.class
...\......\.....\..........\OnlineClusterer$1.class
...\......\.....\..........\OnlineClusterer.class
...\......\.....\..........\OnlineClustererFactory.class
...\......\.....\crawl
...\......\.....\.....\Crawl.class
...\......\.....\.....\CrawlDatum$Comparator.class
...\......\.....\.....\CrawlDatum.class
...\......\.....\.....\CrawlDb.class
...\......\.....\.....\CrawlDbMerger$Merger.class
...\......\.....\.....\CrawlDbMerger.class
...\......\.....\.....\CrawlDbReader$CrawlDbDumpReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNReducer.class
...\......\.....\.....\CrawlDbReader.class
...\......\.....\.....\CrawlDbReducer.class
...\......\.....\.....\Generator$HashComparator.class
...\......\.....\.....\Generator$Selector.class
...\......\.....\.....\Generator$SelectorEntry.class
...\......\.....\.....\Generator$SelectorInverseMapper.class
...\......\.....\.....\Generator.class
...\......\.....\.....\Injector$InjectMapper.class
...\......\.....\.....\Injector$InjectReducer.class
...\......\.....\.....\Injector.class
...\......\.....\.....\Inlink.class
...\......\.....\.....\Inlinks.class
...\......\.....\.....\LinkDb$1.class
...\......\.....\.....\LinkDb$2.class
...\......\.....\.....\LinkDb$Merger.class
...\......\.....\.....\LinkDb.class
...\......\.....\.....\LinkDbMerger.class
...\......\.....\.....\LinkDbReader.class
...\......\.....\.....\MapWritable$ClassIdEntry.class
...\......\.....\.....\MapWritable$KeyValueEntry.class
...\......\.....\.....\MapWritable.class
...\......\.....\.....\MD5Signature.class
...\......\.....\.....\PartitionUrlByHost.class
...\......\.....\.....\Signature.class
...\......\.....\.....\SignatureComparator.class
...\......\.....\.....\SignatureFactory.class
...\......\.....\.....\TextProfileSignature$1.class
...\......\.....\.....\TextProfileSignature$Token.class
...\......\.....\.....\TextProfileSignature$TokenComparator.class
...\......\.....\.....\TextProfileSignature.class
...\......\.....\fetcher
...\......\.....\.......\Fetcher$FetcherThread.class
...\......\.....\.......\Fetcher$InputFormat.class
...\......\.....\.......\Fetcher.class
...\......\.....\.......\FetcherOutput.class
...\......\.....\.......\FetcherOutputFormat$1.class
...\......\.....\.......\FetcherOutputFormat.class
...\......\.....\html
...\......\.....\....\Entities.class
...\......\.....\indexer
...\......\.....\.......\DeleteDuplicates$1.class
...\......\.....\.......\DeleteDuplicates$2.class
...\......\.....\.......\DeleteDuplicates$HashPartitioner.class
...\......\.....\.......\DeleteDuplicates$HashReducer.class
...\......\.....\.......\DeleteDuplicates$HashScore.class
...\......\.....\.......\DeleteDuplicates$IndexDoc.class
...\......\.....\.......\DeleteDuplicates$InputFormat.class
...\......\.....\.......\DeleteDuplicates.class
...\......\.....\.......\FsDirectory$1.class
...\......\.....\.......\FsDirectory$DfsIndexInput$Descriptor.class
...\......\.....\.......\FsDirectory$DfsIndexInput.class
...\......\.....\.......\FsDirectory$DfsIndexOutput.class

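  • Improvement of the interval-average downward-float bilateral curve algorithm
    In real bidding, competitors' strategies change with the market environment and their own circumstances, so even for the same package (identical goods types and quantities), the same manufacturer may quote different prices in different batches. Note also that the price reduction ratio (also called the reduction coefficient) and the score-deduction rate exponent affect the quote.
    2022-03-19 10:29:23 Download
    Points: 1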
  • auto_spyder4jiandan
    Uses a Python crawler to fetch images from multiple pages of the 妹子图 section of 煎蛋网 (Jandan) and save them locally.
    2016-06-01 10:56:16 Download
    Points: 1
  • zhinengsousuo
    Intelligent search function.
    2013-11-22 14:24:06 Download
    Points: 1
  • 57594
    A simple search program with source code. The example uses the 易语言 (Easy Language) application interface support library and calls the "open specified URL" command to search according to the URL's rules.
    2016-12-13 10:00:10 Download
    Points: 1
  • xbbs1.3
    1. Added a search module. 2. Fixed a security vulnerability. 3. Interface changes.
    2016-03-10 21:12:56 Download
    Points: 1
  • a
    From a search engine collection: design and verification of a tree-structured web search engine.
    2008-05-21 21:46:33 Download
    Points: 1
  • pageRank
    Uses the PageRank algorithm to rank resources fetched by a web crawler.
    2020-11-27 08:19:30 Download
    Points: 1
  • python_sina_crawl
    A crawler for Sina Weibo. To run it: save all the code, open Main.py, and set LoginName to your Sina Weibo account and PassWord to your password. Then run Main.py; the program creates a CrawledPages folder in the current directory and saves all crawled files there.
    2021-04-08 16:39:00 Download
    Points: 1
  • E90
    Coding for the CIC module when retrofitting the factory camera on BMW E-chassis cars.
    2022-04-26 15:04:36 Download
    Points: 1
  • Fastsearch
    search: a fast file search tool that improves on the file search in Win7/Win8 and can search for keywords within file contents.
    2015-06-27 21:17:22 Download
    Points: 1