登录
首页 » Java » nutch-0.8

nutch-0.8

于 2007-07-18 发布 文件大小:445KB
0 174
下载积分: 1 下载次数: 32

代码说明:

  nutch-0.8刚出来不久的一个很好用的搜索引擎工具 nutch-0.8刚出来不久的一个很好用的搜索引(nutch-0.8 has just come out near a very good tool to use search engine nutch-0.8 has just come out soon with a good primer of english)

文件列表:

META-INF
........\MANIFEST.MF
nutch-default.xml
nutch-site.xml
org
...\apache
...\......\nutch
...\......\.....\analysis
...\......\.....\........\AnalyzerFactory.class
...\......\.....\........\CharStream.class
...\......\.....\........\CommonGrams$ArrayTokens.class
...\......\.....\........\CommonGrams$Filter.class
...\......\.....\........\CommonGrams.class
...\......\.....\........\FastCharStream.class
...\......\.....\........\NutchAnalysis$1.class
...\......\.....\........\NutchAnalysis$JJCalls.class
...\......\.....\........\NutchAnalysis$LookaheadSuccess.class
...\......\.....\........\NutchAnalysis.class
...\......\.....\........\NutchAnalysisConstants.class
...\......\.....\........\NutchAnalysisTokenManager.class
...\......\.....\........\NutchAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$1.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorFilter.class
...\......\.....\........\NutchDocumentAnalyzer$ContentAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer.class
...\......\.....\........\NutchDocumentTokenizer.class
...\......\.....\........\ParseException.class
...\......\.....\........\Token.class
...\......\.....\........\TokenManager.class
...\......\.....\........\TokenMgrError.class
...\......\.....\clustering
...\......\.....\..........\HitsCluster.class
...\......\.....\..........\OnlineClusterer$1.class
...\......\.....\..........\OnlineClusterer.class
...\......\.....\..........\OnlineClustererFactory.class
...\......\.....\crawl
...\......\.....\.....\Crawl.class
...\......\.....\.....\CrawlDatum$Comparator.class
...\......\.....\.....\CrawlDatum.class
...\......\.....\.....\CrawlDb.class
...\......\.....\.....\CrawlDbMerger$Merger.class
...\......\.....\.....\CrawlDbMerger.class
...\......\.....\.....\CrawlDbReader$CrawlDbDumpReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNReducer.class
...\......\.....\.....\CrawlDbReader.class
...\......\.....\.....\CrawlDbReducer.class
...\......\.....\.....\Generator$HashComparator.class
...\......\.....\.....\Generator$Selector.class
...\......\.....\.....\Generator$SelectorEntry.class
...\......\.....\.....\Generator$SelectorInverseMapper.class
...\......\.....\.....\Generator.class
...\......\.....\.....\Injector$InjectMapper.class
...\......\.....\.....\Injector$InjectReducer.class
...\......\.....\.....\Injector.class
...\......\.....\.....\Inlink.class
...\......\.....\.....\Inlinks.class
...\......\.....\.....\LinkDb$1.class
...\......\.....\.....\LinkDb$2.class
...\......\.....\.....\LinkDb$Merger.class
...\......\.....\.....\LinkDb.class
...\......\.....\.....\LinkDbMerger.class
...\......\.....\.....\LinkDbReader.class
...\......\.....\.....\MapWritable$ClassIdEntry.class
...\......\.....\.....\MapWritable$KeyValueEntry.class
...\......\.....\.....\MapWritable.class
...\......\.....\.....\MD5Signature.class
...\......\.....\.....\PartitionUrlByHost.class
...\......\.....\.....\Signature.class
...\......\.....\.....\SignatureComparator.class
...\......\.....\.....\SignatureFactory.class
...\......\.....\.....\TextProfileSignature$1.class
...\......\.....\.....\TextProfileSignature$Token.class
...\......\.....\.....\TextProfileSignature$TokenComparator.class
...\......\.....\.....\TextProfileSignature.class
...\......\.....\fetcher
...\......\.....\.......\Fetcher$FetcherThread.class
...\......\.....\.......\Fetcher$InputFormat.class
...\......\.....\.......\Fetcher.class
...\......\.....\.......\FetcherOutput.class
...\......\.....\.......\FetcherOutputFormat$1.class
...\......\.....\.......\FetcherOutputFormat.class
...\......\.....\html
...\......\.....\....\Entities.class
...\......\.....\indexer
...\......\.....\.......\DeleteDuplicates$1.class
...\......\.....\.......\DeleteDuplicates$2.class
...\......\.....\.......\DeleteDuplicates$HashPartitioner.class
...\......\.....\.......\DeleteDuplicates$HashReducer.class
...\......\.....\.......\DeleteDuplicates$HashScore.class
...\......\.....\.......\DeleteDuplicates$IndexDoc.class
...\......\.....\.......\DeleteDuplicates$InputFormat.class
...\......\.....\.......\DeleteDuplicates.class
...\......\.....\.......\FsDirectory$1.class
...\......\.....\.......\FsDirectory$DfsIndexInput$Descriptor.class
...\......\.....\.......\FsDirectory$DfsIndexInput.class
...\......\.....\.......\FsDirectory$DfsIndexOutput.class

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • users_spider_new
    微博爬虫,用于爬去某地点附近发生的签到事件,并将数据写入xls文件(Microblogging reptile, used to climb to a place near the attendance event, and write data to the xls file)
    2016-12-26 12:28:54下载
    积分:1
  • googlesf
    说明:  google搜索引擎算法 软件语言 简体中文 运行环境 Delphi (google search engine algorithm software operating environment Delphi Language Simplified Chinese)
    2010-03-19 17:00:12下载
    积分:1
  • 载软件和影视等的软件 Apian
    下载软件和影视等的软件,非常的好用。还可以搜索A片及下载。(Download software and video software, etc., very easy to use. A chip can also search and download.)
    2020-06-26 02:20:02下载
    积分:1
  • proxysearcher
    vs2012开发的代理搜索、验证软件,可以自动搜索代理、网页吸取、google三种模式。(vs2012 development agency search, verification software can automatically search proxy, web lessons, google three modes.)
    2013-12-02 22:07:25下载
    积分:1
  • in0436news
    相关链接、新闻搜索、今日要闻、历史新闻查询,取消查看新闻时生成本地文件(Related links, news search, today highlights, news about history, creating a local file to view the news Cancel)
    2016-03-26 19:26:21下载
    积分:1
  • data_collect.tar
    新浪微薄爬虫,抓取好友信息和好友的关注列表, 保存文件格式为文本(Sina meager reptiles, grab your friends and friends of watchlist information, save the file as text format)
    2014-01-29 18:32:17下载
    积分:1
  • tpxt_v8.1
    企管美女美图小偷功能简介: 1.采用php小偷技术自动同步更新. 2.支持二级目录,二级域名。程序自带后台 3.搜索引擎蜘蛛访问记录(Business Mito beautiful thief Features: 1. Using php thief technology automatic synchronization update. 2. Support secondary directory, two domain names. The program comes with background 3. Search engine spiders access to records)
    2020-11-14 15:09:42下载
    积分:1
  • xx_20030222
    下一代天网文件搜索引擎(next generation Skynet document search engine)
    2005-01-08 11:27:09下载
    积分:1
  • chinafenci
    中文分词,读取txt文档然后给词分类,中文分词,读取txt文档然后给词分类,中文分词,读取txt文档然后给词分类(Chinese word segmentation, read txt document and then to the word classification, the Chinese word segmentation, read txt document and then to the word classification, the Chinese word segmentation, read txt document and then to the word category)
    2009-11-18 23:03:20下载
    积分:1
  • Crawler4OtherTransit
    爬虫,简单语句,爬虫快速 支持单线程操作,容易改正,并有注释(Reptiles, simple statement, reptiles fast support single-threaded operation, easy to correct, and Notes)
    2012-09-24 10:46:49下载
    积分:1
  • 696518资源总数
  • 105559会员总数
  • 1今日下载