登录
首页 » Java » nutch-0.8

nutch-0.8

于 2007-07-18 发布 文件大小:445KB
0 235
下载积分: 1 下载次数: 32

代码说明:

  nutch-0.8刚出来不久的一个很好用的搜索引擎工具 nutch-0.8刚出来不久的一个很好用的搜索引(nutch-0.8 has just come out near a very good tool to use search engine nutch-0.8 has just come out soon with a good primer of english)

文件列表:

META-INF
........\MANIFEST.MF
nutch-default.xml
nutch-site.xml
org
...\apache
...\......\nutch
...\......\.....\analysis
...\......\.....\........\AnalyzerFactory.class
...\......\.....\........\CharStream.class
...\......\.....\........\CommonGrams$ArrayTokens.class
...\......\.....\........\CommonGrams$Filter.class
...\......\.....\........\CommonGrams.class
...\......\.....\........\FastCharStream.class
...\......\.....\........\NutchAnalysis$1.class
...\......\.....\........\NutchAnalysis$JJCalls.class
...\......\.....\........\NutchAnalysis$LookaheadSuccess.class
...\......\.....\........\NutchAnalysis.class
...\......\.....\........\NutchAnalysisConstants.class
...\......\.....\........\NutchAnalysisTokenManager.class
...\......\.....\........\NutchAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$1.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorFilter.class
...\......\.....\........\NutchDocumentAnalyzer$ContentAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer.class
...\......\.....\........\NutchDocumentTokenizer.class
...\......\.....\........\ParseException.class
...\......\.....\........\Token.class
...\......\.....\........\TokenManager.class
...\......\.....\........\TokenMgrError.class
...\......\.....\clustering
...\......\.....\..........\HitsCluster.class
...\......\.....\..........\OnlineClusterer$1.class
...\......\.....\..........\OnlineClusterer.class
...\......\.....\..........\OnlineClustererFactory.class
...\......\.....\crawl
...\......\.....\.....\Crawl.class
...\......\.....\.....\CrawlDatum$Comparator.class
...\......\.....\.....\CrawlDatum.class
...\......\.....\.....\CrawlDb.class
...\......\.....\.....\CrawlDbMerger$Merger.class
...\......\.....\.....\CrawlDbMerger.class
...\......\.....\.....\CrawlDbReader$CrawlDbDumpReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNReducer.class
...\......\.....\.....\CrawlDbReader.class
...\......\.....\.....\CrawlDbReducer.class
...\......\.....\.....\Generator$HashComparator.class
...\......\.....\.....\Generator$Selector.class
...\......\.....\.....\Generator$SelectorEntry.class
...\......\.....\.....\Generator$SelectorInverseMapper.class
...\......\.....\.....\Generator.class
...\......\.....\.....\Injector$InjectMapper.class
...\......\.....\.....\Injector$InjectReducer.class
...\......\.....\.....\Injector.class
...\......\.....\.....\Inlink.class
...\......\.....\.....\Inlinks.class
...\......\.....\.....\LinkDb$1.class
...\......\.....\.....\LinkDb$2.class
...\......\.....\.....\LinkDb$Merger.class
...\......\.....\.....\LinkDb.class
...\......\.....\.....\LinkDbMerger.class
...\......\.....\.....\LinkDbReader.class
...\......\.....\.....\MapWritable$ClassIdEntry.class
...\......\.....\.....\MapWritable$KeyValueEntry.class
...\......\.....\.....\MapWritable.class
...\......\.....\.....\MD5Signature.class
...\......\.....\.....\PartitionUrlByHost.class
...\......\.....\.....\Signature.class
...\......\.....\.....\SignatureComparator.class
...\......\.....\.....\SignatureFactory.class
...\......\.....\.....\TextProfileSignature$1.class
...\......\.....\.....\TextProfileSignature$Token.class
...\......\.....\.....\TextProfileSignature$TokenComparator.class
...\......\.....\.....\TextProfileSignature.class
...\......\.....\fetcher
...\......\.....\.......\Fetcher$FetcherThread.class
...\......\.....\.......\Fetcher$InputFormat.class
...\......\.....\.......\Fetcher.class
...\......\.....\.......\FetcherOutput.class
...\......\.....\.......\FetcherOutputFormat$1.class
...\......\.....\.......\FetcherOutputFormat.class
...\......\.....\html
...\......\.....\....\Entities.class
...\......\.....\indexer
...\......\.....\.......\DeleteDuplicates$1.class
...\......\.....\.......\DeleteDuplicates$2.class
...\......\.....\.......\DeleteDuplicates$HashPartitioner.class
...\......\.....\.......\DeleteDuplicates$HashReducer.class
...\......\.....\.......\DeleteDuplicates$HashScore.class
...\......\.....\.......\DeleteDuplicates$IndexDoc.class
...\......\.....\.......\DeleteDuplicates$InputFormat.class
...\......\.....\.......\DeleteDuplicates.class
...\......\.....\.......\FsDirectory$1.class
...\......\.....\.......\FsDirectory$DfsIndexInput$Descriptor.class
...\......\.....\.......\FsDirectory$DfsIndexInput.class
...\......\.....\.......\FsDirectory$DfsIndexOutput.class

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • 006
    说明:  淘宝爬虫urllib实现,可以爬某个商品的价格等(Taobao crawler urlib implementation, can get the price of a commodity)
    2020-09-30 18:11:33下载
    积分:1
  • Chess
    基于剪枝技术的一字棋博弈系统,理解和掌握博弈树的启发式搜索过程,能够用某种程序语言建立一个简单的博弈系统(Pruning techniques based word chess game systems, understand and master the game tree heuristic search process, we can build a simple game system in some programming language)
    2015-12-20 15:56:45下载
    积分:1
  • 翻墙工具
    可以翻墙用谷歌,但是要关闭防火墙才能用。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。
    2023-05-08 00:55:04下载
    积分:1
  • zhizhupc
    使用网络爬虫技术实现自动查找指定网页上的新闻链接(Using web crawler technology automatically find links to news on a given page)
    2012-06-21 16:28:45下载
    积分:1
  • Experiment8
    书本搜索 可以根据关键字搜索客户需要的书(find books)
    2013-12-02 16:33:08下载
    积分:1
  • 打砖块,共六关
    在游戏中,玩家要操作一根屏幕上水平的"棒子",让一颗不断弹来弹去的"球"在撞击作为过关目标消去的"砖块"的途中不会落到屏幕底下。球碰到砖块、棒子与底下以外的三边会反弹,落到底下会失去一颗球,把砖块全部消去就可以破关。一共六关,超级好玩
    2022-04-19 02:35:46下载
    积分:1
  • zhinengsousuo
    智能搜索功能11111111111111111(Intelligent search function)
    2013-11-22 14:24:06下载
    积分:1
  • DuplicateHandle
    利用句柄复制进行文件占坑实现阻止文件删除,外加枚举进程内所有打开的文件、搜索文件句柄所在进程、卸载相关句柄。(Copy files using handles accounting realize pit stop delete files, plus the process of enumeration of all open files, search file handles where the process of unloading the relevant handler.)
    2009-02-20 19:12:36下载
    积分:1
  • PageRank2
    python写的一个pagerank计算的程序,需要用到python处理数组矩阵的模块numpy,pagerank算法在google搜索引擎网页重要程度排序中用到(python write a pagerank calculation procedures, the need to use python modules deal with the array matrix numpy, pagerank algorithm in google search engine ranking the importance of web pages used in)
    2007-10-31 00:42:29下载
    积分:1
  • 1024crawer-master
    说明:  基于python的1024爬虫,可爬下1024的文章和图片放到当前目录上(1024 crawler based on python, which can climb down 1024 articles and pictures and put them on the current directory)
    2020-04-05 17:35:35下载
    积分:1
  • 696516资源总数
  • 106927会员总数
  • 1今日下载