登录
首页 » Java » nutch-0.8

nutch-0.8

于 2007-07-18 发布 文件大小:445KB
0 84
下载积分: 1 下载次数: 32

代码说明:

  nutch-0.8刚出来不久的一个很好用的搜索引擎工具 nutch-0.8刚出来不久的一个很好用的搜索引(nutch-0.8 has just come out near a very good tool to use search engine nutch-0.8 has just come out soon with a good primer of english)

文件列表:

META-INF
........\MANIFEST.MF
nutch-default.xml
nutch-site.xml
org
...\apache
...\......\nutch
...\......\.....\analysis
...\......\.....\........\AnalyzerFactory.class
...\......\.....\........\CharStream.class
...\......\.....\........\CommonGrams$ArrayTokens.class
...\......\.....\........\CommonGrams$Filter.class
...\......\.....\........\CommonGrams.class
...\......\.....\........\FastCharStream.class
...\......\.....\........\NutchAnalysis$1.class
...\......\.....\........\NutchAnalysis$JJCalls.class
...\......\.....\........\NutchAnalysis$LookaheadSuccess.class
...\......\.....\........\NutchAnalysis.class
...\......\.....\........\NutchAnalysisConstants.class
...\......\.....\........\NutchAnalysisTokenManager.class
...\......\.....\........\NutchAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$1.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorFilter.class
...\......\.....\........\NutchDocumentAnalyzer$ContentAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer.class
...\......\.....\........\NutchDocumentTokenizer.class
...\......\.....\........\ParseException.class
...\......\.....\........\Token.class
...\......\.....\........\TokenManager.class
...\......\.....\........\TokenMgrError.class
...\......\.....\clustering
...\......\.....\..........\HitsCluster.class
...\......\.....\..........\OnlineClusterer$1.class
...\......\.....\..........\OnlineClusterer.class
...\......\.....\..........\OnlineClustererFactory.class
...\......\.....\crawl
...\......\.....\.....\Crawl.class
...\......\.....\.....\CrawlDatum$Comparator.class
...\......\.....\.....\CrawlDatum.class
...\......\.....\.....\CrawlDb.class
...\......\.....\.....\CrawlDbMerger$Merger.class
...\......\.....\.....\CrawlDbMerger.class
...\......\.....\.....\CrawlDbReader$CrawlDbDumpReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNReducer.class
...\......\.....\.....\CrawlDbReader.class
...\......\.....\.....\CrawlDbReducer.class
...\......\.....\.....\Generator$HashComparator.class
...\......\.....\.....\Generator$Selector.class
...\......\.....\.....\Generator$SelectorEntry.class
...\......\.....\.....\Generator$SelectorInverseMapper.class
...\......\.....\.....\Generator.class
...\......\.....\.....\Injector$InjectMapper.class
...\......\.....\.....\Injector$InjectReducer.class
...\......\.....\.....\Injector.class
...\......\.....\.....\Inlink.class
...\......\.....\.....\Inlinks.class
...\......\.....\.....\LinkDb$1.class
...\......\.....\.....\LinkDb$2.class
...\......\.....\.....\LinkDb$Merger.class
...\......\.....\.....\LinkDb.class
...\......\.....\.....\LinkDbMerger.class
...\......\.....\.....\LinkDbReader.class
...\......\.....\.....\MapWritable$ClassIdEntry.class
...\......\.....\.....\MapWritable$KeyValueEntry.class
...\......\.....\.....\MapWritable.class
...\......\.....\.....\MD5Signature.class
...\......\.....\.....\PartitionUrlByHost.class
...\......\.....\.....\Signature.class
...\......\.....\.....\SignatureComparator.class
...\......\.....\.....\SignatureFactory.class
...\......\.....\.....\TextProfileSignature$1.class
...\......\.....\.....\TextProfileSignature$Token.class
...\......\.....\.....\TextProfileSignature$TokenComparator.class
...\......\.....\.....\TextProfileSignature.class
...\......\.....\fetcher
...\......\.....\.......\Fetcher$FetcherThread.class
...\......\.....\.......\Fetcher$InputFormat.class
...\......\.....\.......\Fetcher.class
...\......\.....\.......\FetcherOutput.class
...\......\.....\.......\FetcherOutputFormat$1.class
...\......\.....\.......\FetcherOutputFormat.class
...\......\.....\html
...\......\.....\....\Entities.class
...\......\.....\indexer
...\......\.....\.......\DeleteDuplicates$1.class
...\......\.....\.......\DeleteDuplicates$2.class
...\......\.....\.......\DeleteDuplicates$HashPartitioner.class
...\......\.....\.......\DeleteDuplicates$HashReducer.class
...\......\.....\.......\DeleteDuplicates$HashScore.class
...\......\.....\.......\DeleteDuplicates$IndexDoc.class
...\......\.....\.......\DeleteDuplicates$InputFormat.class
...\......\.....\.......\DeleteDuplicates.class
...\......\.....\.......\FsDirectory$1.class
...\......\.....\.......\FsDirectory$DfsIndexInput$Descriptor.class
...\......\.....\.......\FsDirectory$DfsIndexInput.class
...\......\.....\.......\FsDirectory$DfsIndexOutput.class

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • PageRank2
    python写的一个pagerank计算的程序,需要用到python处理数组矩阵的模块numpy,pagerank算法在google搜索引擎网页重要程度排序中用到(python write a pagerank calculation procedures, the need to use python modules deal with the array matrix numpy, pagerank algorithm in google search engine ranking the importance of web pages used in)
    2007-10-31 00:42:29下载
    积分:1
  • cjk_baidu
    该小偷程序模拟baidu搜索功能,将输入的搜索词发送到baidu搜索,然后将返回的搜索结果进行解析处理,功能强大,一用就明白。(The thief program to simulate baidu search function, enter search terms sent to baidu search, and then parse the returned search results processing, powerful, with a will understand.)
    2016-10-24 22:06:52下载
    积分:1
  • HL-Python-Reptile
    说明:  用于获取页面信息以及获取页面图片,并下载保存到本地(Used to obtain page information and pictures, and download and save them locally)
    2020-01-20 15:20:59下载
    积分:1
  • zhinengsousuo
    智能搜索功能11111111111111111(Intelligent search function)
    2013-11-22 14:24:06下载
    积分:1
  • compass
    说明:  compass对lucene行进了封装 这个是对compass的应用,index的建立,搜索以及高级搜索(compass on the lucene marching compass of the package, this is the application, index creation, search and advanced search)
    2010-03-29 14:50:25下载
    积分:1
  • 1
    说明:  自己动手写搜索引擎第三章代码,随书光盘中的内容,整个太大,只能分别上传(Chapter code search engine to write himself, with the contents of the CD-ROM, the whole is too big, we were only able to upload)
    2013-03-05 10:55:47下载
    积分:1
  • soso
    集成11个搜索引擎的山寨搜索引擎,HTML版的 LOGO 自己做个就可以了(Integrated 11 search engines cottage search engine, HTML version of the LOGO to be a can of their own)
    2010-05-12 16:09:00下载
    积分:1
  • 均匀传输线理论
    描述均匀传输线理论,包括分布参数电路,均匀传输线方程,均匀传输线方程的正铉稳态解,方程原参数及复参数,无损耗传输线及其方程通解,无损耗线的波过程等内容。
    2022-01-30 19:45:12下载
    积分:1
  • yahoo-pang
    这个是一个雅虎搜索的爬虫,首先你在雅虎搜索上输入你要搜素的东西,然后程序可以自动爬取下来。默认爬取20页,你也可以修改(This is a Yahoo! search of reptiles, first you enter the search elements you want to search something on Yahoo, then the program can automatically crawl down. The default crawling 20, you can also modify)
    2016-11-08 11:08:47下载
    积分:1
  • 3--blog_move-4-18
    新浪博客,CSDN博客,腾讯空间的简单的爬虫系统源码,java版。(blog.sina.com,csdn, qzone, spider java source)
    2013-06-17 11:05:45下载
    积分:1
  • 696532资源总数
  • 103702会员总数
  • 113今日下载