登录
首页 » Java » nutch-0.8

nutch-0.8

于 2007-07-18 发布 文件大小:445KB
0 188
下载积分: 1 下载次数: 32

代码说明:

  nutch-0.8刚出来不久的一个很好用的搜索引擎工具 nutch-0.8刚出来不久的一个很好用的搜索引(nutch-0.8 has just come out near a very good tool to use search engine nutch-0.8 has just come out soon with a good primer of english)

文件列表:

META-INF
........\MANIFEST.MF
nutch-default.xml
nutch-site.xml
org
...\apache
...\......\nutch
...\......\.....\analysis
...\......\.....\........\AnalyzerFactory.class
...\......\.....\........\CharStream.class
...\......\.....\........\CommonGrams$ArrayTokens.class
...\......\.....\........\CommonGrams$Filter.class
...\......\.....\........\CommonGrams.class
...\......\.....\........\FastCharStream.class
...\......\.....\........\NutchAnalysis$1.class
...\......\.....\........\NutchAnalysis$JJCalls.class
...\......\.....\........\NutchAnalysis$LookaheadSuccess.class
...\......\.....\........\NutchAnalysis.class
...\......\.....\........\NutchAnalysisConstants.class
...\......\.....\........\NutchAnalysisTokenManager.class
...\......\.....\........\NutchAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$1.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorFilter.class
...\......\.....\........\NutchDocumentAnalyzer$ContentAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer.class
...\......\.....\........\NutchDocumentTokenizer.class
...\......\.....\........\ParseException.class
...\......\.....\........\Token.class
...\......\.....\........\TokenManager.class
...\......\.....\........\TokenMgrError.class
...\......\.....\clustering
...\......\.....\..........\HitsCluster.class
...\......\.....\..........\OnlineClusterer$1.class
...\......\.....\..........\OnlineClusterer.class
...\......\.....\..........\OnlineClustererFactory.class
...\......\.....\crawl
...\......\.....\.....\Crawl.class
...\......\.....\.....\CrawlDatum$Comparator.class
...\......\.....\.....\CrawlDatum.class
...\......\.....\.....\CrawlDb.class
...\......\.....\.....\CrawlDbMerger$Merger.class
...\......\.....\.....\CrawlDbMerger.class
...\......\.....\.....\CrawlDbReader$CrawlDbDumpReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNReducer.class
...\......\.....\.....\CrawlDbReader.class
...\......\.....\.....\CrawlDbReducer.class
...\......\.....\.....\Generator$HashComparator.class
...\......\.....\.....\Generator$Selector.class
...\......\.....\.....\Generator$SelectorEntry.class
...\......\.....\.....\Generator$SelectorInverseMapper.class
...\......\.....\.....\Generator.class
...\......\.....\.....\Injector$InjectMapper.class
...\......\.....\.....\Injector$InjectReducer.class
...\......\.....\.....\Injector.class
...\......\.....\.....\Inlink.class
...\......\.....\.....\Inlinks.class
...\......\.....\.....\LinkDb$1.class
...\......\.....\.....\LinkDb$2.class
...\......\.....\.....\LinkDb$Merger.class
...\......\.....\.....\LinkDb.class
...\......\.....\.....\LinkDbMerger.class
...\......\.....\.....\LinkDbReader.class
...\......\.....\.....\MapWritable$ClassIdEntry.class
...\......\.....\.....\MapWritable$KeyValueEntry.class
...\......\.....\.....\MapWritable.class
...\......\.....\.....\MD5Signature.class
...\......\.....\.....\PartitionUrlByHost.class
...\......\.....\.....\Signature.class
...\......\.....\.....\SignatureComparator.class
...\......\.....\.....\SignatureFactory.class
...\......\.....\.....\TextProfileSignature$1.class
...\......\.....\.....\TextProfileSignature$Token.class
...\......\.....\.....\TextProfileSignature$TokenComparator.class
...\......\.....\.....\TextProfileSignature.class
...\......\.....\fetcher
...\......\.....\.......\Fetcher$FetcherThread.class
...\......\.....\.......\Fetcher$InputFormat.class
...\......\.....\.......\Fetcher.class
...\......\.....\.......\FetcherOutput.class
...\......\.....\.......\FetcherOutputFormat$1.class
...\......\.....\.......\FetcherOutputFormat.class
...\......\.....\html
...\......\.....\....\Entities.class
...\......\.....\indexer
...\......\.....\.......\DeleteDuplicates$1.class
...\......\.....\.......\DeleteDuplicates$2.class
...\......\.....\.......\DeleteDuplicates$HashPartitioner.class
...\......\.....\.......\DeleteDuplicates$HashReducer.class
...\......\.....\.......\DeleteDuplicates$HashScore.class
...\......\.....\.......\DeleteDuplicates$IndexDoc.class
...\......\.....\.......\DeleteDuplicates$InputFormat.class
...\......\.....\.......\DeleteDuplicates.class
...\......\.....\.......\FsDirectory$1.class
...\......\.....\.......\FsDirectory$DfsIndexInput$Descriptor.class
...\......\.....\.......\FsDirectory$DfsIndexInput.class
...\......\.....\.......\FsDirectory$DfsIndexOutput.class

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • in0436news
    相关链接、新闻搜索、今日要闻、历史新闻查询,取消查看新闻时生成本地文件(Related links, news search, today highlights, news about history, creating a local file to view the news Cancel)
    2016-03-26 19:26:21下载
    积分:1
  • 均匀传输线理论
    描述均匀传输线理论,包括分布参数电路,均匀传输线方程,均匀传输线方程的正铉稳态解,方程原参数及复参数,无损耗传输线及其方程通解,无损耗线的波过程等内容。
    2022-01-30 19:45:12下载
    积分:1
  • uneshop_2005
    软件介绍 操作简单,功能上比较实用,特别对爬虫类搜索引擎,如google,yahoo 等做了URL,title等优化控制(Software introduction The operation is simple, the function is more practical, especially for the crawler search engines, such as Google, Yahoo and so on to do the URL, title and other optimization control)
    2016-05-10 13:21:42下载
    积分:1
  • jas代码
    深入了解js代码,有利于开发和利用。
    2022-07-11 15:07:11下载
    积分:1
  • Chess
    基于剪枝技术的一字棋博弈系统,理解和掌握博弈树的启发式搜索过程,能够用某种程序语言建立一个简单的博弈系统(Pruning techniques based word chess game systems, understand and master the game tree heuristic search process, we can build a simple game system in some programming language)
    2015-12-20 15:56:45下载
    积分:1
  • WebCrawlers
    搜索是网络技术的热门课题,网络爬虫是搜索技术的基础,用VC++编写的网络爬虫可以很好的学习这门课题,适合初学者(Search is a hot topic of network technology, network reptiles is based on search technology, using VC++ Prepared reptiles can be a very good network of learning in this subject, suitable for beginners)
    2008-06-29 23:48:22下载
    积分:1
  • 复旦大学自然语言处理包(含示例)
    复旦大学自然语言处理包,由复旦大学研究的中文自然语言处理包,内有全部源码及使用示例。对于搜索引擎、分词器研究和应用非常有用。
    2022-01-25 17:48:28下载
    积分:1
  • users_spider_new
    微博爬虫,用于爬去某地点附近发生的签到事件,并将数据写入xls文件(Microblogging reptile, used to climb to a place near the attendance event, and write data to the xls file)
    2016-12-26 12:28:54下载
    积分:1
  • DuplicateHandle
    利用句柄复制进行文件占坑实现阻止文件删除,外加枚举进程内所有打开的文件、搜索文件句柄所在进程、卸载相关句柄。(Copy files using handles accounting realize pit stop delete files, plus the process of enumeration of all open files, search file handles where the process of unloading the relevant handler.)
    2009-02-20 19:12:36下载
    积分:1
  • SimpleSpider-master
    使用libevent和nanomsg开发的网络爬虫,内附教程(libevent and nanomsg Web Crawler)
    2021-01-26 15:58:37下载
    积分:1
  • 696518资源总数
  • 105873会员总数
  • 12今日下载