登录
首页 » Java » nutch-0.8

nutch-0.8

于 2007-07-18 发布 文件大小:445KB
0 210
下载积分: 1 下载次数: 32

代码说明:

  nutch-0.8刚出来不久的一个很好用的搜索引擎工具 nutch-0.8刚出来不久的一个很好用的搜索引(nutch-0.8 has just come out near a very good tool to use search engine nutch-0.8 has just come out soon with a good primer of english)

文件列表:

META-INF
........\MANIFEST.MF
nutch-default.xml
nutch-site.xml
org
...\apache
...\......\nutch
...\......\.....\analysis
...\......\.....\........\AnalyzerFactory.class
...\......\.....\........\CharStream.class
...\......\.....\........\CommonGrams$ArrayTokens.class
...\......\.....\........\CommonGrams$Filter.class
...\......\.....\........\CommonGrams.class
...\......\.....\........\FastCharStream.class
...\......\.....\........\NutchAnalysis$1.class
...\......\.....\........\NutchAnalysis$JJCalls.class
...\......\.....\........\NutchAnalysis$LookaheadSuccess.class
...\......\.....\........\NutchAnalysis.class
...\......\.....\........\NutchAnalysisConstants.class
...\......\.....\........\NutchAnalysisTokenManager.class
...\......\.....\........\NutchAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$1.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorFilter.class
...\......\.....\........\NutchDocumentAnalyzer$ContentAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer.class
...\......\.....\........\NutchDocumentTokenizer.class
...\......\.....\........\ParseException.class
...\......\.....\........\Token.class
...\......\.....\........\TokenManager.class
...\......\.....\........\TokenMgrError.class
...\......\.....\clustering
...\......\.....\..........\HitsCluster.class
...\......\.....\..........\OnlineClusterer$1.class
...\......\.....\..........\OnlineClusterer.class
...\......\.....\..........\OnlineClustererFactory.class
...\......\.....\crawl
...\......\.....\.....\Crawl.class
...\......\.....\.....\CrawlDatum$Comparator.class
...\......\.....\.....\CrawlDatum.class
...\......\.....\.....\CrawlDb.class
...\......\.....\.....\CrawlDbMerger$Merger.class
...\......\.....\.....\CrawlDbMerger.class
...\......\.....\.....\CrawlDbReader$CrawlDbDumpReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNReducer.class
...\......\.....\.....\CrawlDbReader.class
...\......\.....\.....\CrawlDbReducer.class
...\......\.....\.....\Generator$HashComparator.class
...\......\.....\.....\Generator$Selector.class
...\......\.....\.....\Generator$SelectorEntry.class
...\......\.....\.....\Generator$SelectorInverseMapper.class
...\......\.....\.....\Generator.class
...\......\.....\.....\Injector$InjectMapper.class
...\......\.....\.....\Injector$InjectReducer.class
...\......\.....\.....\Injector.class
...\......\.....\.....\Inlink.class
...\......\.....\.....\Inlinks.class
...\......\.....\.....\LinkDb$1.class
...\......\.....\.....\LinkDb$2.class
...\......\.....\.....\LinkDb$Merger.class
...\......\.....\.....\LinkDb.class
...\......\.....\.....\LinkDbMerger.class
...\......\.....\.....\LinkDbReader.class
...\......\.....\.....\MapWritable$ClassIdEntry.class
...\......\.....\.....\MapWritable$KeyValueEntry.class
...\......\.....\.....\MapWritable.class
...\......\.....\.....\MD5Signature.class
...\......\.....\.....\PartitionUrlByHost.class
...\......\.....\.....\Signature.class
...\......\.....\.....\SignatureComparator.class
...\......\.....\.....\SignatureFactory.class
...\......\.....\.....\TextProfileSignature$1.class
...\......\.....\.....\TextProfileSignature$Token.class
...\......\.....\.....\TextProfileSignature$TokenComparator.class
...\......\.....\.....\TextProfileSignature.class
...\......\.....\fetcher
...\......\.....\.......\Fetcher$FetcherThread.class
...\......\.....\.......\Fetcher$InputFormat.class
...\......\.....\.......\Fetcher.class
...\......\.....\.......\FetcherOutput.class
...\......\.....\.......\FetcherOutputFormat$1.class
...\......\.....\.......\FetcherOutputFormat.class
...\......\.....\html
...\......\.....\....\Entities.class
...\......\.....\indexer
...\......\.....\.......\DeleteDuplicates$1.class
...\......\.....\.......\DeleteDuplicates$2.class
...\......\.....\.......\DeleteDuplicates$HashPartitioner.class
...\......\.....\.......\DeleteDuplicates$HashReducer.class
...\......\.....\.......\DeleteDuplicates$HashScore.class
...\......\.....\.......\DeleteDuplicates$IndexDoc.class
...\......\.....\.......\DeleteDuplicates$InputFormat.class
...\......\.....\.......\DeleteDuplicates.class
...\......\.....\.......\FsDirectory$1.class
...\......\.....\.......\FsDirectory$DfsIndexInput$Descriptor.class
...\......\.....\.......\FsDirectory$DfsIndexInput.class
...\......\.....\.......\FsDirectory$DfsIndexOutput.class

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • proxysearcher
    vs2012开发的代理搜索、验证软件,可以自动搜索代理、网页吸取、google三种模式。(vs2012 development agency search, verification software can automatically search proxy, web lessons, google three modes.)
    2013-12-02 22:07:25下载
    积分:1
  • svpWm fpga实现
    用fpga实现svpwm,以实现,用quartusII编写的,很好,比用dsp效果要好,所以很推荐,希望帮助大家学习,共同进步了。
    2023-02-03 06:05:08下载
    积分:1
  • 048575
    百度搜索源码例程,程序结合易语言超文本浏览框支持库,提交URL搜索地址在百度进行搜索。(Baidu search code samples , combined with easy language program hypertext browsing box support library , submit URL address search Baidu search.)
    2016-01-04 15:04:09下载
    积分:1
  • 知网爬虫
    说明:  自己写的一个关于python的小型爬虫,性能感觉还算可以。(Himself wrote a small crawler about python, performance feel return calculate can.)
    2020-12-09 19:29:27下载
    积分:1
  • MetaSeeker-4.11.2
    主要应用领域: • 垂直搜索(Vertical Search):也称为专业搜索,高速、海量和精确抓取是定题网络爬虫DataScraper的强项,每天24小时每周7天无人值守自主调度的周期性批量采集,加上断点续传和软件看门狗(Watch Dog),确保您高枕无忧 • 移动互联网:手机搜索、手机混搭(mashup)、移动社交网络、移动电子商务都离不开结构化的数据内容,DataScraper实时高效地 采集内容,输出富含语义元数据的XML格式的抓取结果文件,确保自动化的数据集成和加工,跨越小尺寸屏幕展现和高精准信息检索的障碍。手机互联网不是 Web的子集而是全部,由MetaSeeker架设桥梁 • 企业竞争情报采集/数据挖掘:俗称商业智能(Business Intelligence),噪音信息滤除、结构化转换,确保数据的准确性和时效性,独有的广域分布式架构,赋予DataScraper无与伦比的情报采 集渗透能力,AJAX/Javascript动态页面、服务器动态网页、静态页面、各种鉴权认证机制,一视同仁。在微博网站数据采集和舆情监测领域远远领 先其它产品。(The main application areas: • Vertical Search (Vertical Search): also known as professional search, speed, mass and precision is the SDI Web crawler to crawl the strengths DataScraper 24 hours a day 7 days a week periodic unattended batch capture self-scheduling, Canada and software watchdog on the HTTP (Watch Dog), make sure you sit back and relax • Mobile Internet: mobile search, mobile mashups (mashup), mobile social networking, mobile commerce are inseparable from the structure of the data content, DataScraper efficiently capture real-time content, the output is rich semantic metadata XML format for the capture outcome document, to ensure that automated data integration and processing, across the small size screen display and high precision information retrieval obstacles. Mobile Internet is not a subset of Web but all, by building bridges MetaSeeker • Competitive intelligence gathering/data mining: commonly known as Business Intelligence (Business Intelli)
    2011-06-14 20:36:50下载
    积分:1
  • 作业
    应用背景清华大学搜索引擎pagerank算法,采用C编写而成,处理大数据,大规模的网页,能够完美运行,数据格式为a-b,a代表源网页,b代表目的网页关键技术pagerank算法,大数据处理,数据格式见应用背景,编译环境是dev-c++,在windows下完美运行
    2022-03-03 09:44:49下载
    积分:1
  • 51job
    51job自动登录 投放简历,搜索职位 刷新简历,搜索职位(Auto Login 51job running resume, search jobs refresh resume, search jobs)
    2009-06-25 16:55:08下载
    积分:1
  • address_book
    让一个地址簿,包括ID、名称、 电话号码、地址和公司。 可以插入、显示、搜索和删除。(make an address book,include ID,Name, Phone number,Address and company number. can Insert,Display,Search and Delete.)
    2012-12-31 22:49:58下载
    积分:1
  • yttx
    1.增加页面浏览统计功能 2.优化了统计代码,统计执行更快 用户:admin 密码:123456">
    2013-01-16 21:36:27下载
    积分:1
  • tpxt_v8.1
    企管美女美图小偷功能简介: 1.采用php小偷技术自动同步更新. 2.支持二级目录,二级域名。程序自带后台 3.搜索引擎蜘蛛访问记录(Business Mito beautiful thief Features: 1. Using php thief technology automatic synchronization update. 2. Support secondary directory, two domain names. The program comes with background 3. Search engine spiders access to records)
    2020-11-14 15:09:42下载
    积分:1
  • 696516资源总数
  • 106409会员总数
  • 8今日下载