-
Chinese-WordCut
这是一个中文分词程序,读入一个Txt文档,可以对里面的段落进行分词(This is a Chinese word segmentation program that reads a Txt document segmentation paragraphs inside)
- 2012-11-18 17:44:16下载
- 积分:1
-
m_seq
此函数用来生成最大长度线性移位寄存器序列(m序列)(This function used to generate the maximum length linear shift register sequence (m sequence))
- 2008-05-05 19:37:59下载
- 积分:1
-
ChineseSegment
根据输入的中文词来进行检索~检索出用户想要的内容(ChineseSegment)
- 2009-09-11 21:39:14下载
- 积分:1
-
HMM
说明: HMM中文分词,基于隐马尔科夫模型 。需要进行训练后试验(Word Segmentation Based on Hidden Markov Model)
- 2019-04-07 11:58:08下载
- 积分:1
-
4305685
应用中文分词源码程序,结合易语言模块彗星HTTP应用模块.ec,实现中文分词的效果。(Application of Chinese Word source program, combined with easy language module Comet HTTP application modules .ec, realize the effect of the Chinese word .)
- 2017-01-11 23:13:31下载
- 积分:1
-
tranditionized
中文简繁转换 GreenBrowser/TheWorld2.0插件(Tranditional Chinese Script Conversion GreenBrowser/TheWorld2.0 Plug-in)
- 2010-02-24 19:20:05下载
- 积分:1
-
201411149222244
随便下载一篇中文的文本文档,通过这个程序可以将文档进行分词处理,还能够统计词语出现的次数(To download a Chinese text documents, through this program can be word processing document, will also be able to statistics the number of occurrences of words and phrases)
- 2015-10-23 10:53:54下载
- 积分:1
-
lucene
java中lucene的源代码,用于文本分类的一个很好的工具,是由一个著名的语言研究者编写的(lucene code for java)
- 2009-03-30 17:28:22下载
- 积分:1
-
Reader
在中文分词之前,要对文档进行读取,本代码是实现了从磁盘读取的任务。(In the Chinese word prior to reading the document, the code is read from the disk to achieve the task.)
- 2013-09-10 11:09:28下载
- 积分:1
-
GB2312ToUnicode
GBK 转 unicode 提供二分法查询(translate the gbk to the unicode,with the bianary search way)
- 2009-12-31 13:17:44下载
- 积分:1