Ò׽ؽØÍ¼Èí¼þ¡¢µ¥Îļþ¡¢Ãâ°²×°¡¢´¿ÂÌÉ«¡¢½ö160KB

ÖпÆÔº·Ö´Ê¹¤¾ßimdict chinese analyzerѧϰ java·Ö´Ê

ÏÂÔØÁ´½Óhttp://ictclas.org/Down_OpenSrc.asp
¼òµ¥½éÉÜ£º
 imdict-chinese-analyzerÊÇ imdictÖÇÄܴʵäµÄÖÇÄÜÖÐÎÄ·Ö´ÊÄ£¿é£¬×÷Õ߸ßСƽ£¬Ëã·¨»ùÓÚÒþÂí
¶û¿Æ·òÄ£ÐÍ(Hidden Markov Model, HMM)£¬ÊÇÖйú¿ÆÑ§Ôº¼ÆËã¼¼ÊõÑо¿ËùµÄictclasÖÐÎķִʳÌÐò
µÄÖØÐÂʵÏÖ£¨»ùÓÚJava£©£¬¿ÉÒÔÖ±½ÓΪluceneËÑË÷ÒýÇæÌṩÖÐÎÄ·Ö´ÊÖ§³Ö¡£
Ó¦Óãº
ϵ½µÄѹËõ°ü½âѹºó¾ÍÊÇÒ»¸öjava¹¤³Ì£¬eclipseÖ±½Óµ¼Èë¼´¿É£¬µ«ÓÉÓÚÆä¿ª·¢µÄ»·¾³ÊÇUTF8ËùÒÔ
Òª½«eclipseµÄ¹¤×÷¿Õ¼äµÄ±àÂëÒ²ÉèÖÃΪutf8£¬test°üÀïÃæµÄAnalyzerTest¾ÍÊÇÆäÓ÷¨£¬¿´ÁËÒÔºó
¾Í¿ÉÒÔÖ±½ÓÓÃÁË
¹¦ÄÜ£ºÖÐÎÄ·Ö´Ê¡¢Í£Ö¹´Ê¹ýÂË
Óŵ㣺¿ªÔ´£¬·Ö´ÊËٶȿ죬ЧÂʸß
ȱµã£º²»Ö§³Ö×Ô¼ºÌí¼Ó´Ê¿â£¬²»Ö§³Ö´ÊÐÔ±ê×¢£¨¿ª·¢ÈËÔ±×Ô¼ºËµÊÇΪÁËÌá¸ßËÙ¶È£©£¬dataÎļþ¼Ð½ö
×Ô´øÁËÁ½¸ö×ÖµäcoredictºËÐÄ×ֵ䡢bigramdict´Ê¹ØÏµ×ֵ䣬ÕâÊÇÁ½¸ö×îÖØÒªµÄ´Êµä£¬Ã»ÓеØÃûºÍ
ÈËÃûµÄ´Êµä£¬ËùÒÔҪʶ±ðÈËÃûµØÃû±È½ÏÂé·³£¬¾Ý˵ҪÓòã´Îhmm£¬ÏÈ´Ö·ÖÔÚϸ·Ö¡£
ÉîÈëѧϰ£ºÖ÷ÀàÊÇnet.imdict.analysis.chineseÖеÄChineseAnalyzer.javaËü¼Ì³ÐÁËluceneµÄ
AnalyzerÀ࣬ÓÐÁ½¸ö¹¹Ôì·½·¨£ºpublic ChineseAnalyzer()¡¢public ChineseAnalyzer
(Set<String> stopWords)µÚ¶þ¸ö¹¹Ôì·½·¨Ö§³ÖÍ£Óôʣ¬×îÖØÒªµÄÊÇtokenStreamº¯Êý£¬ËüÓÃÁË
SentenceTokenizerºÍnew WordTokenizer£¬Ç°Ò»¸öÊǽ«ÎÄÕ·ֳɾä×Ó£¬ºóÒ»¸öÊǽ«¾ä×ӷֳɵ¥´Ê£¬
µ¥´ÊºÍ¾ä×Ó¶¼ÊÇÓÃLuceneµÄToken£¨´Ê£©µÄÀà´æ´¢µÄ£¬£¨TokenÊÇÒ»¸ö³éÏóÀ࣬TokenStreamÊÇToken
ÀàµÄ×ÓÀ࣬µ«Ò²ÊÇÒ»¸ö³éÏóÀ࣬TokenizerºÍTokenFilterÔòÊÇTokenStreamµÄ¾ßÌåʵÏÖ£¬ËûÃÇʵÏÖ
ÁËTokenStreamµÄnext()·½·¨£¬TokenizerµÄnext·½·¨·µ»ØµÄÊÇԭʼµÄ¡¢ÇзֳöÀ´µÄ´Ê£¬¶ø
TokenFilter·½·¨·µ»ØµÄÊÇÒ»¸ö¾­¹ý¹ýÂ˵ĴÊÌõ£¬ËûÃǽáºÏÆðÀ´ÐγÉLucene·ÖÎöÆ÷µÄºËÐĽṹ£©Èç
Token token = new Token()£¬È»ºóͨ¹ýtoken.reinit(buffer.toString(), tokenStart,
tokenEnd, "sentence");ÖмäÁ½¸ö²ÎÊýÊÇToken´æ´¢µÄ×Ö·û´®µÄÆðֹλÖã¬ÒÔ0¿ªÊ¼¼ÆÊý£¬ÒýÓÃ
tokenÖÐ×Ö·û´®µÄº¯ÊýÊÇtoken.term()£¬ÕæÕýµ÷Ó÷ִʺËÐÄËã·¨µÄWordSegmenterµÄ
segmentSentence·½·¨¶Ô¾ä×Ó½øÐзִʣ¬ÔÚWordTokenizerÀàÖе÷ÓÃËüµÃµ½·Ö´Ê½á¹û¡£ÔÚÍùϲãµÄ´ú
ÂëÎÒ¾Íû¿´ÁË¡£
Á½¸ö¸Ä¶¯£º
£¨1£©ChineseAnalyzerÖ»ÄܶÔÎļþ½øÐзִʣ¬ÈçºÎ¶ÔÒ»¸ö×Ö·û´®½øÐзִʣ¬¸Ä¶¯ÈçÏÂ
/*  TokenStream ts = ca.tokenStream("sentence", new InputStreamRe


Ïà¹ØÎĵµ£º

javaÖеÄÏÝÚ壬Äã×¢ÒâÁËô£¿

´ð°¸Òþ²ØÁË£¬Ctrl+AÏÔʾ¡£½¨ÒéÏÈ˼¿¼Ò»Ï½á¹û£¬È»ºóÔËÐдúÂëÊÔÑé¡£Ò²ÐíÄã»á»ÐÈ»´óÎò¡£
1¡¢ÕÒÆæÊý£º
view plaincopy to clipboardprint?
public static boolean isOdd(int i){
return i % 2 == 1;
}
public static boolean isOdd(int i){
return i % 2 == 1;
}
ÉÏÃæµÄ·½·¨ÕæµÄÄÜÕÒµ ......

JavaÐòÁл¯µÄ»úÖÆºÍÔ­Àí

ÓйØJava¶ÔÏóµÄÐòÁл¯ºÍ·´ÐòÁл¯Ò²ËãÊÇJava»ù´¡µÄÒ»²¿·Ö£¬ÏÂÃæ¶ÔJavaÐòÁл¯µÄ»úÖÆºÍÔ­Àí½øÐÐһЩ½éÉÜ¡£
¡¡¡¡JavaÐòÁл¯Ë㷨͸Îö
¡¡¡¡Serialization£¨ÐòÁл¯£©ÊÇÒ»ÖÖ½«¶ÔÏóÒÔÒ»Á¬´®µÄ×Ö½ÚÃèÊöµÄ¹ý³Ì£»·´ÐòÁл¯deserializationÊÇÒ»ÖÖ½«ÕâЩ×Ö½ÚÖØ½¨³ÉÒ»¸ö¶ÔÏóµÄ¹ý³Ì¡£JavaÐòÁл¯APIÌṩһÖÖ´¦Àí¶ÔÏóÐòÁл¯µÄ±ê×¼»úÖÆ¡£ÔÚÕâÀ ......

ImageMagick for java ʹÓÃJmagick´¦Àí¸ßÖÊÁ¿Í¼Æ¬

ÔÚ×öpdfÎĵµ×ª³ÉjpgµÄʱºò£¬·¢ÏÖÁËJmagickµÄ´´½¨¸ßÖÊÁ¿µÄͼƬµÄÒ»¸öjavaÀà¿â,×Ô¼ºÒÔǰʹÓÃÁíÍâµÄÒ»¸öÀà¿â,¸Ð¾õÕâ¸ö¸üºÃµã,¾ÍÊÔ×ÅÓÃÁËÏÂ,¸Ð¾õ²»´í
1.ʹÓõÄwindowsϵÄjmagick-win-6.3.9-Q16.zip µØÖ·ÊÇ£ºhttp://downloads.jmagick.org/6.3.9/
2.doc¶ÔÓ¦µÄapiµØÖ·£ºhttp://downloads.jmagick.org/jmagick-doc/
3.°²×°Ima ......

java ½Ó¿ÚÓë³éÏóÀàµÄÇø±ð£¨×ª£©

Ò»¸öÈí¼þÉè¼ÆµÄºÃ»µ£¬ÎÒÏëºÜ´ó³Ì¶ÈÉÏÈ¡¾öÓÚËüµÄÕûÌå¼Ü¹¹£¬¶øÕâ¸öÕûÌå¼Ü¹¹Æäʵ¾ÍÊÇÄã¶ÔÕû¸öºê¹ÛÉÌÒµÒµÎñµÄ³éÏó¿ò¼Ü£¬µ±´ú±íÒµÎñÂß¼­µÄ¸ß²ã³éÏó²ã½á¹¹ ºÏÀíʱ£¬Äãµ×²ãµÄ¾ßÌåʵÏÖÐèÒª¿¼Âǵľͽö½öÊÇһЩËã·¨ºÍһЩ¾ßÌåµÄÒµÎñʵÏÖÁË¡£µ±ÄãÐèÒªÔÙ¿ª·¢ÁíÒ»¸öÏà½üµÄÏîĿʱ£¬ÄãÒÔǰµÄ³éÏó²ã˵²»¶¨»¹¿ÉÒÔÔÙ´ÎÀûÓÃ ÄØ£¬Ãæ¶Ô¶ÔÏóµÄÉè¼Æ ......

JAVA¶ÁÈ¡txtÎļþ

import java.io.BufferedReader;
import java.io.File;
import java.io.FileNotFoundException;
import java.io.FileReader;
import java.io.IOException;
/**
* @author dengshaohua
*/
public class ReadPhone {
/**
* ¶ÁÈ¡Êý¾Ý
*/
public void ReadData(){
try {
FileReader read = new File ......
© 2009 ej38.com All Rights Reserved. ¹ØÓÚE½¡ÍøÁªÏµÎÒÃÇ | Õ¾µãµØÍ¼ | ¸ÓICP±¸09004571ºÅ