a国产,中文字幕久久波多野结衣AV,欧美粗大猛烈老熟妇,女人av天堂

當(dāng)前位置:主頁 > 科技論文 > 自動(dòng)化論文 >

基于循環(huán)神經(jīng)網(wǎng)絡(luò)的中文人名識(shí)別的研究

發(fā)布時(shí)間:2018-05-20 10:19

  本文選題:中文人名識(shí)別 + 詞向量 ; 參考:《大連理工大學(xué)》2016年碩士論文


【摘要】:中文人名識(shí)別任務(wù)是中文信息處理領(lǐng)域中的基礎(chǔ)任務(wù),其性能的好壞將直接影響到其他任務(wù)的性能。中文人名的隨意性使其在未登錄詞中占有較大的比重,解決未登錄詞識(shí)別問題首先要解決人名識(shí)別問題。因此,解決中文人名識(shí)別問題具有重要的意義,F(xiàn)有基于統(tǒng)計(jì)的中文人名識(shí)別方法存在特征選取復(fù)雜和人工干預(yù)等問題,針對(duì)這些問題,本文提出了一種基于循環(huán)神經(jīng)網(wǎng)絡(luò)(Recurrent Neural Networks)的中文人名識(shí)別方法,該方法僅采用詞向量作為模型的特征且無需人工干預(yù),有效降低了特征選取的復(fù)雜性和人工干預(yù)對(duì)實(shí)驗(yàn)造成的影響。此外,詞向量可以通過大量未標(biāo)注的中文數(shù)據(jù)訓(xùn)練獲得,然后將蘊(yùn)含豐富語義信息的詞向量作為循環(huán)神經(jīng)網(wǎng)絡(luò)模型的輸入,可以使模型學(xué)習(xí)到更多的信息,提升模型的性能。本文將模型分為兩個(gè)階段:模型構(gòu)建階段和后處理階段。在模型構(gòu)建階段,我們將重點(diǎn)放在詞向量的優(yōu)化策略上。針對(duì)詞向量的優(yōu)化問題,本文提出了三種策略:(1)將word2vec訓(xùn)練得到的詞向量替換循環(huán)神經(jīng)網(wǎng)絡(luò)模型的隨機(jī)初始詞向量(2)對(duì)詞向量訓(xùn)練語料進(jìn)行數(shù)詞泛化操作(3)改進(jìn)word2vec模型,將特征信息融入詞向量實(shí)驗(yàn)結(jié)果表明,通過詞向量的優(yōu)化操作,中文人名識(shí)別模型的F值提高了2.23%。在后處理階段,通過上下文規(guī)則對(duì)候選人名進(jìn)行過濾;采用基于篇章的全局?jǐn)U散操作召回在某一位置由于信息不足識(shí)別不出而在其他位置能夠被識(shí)別的人名;使用基于篇章的局部擴(kuò)散操作識(shí)別篇章信息中有名無姓或者有姓無名的人名。實(shí)驗(yàn)結(jié)果表明,通過規(guī)則過濾和擴(kuò)散操作,中文人名識(shí)別模型的F值提高了4.74%。
[Abstract]:The task of Chinese name recognition is the basic task in the field of Chinese information processing, and its performance will directly affect the performance of other tasks. The randomness of Chinese names makes them occupy a large proportion in unrecorded words. To solve the problem of unrecorded words recognition, we must first solve the problem of personal name recognition. Therefore, it is of great significance to solve the problem of Chinese name recognition. The existing Chinese name recognition methods based on statistics have the problems of complex feature selection and artificial intervention. In view of these problems, this paper proposes a Chinese name recognition method based on cyclic neural network (Recurrent Neural Network). This method only uses word vector as the feature of the model and does not need human intervention, which effectively reduces the complexity of feature selection and the influence of artificial intervention on the experiment. In addition, the word vector can be obtained through a large number of unlabeled Chinese data training, and then the word vector with rich semantic information can be used as the input of the cyclic neural network model, so that the model can learn more information and improve the performance of the model. This paper divides the model into two stages: model construction stage and post-processing phase. In the stage of model construction, we focus on the optimization strategy of word vector. To solve the problem of word vector optimization, this paper proposes three strategies: 1) the word vector is replaced by the random initial word vector of the neural network model, which is trained by word2vec, and the random initial word vector is used to generalize the word vector training corpus. (3) the word2vec model is improved. The experimental results show that the F value of the Chinese name recognition model is increased by 2.233 by the optimization of the word vector. In the post-processing stage, the candidate's name is filtered by contextual rules, and the text based global diffusion operation is used to recall the names of people who can be recognized in other places because of the lack of information. A text-based local diffusion operation is used to identify a person with no or no name in the text information. The experimental results show that the F value of the Chinese name recognition model is increased by 4.74 by regular filtering and diffusion operation.
【學(xué)位授予單位】:大連理工大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2016
【分類號(hào)】:TP391.1;TP183

【參考文獻(xiàn)】

相關(guān)期刊論文 前10條

1 王s,

本文編號(hào):1914230


資料下載
論文發(fā)表

本文鏈接:http://www.wukwdryxk.cn/kejilunwen/zidonghuakongzhilunwen/1914230.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶f64c7***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com
亚洲国产精品成人综合久久久久久久| 焦作市| 一本色道久久综合| 一本之道av不卡精品| 亚洲综合色区另类av| 黑人精品欧美一区二区蜜桃| 一本久道东京热| 国产女人十八毛片| 50女潮喷和高潮| 国产视频久久久久| 人妻少妇久久中文字幕一区二区 | 中文字幕乱码一区二区免费| 黑人巨大精品欧美一区二区| 国产精品人妻熟女丝袜13p| 三级短视频| 噜噜噜久久,亚洲精品国产品| 欧美日韩国产片| 97综合网| 奇米影视亚洲春色| 九色91蝌蚪| 国产又黄又大又粗的视频| 欧美三级不卡在线观看| 国产精品jizz在线观看网站| 美女扒开尿口让男人桶| 久久久久亚洲AV无码去区首| 按摩师舌头进去添的我好舒服| 人人爽人人爽人人片av免费| 国产电影无码午夜在线播放| 色丁狠狠桃花久久综合网| 亚洲国产无套无码av电影| 亚洲av无码av男人的天堂| 国产精品成人无码免费| 亚洲欧美国产国产综合一区| 精品成人av一区二区三区| 久久久久成人精品| 欧乱色国产精品兔费视频| 国精品午夜福利视频不卡| 免费午夜爽爽爽www视频十八禁| 成人无码精品一区二区三区| 亚洲精品TV久久久久久久久久 | 久久精品国产曰本波多野结衣|