LOPEline: A Linguistic Open pipeline for Chinese NLP and eHumanities

[ Preprocessing ]

Language model

Syllable identifier

Word segmentator

Non-linguistic units (Emoticon) detector

POS tagger

Sense tagger 

NER 

Parser

[DeepLEX API]

[ Sentiment and Opinion ]

[ e-Humanities ]

LRs construction for less-resource languages

[Ref]

python中文NLP工具集 https://github.com/masr/pynlpini

功能介绍 

汉语言处理包 中文分词 词性标注 命名实体识别 依存句法分析 关键词提取 自动摘要 短语提取 拼音 简繁转换 

http://hanlp.linrunsoft.com/

https://github.com/hankcs/HanLP !

介面服務  text analytics and digital forensics  

http://www.basistech.com/about/startup/

哈工大語言雲

復旦大學 FudanNLP

Stanford Chinese NLP

Spacy.io

Concrete Chinese NLP pipeline

https://github.com/hltcoe/concrete

Concrete is an attempt to map out various NLP data types in a Thrift schema for use in projects across Johns Hopkins University. This standardized schema allows researchers to use a common, underlying data model for all NLP tasks, and thus, facilitating integration between projects. in Chinese/PYTHON!

Freeling 3.0 http://nlp.lsi.upc.edu/freeling/

Spell Checking for Chinese 2012 LREC

Using LRs in Humanities research 2012 LREC Marta Villegas, Nuria Bel, Carlos Gonzalo, Amparo Moreno and Nuria Simelio