Reference documents:
1、Chinese-Annotator
Project Address: https://github.com/crownpku/Chinese-Annotator
Download Item: git clone https://github.com/crownpku/Chinese-Annotator.git
By reading the contents of the document on github and reference documents, found no write how to install and use Chinese-Annotator, then see the question was raised in the Issues in https://github.com/crownpku/Chinese-Annotator/issues/11 , authors' reply "to an available system is still far from the stage ......", so I can not test it.
2, IEPY
(The reason was not successful network installation)
Code: https://github.com/machinalis/iepy
Description: http://iepy.readthedocs.io/en/latest/index .html
The whole project is more complete, user management systems. The front end a little heavy, not very user-friendly.
IEPY is a focus on open source information relation extraction extraction tool. It is primarily intended for users and scientists want to try a new algorithm for large data sets need information extraction. Currently only fully supports English, Spanish and German
Language Support: http://iepy.readthedocs.io/en/latest/language.html
installation:
Require Python 3.4+
pip install iepy
Mark the document, a paragraph mark, creating an entity, management entities
3, DeepDive (Mindtagger)
(The reason was not successful network installation)
Description: http://deepdive.stanford.edu/labeling
Front-end code: https://github.com/HazyResearch/mind Bender
The front is relatively simple, user friendly interface.
The corenlp DeepDive part of the code into Chinese support to try:
https://github.com/SongRb/DeepDiveChineseApps
https://github.com/qiangsiwei/DeepDive_Chinese
https://github.com/mcavdar/deepdive/commit/6882178cbd38a5bbbf4eee8b76b1e215537425b2
Use Cases:
http://blog.csdn.net/u013412066/article/details/68065518
Download and install (does not support windows system):
bash <(curl -fsSL git.io/getdeepdive)
Documents marked
4、BRAT
(Not successfully installed after downloading)
BRAT is a web-based text annotation tools, mainly for the structured text annotation, the annotation result generated by BRAT able to unstructured structured original text for computer processing. It can easily get tagged corpus of NLP tasks required to use the tool.
Support Chinese.
Description: http://brat.nlplab.org/index.html
Online trial: http://weaver.nlplab.org/~brat/demo/ Latest / # /
Code: https://github.com/nlplab/brat
Documentation : http://brat.nlplab.org/introduction.html
Installation documentation: http://brat.nlplab.org/installation.html
Manual: http://brat.nlplab.org/manual.html
installation:
下载压缩包
brat-v1.3_Crunchy_Frog.tar.gz
解压
tar xzf brat-v1.3_Crunchy_Frog.tar.gz
cd brat-v1.3_Crunchy_Frog
安装 ./install.sh
Use Cases:
http://blog.csdn.net/owengbs/article/details/49780225
5, SUTDAnnotator
(Successful installation)
Code: https://github.com/jiesutd/SUTDAnnotator
Paper:https://github.com/jiesutd/SUTDAnnotator/blob/master/lrec2018.pdf
Using the documentation: H ttps: //github.com/jiesutd/SUTDAnnotator
But not the front page with the pythonGUI, but relatively lightweight. Support Chinese
Download: git clone https://github.com/jiesutd/SUTDAnnotator.git
Start: python YEDDA_Annotator.py
Use Cases:
https://www.cnblogs.com/combfish/p/7830807.html
6、Snorkel
(Not successfully installed)
A training data creation and management system, focusing on information extraction.
Page: https://hazyresearch.github.io/snorkel/
Github: https://github.com/HazyResearch/snorkel
Demo Paper: https://hazyresearch.github.io/snorkel/pdfs/snorkel_demo.pdf
Guidebook: https://github.com/HazyResearch/snorkel/tree/master/tutorials
Installation Guide: https: //github.com/HazyResearch/snorkel#installation-dependencies
7、 Slate
(Not Installed)
Code: https://bitbucket.org/dainkaplan/slate/
Paper: http://www.jlcl.org/2011_Heft2/11.pdf
Guide: https://bitbucket.org/dainkaplan/slate/wiki/manual
What Slate that?
Segment and Link-based Annotation Tool, Enhanced. Slate is a web-based system that requires users only to have installed a web browser and have access to the internet to use.
Installation Requirements:
installation steps:
1, install Tomcat 5.x or 6.x
2、Replace the webapps/ROOT/ folder with the contents of the war package (the war package must be built first; see "Building Slate below").
3、Run the SLATE_SCHEMA.sql file once as root (sudo) to create the DB and DB user
4、Start up Tomcat and hit it in web browser to configure (e.g. http://localhost:8080/)
8、Prodigy
(Not Installed)
Website: https://prodi.gy/docs/
Blog: https://explosion.ai/blog/prodigy-annotation-tool-active-learning
And is a well-known spacy do. Not open source
Probably looked at the document on the website, it seems not to support Chinese. In this project, we're ignoring all non-English text, as well as ambiguous titles.
功能包括:Text Classification、Entity Recognition、Image classification、A/B Evaluation
demo Institute Add: Https://Prodi.Gy/demo