Text annotation tool test

Reference documents:

https://mp.weixin.qq.com/s?__biz=MzI1MjQ2OTQ3Ng==&mid=2247486935&idx=1&sn=3beffc58b1360a2891c74539e35d2295&chksm=e9e2045cde958d4ac8a69d29d511a133155eeab2c062711cec45a2303789587195979c31bd6a&mpshare=1&scene=1&srcid=1126rUim2ks07dI1tj84FHYC&rd2werd=1#wechat_redirect

1、Chinese-Annotator

Project Address: https://github.com/crownpku/Chinese-Annotator

Download Item: git clone   https://github.com/crownpku/Chinese-Annotator.git

By reading the contents of the document on github and reference documents, found no write how to install and use Chinese-Annotator, then see the question was raised in the Issues in https://github.com/crownpku/Chinese-Annotator/issues/11 , authors' reply "to an available system is still far from the stage ......", so I can not test it.

2,  IEPY 

 (The reason was not successful network installation)

Code: https://github.com/machinalis/iepy 

Description: http://iepy.readthedocs.io/en/latest/index .html

The whole project is more complete, user management systems. The front end a little heavy, not very user-friendly.

IEPY is a focus on open source information relation extraction extraction tool. It is primarily intended for users and scientists want to try a new algorithm for large data sets need information extraction. Currently only fully supports English, Spanish and German

Language Support: http://iepy.readthedocs.io/en/latest/language.html

installation:

Require Python 3.4+

pip install iepy

Mark the document, a paragraph mark, creating an entity, management entities

3, DeepDive (Mindtagger)

  (The reason was not successful network installation)

Description: http://deepdive.stanford.edu/labeling 

Front-end code: https://github.com/HazyResearch/mind Bender 

The front is relatively simple, user friendly interface.

The corenlp DeepDive part of the code into Chinese support to try:

https://github.com/SongRb/DeepDiveChineseApps 

https://github.com/qiangsiwei/DeepDive_Chinese 

https://github.com/mcavdar/deepdive/commit/6882178cbd38a5bbbf4eee8b76b1e215537425b2

Use Cases:

http://blog.csdn.net/u013412066/article/details/68065518

Download and install (does not support windows system):

bash <(curl -fsSL git.io/getdeepdive)

Documents marked

4、BRAT

  (Not successfully installed after downloading)

BRAT is a web-based text annotation tools, mainly for the structured text annotation, the annotation result generated by BRAT able to unstructured structured original text for computer processing. It can easily get tagged corpus of NLP tasks required to use the tool.

Support Chinese.

Description: http://brat.nlplab.org/index.html 

Online trial: http://weaver.nlplab.org/~brat/demo/ Latest / # / 

Code: https://github.com/nlplab/brat

Documentation : http://brat.nlplab.org/introduction.html

Installation documentation: http://brat.nlplab.org/installation.html

Manual: http://brat.nlplab.org/manual.html

 

installation:

下载压缩包 
brat-v1.3_Crunchy_Frog.tar.gz

解压 
tar xzf brat-v1.3_Crunchy_Frog.tar.gz

cd brat-v1.3_Crunchy_Frog

安装 ./install.sh

Use Cases:

http://blog.csdn.net/owengbs/article/details/49780225

5, SUTDAnnotator

  (Successful installation)

Code: https://github.com/jiesutd/SUTDAnnotator 

Paper:https://github.com/jiesutd/SUTDAnnotator/blob/master/lrec2018.pdf

Using the documentation: H ttps: //github.com/jiesutd/SUTDAnnotator

But not the front page with the pythonGUI, but relatively lightweight. Support Chinese

 

Download: git clone  https://github.com/jiesutd/SUTDAnnotator.git

Start: python YEDDA_Annotator.py

Use Cases:

https://www.cnblogs.com/combfish/p/7830807.html

6、Snorkel

  (Not successfully installed)

A training data creation and management system, focusing on information extraction.

Page: https://hazyresearch.github.io/snorkel/ 

Github: https://github.com/HazyResearch/snorkel 

Demo Paper: https://hazyresearch.github.io/snorkel/pdfs/snorkel_demo.pdf

Guidebook: https://github.com/HazyResearch/snorkel/tree/master/tutorials

Installation Guide: https: //github.com/HazyResearch/snorkel#installation-dependencies

7、 Slate

  (Not Installed)

Code: https://bitbucket.org/dainkaplan/slate/ 

Paper: http://www.jlcl.org/2011_Heft2/11.pdf

Guide: https://bitbucket.org/dainkaplan/slate/wiki/manual

What Slate that?

Segment and Link-based Annotation Tool, Enhanced. Slate is a web-based system that requires users only to have installed a web browser and have access to the internet to use.

Installation Requirements:

installation steps:

1, install Tomcat 5.x or 6.x

2、Replace the webapps/ROOT/ folder with the contents of the war package (the war package must be built first; see "Building Slate below").

3、Run the SLATE_SCHEMA.sql file once as root (sudo) to create the DB and DB user

4、Start up Tomcat and hit it in web browser to configure (e.g. http://localhost:8080/)

8、Prodigy

   (Not Installed)

Website: https://prodi.gy/docs/ 

Blog: https://explosion.ai/blog/prodigy-annotation-tool-active-learning

And is a well-known spacy do. Not open source

 

Probably looked at the document on the website, it seems not to support Chinese. In this project, we're ignoring all non-English text, as well as ambiguous titles.

功能包括:Text ClassificationEntity RecognitionImage classificationA/B Evaluation

demo Institute Add: Https://Prodi.Gy/demo

Guess you like

Origin blog.csdn.net/zwahut/article/details/90637696