README.md 1019 Bytes
Newer Older
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32
1. DESCRIPTION
This tool annotates a document with URLs of corresponidng entities.

Linker parses annotated tokens/groups of tokens to spot expressions which
have to be described by mentioned URL.
URLs are taken from semantic graph of connected resources.

Linker accepts *.ccl file with annotations of following type:
  - mwe (multi-word expressions)
  - ne (named entities)
  - wsd (word sense disambiguation - annottated with syn_id)

Note: in current version, multi-word expressions (including named entity)
have precedence over disambiguated words. It's the case when
word annotated with wsd have been included in multi-word expression.
Then in many cases sesnse of this single word will differ from sense
of whole expression.

As a result, *.ccl document is returned with included URL annotations.

2. DEPENDENCIES
- python 2.x
- py2neo

All dependencies included in requirements.txt.

3. USAGE

elinker path/to/ccl/doc.xml path/for/results/output_doc.xml

For help and detailed description type: elkr "--help".