Skip to content
Snippets Groups Projects
Commit 51be6680 authored by Mateusz Klimaszewski's avatar Mateusz Klimaszewski Committed by Mateusz Klimaszewski
Browse files

Add documentation EUD model training.

parent 8e09e336
2 merge requests!9Enhanced dependency parsing develop to master,!8Enhanced dependency parsing
This commit is part of merge request !9. Comments created here will be created in the context of that merge request.
......@@ -43,7 +43,27 @@ Examples (for clarity without training/validation data paths):
```bash
combo --mode train --targets head,deprel --features token,char,upostag
```
## Enhanced UD
Training a model with Enhanced UD prediction **requires** data pre-processing.
```bash
combo --mode train \
--training_data_path your_preprocessed_training_path \
--validation_data_path your_preprocessed_validation_path \
--targets feats,upostag,xpostag,head,deprel,lemma,deps \
--config_path config.graph.template.jsonnet
```
### Data pre-processing
Download data from [IWPT20 Shared Task](https://universaldependencies.org/iwpt20/data.html).
It contains `enhanced_collapse_empty_nodes.pl` script which is required as pre-processing step.
Apply this script to training and validation data.
```bash
perl enhanced_collapse_empty_nodes.pl training.conllu > training.fixed.conllu
```
## Configuration
### Advanced
......
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment