Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
C
combo
Manage
Activity
Members
Labels
Plan
Issues
20
Issue boards
Milestones
Wiki
Redmine
Code
Merge requests
2
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Container Registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Syntactic Tools
combo
Commits
54e37314
Commit
54e37314
authored
1 year ago
by
Martyna Wiącek
Browse files
Options
Downloads
Patches
Plain Diff
fix default_split_level
parent
a978579d
1 merge request
!47
Fixed multiword prediction + bug that made the code write empty predictions
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
combo/main.py
+5
-5
5 additions, 5 deletions
combo/main.py
with
5 additions
and
5 deletions
combo/main.py
+
5
−
5
View file @
54e37314
...
@@ -156,8 +156,8 @@ def get_defaults(dataset_reader: Optional[DatasetReader],
...
@@ -156,8 +156,8 @@ def get_defaults(dataset_reader: Optional[DatasetReader],
# Dataset reader is required to read training data and/or for training (and validation) data loader
# Dataset reader is required to read training data and/or for training (and validation) data loader
dataset_reader
=
default_ud_dataset_reader
(
FLAGS
.
pretrained_transformer_name
,
dataset_reader
=
default_ud_dataset_reader
(
FLAGS
.
pretrained_transformer_name
,
tokenizer
=
LamboTokenizer
(
FLAGS
.
tokenizer_language
,
tokenizer
=
LamboTokenizer
(
FLAGS
.
tokenizer_language
,
default_turns
=
FLAGS
.
turns
,
default_split_level
=
"
TURNS
"
if
FLAGS
.
turns
else
"
SENTENCES
"
,
default_split_subwords
=
FLAGS
.
split_subwords
)
default_split_subwords
=
FLAGS
.
split_subwords
)
)
)
if
not
training_data_loader
:
if
not
training_data_loader
:
...
@@ -403,9 +403,9 @@ def run(_):
...
@@ -403,9 +403,9 @@ def run(_):
logger
.
info
(
"
No dataset reader in the configuration or archive file - using a default UD dataset reader
"
,
logger
.
info
(
"
No dataset reader in the configuration or archive file - using a default UD dataset reader
"
,
prefix
=
prefix
)
prefix
=
prefix
)
dataset_reader
=
default_ud_dataset_reader
(
FLAGS
.
pretrained_transformer_name
,
dataset_reader
=
default_ud_dataset_reader
(
FLAGS
.
pretrained_transformer_name
,
tokenizer
=
LamboTokenizer
(
tokenizer_language
,
tokenizer
=
LamboTokenizer
(
tokenizer_language
,
default_turns
=
FLAGS
.
turns
,
default_split_level
=
"
TURNS
"
if
FLAGS
.
turns
else
"
SENTENCES
"
,
default_split_subwords
=
FLAGS
.
split_subwords
)
default_split_subwords
=
FLAGS
.
split_subwords
)
)
)
predictor
=
COMBO
(
model
,
dataset_reader
)
predictor
=
COMBO
(
model
,
dataset_reader
)
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment