Commit ba8acc86, authored 2 years ago by Piotr
LAMBO integration working.
Parent: ac6cab41
Pipeline #6087 failed in 5 minutes and 55 seconds
Showing 1 changed file: combo/utils/lambo.py, with 3 additions and 3 deletions
@@ -6,7 +6,7 @@ from lambo.segmenter.lambo import Lambo


 class LamboTokenizer(Tokenizer):
-    def __init__(self, model: str = "en",) -> None:
+    def __init__(self, model: str = "LAMBO_no_pretraining-UD_Polish-PDB",) -> None:
         self.lambo = Lambo.get(model)

     # Simple tokenisation: ignoring sentence split
@@ -20,13 +20,13 @@ class LamboTokenizer(Tokenizer):
         return result

     # Full segmentation: divide into sentences and tokens
-    def segment(self, text: str) -> List[List[Token]]:
+    def segment(self, text: str) -> List[List[str]]:
         result = []
         document = self.lambo.segment(text)
         for turn in document.turns:
             for sentence in turn.sentences:
                 resultS = []
                 for token in sentence.tokens:
-                    resultS.append(Token(token.text))
+                    resultS.append(token.text)
                 result.append(resultS)
         return result
\ No newline at end of file
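The effect of the `segment` change can be illustrated with a minimal sketch that runs without LAMBO installed. The `Fake*` classes below are hypothetical stand-ins, not LAMBO's API; they model only the attributes the diff touches (`turns`, `sentences`, `tokens`, `text`). The sample sentences are made up for illustration.

```python
from dataclasses import dataclass, field
from typing import List


# Hypothetical stand-ins for the objects returned by self.lambo.segment(text).
# Only the attributes used in the diff are modelled.
@dataclass
class FakeToken:
    text: str


@dataclass
class FakeSentence:
    tokens: List[FakeToken] = field(default_factory=list)


@dataclass
class FakeTurn:
    sentences: List[FakeSentence] = field(default_factory=list)


@dataclass
class FakeDocument:
    turns: List[FakeTurn] = field(default_factory=list)


def segment(document: FakeDocument) -> List[List[str]]:
    """Mirror the loop after this commit: one list of token strings per sentence."""
    result = []
    for turn in document.turns:
        for sentence in turn.sentences:
            # After the commit, plain strings are collected instead of Token objects.
            result.append([token.text for token in sentence.tokens])
    return result


doc = FakeDocument(turns=[
    FakeTurn(sentences=[
        FakeSentence(tokens=[FakeToken("Ala"), FakeToken("ma"), FakeToken("kota"), FakeToken(".")]),
        FakeSentence(tokens=[FakeToken("Kot"), FakeToken("śpi"), FakeToken(".")]),
    ]),
])
sentences = segment(doc)
print(sentences)
```

Turn boundaries are flattened away: the result is a single list of sentences, so callers that previously received `Token` objects would now need to wrap the strings themselves.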