Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
I
Iobber
Manage
Activity
Members
Labels
Plan
Issues
4
Issue boards
Milestones
Wiki
Redmine
Code
Merge requests
0
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Container Registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Syntactic Tools
Chunking
Iobber
Commits
3ea63ada
Commit
3ea63ada
authored
12 years ago
by
Adam Radziszewski
Browse files
Options
Downloads
Patches
Plain Diff
add nkjp-coarse config
parent
ecbf6f27
Branches
Branches containing commit
No related merge requests found
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
iobber/data/nkjp-coarse-layer1.txt
+61
-0
61 additions, 0 deletions
iobber/data/nkjp-coarse-layer1.txt
iobber/data/nkjp-coarse.ccl
+11
-0
11 additions, 0 deletions
iobber/data/nkjp-coarse.ccl
iobber/data/nkjp-coarse.ini
+26
-0
26 additions, 0 deletions
iobber/data/nkjp-coarse.ini
with
98 additions
and
0 deletions
iobber/data/nkjp-coarse-layer1.txt
0 → 100644
+
61
−
0
View file @
3ea63ada
# Unigram
# orth
U00:%x[-2,0]
U01:%x[-1,0]
U02:%x[0,0]
U03:%x[1,0]
U04:%x[2,0]
U05:%x[-1,0]/%x[0,0]
U06:%x[0,0]/%x[1,0]
# class
U10:%x[-2,1]
U11:%x[-1,1]
U12:%x[0,1]
U13:%x[1,1]
U14:%x[2,1]
U15:%x[-2,1]/%x[-1,1]
U16:%x[-1,1]/%x[0,1]
U17:%x[0,1]/%x[1,1]
U18:%x[1,1]/%x[2,1]
# cas
U20:%x[-2,2]
U21:%x[-1,2]
U22:%x[0,2]
U23:%x[1,2]
U24:%x[2,2]
# gnd
U30:%x[-2,3]
U31:%x[-1,3]
U32:%x[0,3]
U33:%x[1,3]
U34:%x[2,3]
# nmb
U40:%x[-2,4]
U41:%x[-1,4]
U42:%x[0,4]
U43:%x[1,4]
U44:%x[2,4]
# agr
U50:%x[-1,5] # agr(0,1) -> agr(-1,0)
U51:%x[0,5] # agr(0,1)
U52:%x[-1,6] # agr..(-1,1) -> agr(-2,0)
U53:%x[0,6] # (-1,1)
U54:%x[1,6] # ... -> (0,2)
# regex feats
#U60:%x[-1,7]/%x[-1,8]
U61:%x[0,7]/%x[0,8]
#U62:%x[1,7]/%x[1,8]
# wordclass trigrams
U80:%x[-2,1]/%x[-1,1]/%x[0,1]
U81:%x[-1,1]/%x[0,1]/%x[1,1]
U82:%x[0,1]/%x[1,1]/%x[2,1]
# Bigram
B
This diff is collapsed.
Click to expand it.
iobber/data/nkjp-coarse.ccl
0 → 100644
+
11
−
0
View file @
3ea63ada
@ "default" (
orth[0]; // 0
class[0]; // 1
cas[0]; // 2
gnd[0]; // 3
nmb[0]; // 4
agrpp(0,1,{nmb,gnd,cas}); // 5
and(inside(-1), inside(1), wagr(-1,1,{nmb,gnd,cas})); // 6
regex(orth[0], "\\P{Ll}.*"); regex(orth[0], "\\P{Lu}.*") // 7, 8
)
This diff is collapsed.
Click to expand it.
iobber/data/nkjp-coarse.ini
0 → 100644
+
26
−
0
View file @
3ea63ada
; Configuration for chunking of phrases taken from NKJP but subjected to merging.
; NKJP tagset.
;
; NP is merged from the following groups:
; * actual nominal groups (NG, NGadres, NGdata, NGgodz),
; * numeral groups (NumG*),
; prepositional-nominal and prepositional-numeral groups (PrepNG, PrepNumG, PrepNGadres, PrepNGb, PrepNGdata, PrepNGgodz, PrepNGp).
;
; AdjP is taken from top-level adjective and prep-adj groups (TODO: enumerate).
;
; VP is taken from syntactic words having verbal classes.
;
; There is only one layer for all the groups.
[general]
tagset
=
nkjp
tagged
=
yes
[layers]
; the layer ordering is inferred from alphabetical order of their names!
; channel names should contain no hyphens
layer1
=
chunk_vp,chunk_np,chunk_adjp
[crf]
params
=
-a CRF-L2
This diff is collapsed.
Click to expand it.
Preview
0%
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment