Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset format #2

Open
alessandra3265 opened this issue Jun 6, 2022 · 1 comment
Open

Dataset format #2

alessandra3265 opened this issue Jun 6, 2022 · 1 comment

Comments

@alessandra3265
Copy link

alessandra3265 commented Jun 6, 2022

Kind creator, I cannot understand the format of the datasets you use, i.e. input and output of the read_data, in this line of code in run_srl.py :

train, train_data_file, w_c_to_idx, c_c_to_idx, dep_lb_to_idx = read_data(train_file, {}, {}, {})

Looking on the conll2005/09/12 sites indicated, I cannot find an example compatible with the function.

Can you please provide a small example (one or two sentences) of "train_file"
or
an example of the output of the read_data function?

@wsvkmt
Copy link

wsvkmt commented Mar 22, 2024

In IN (TOP(S(PP* 5 case - * * * (ARGM-LOC*
an DT (NP(NP(NP* 5 det - * * * *
Oct. NNP (NML* 5 compound - * * * *
19 CD ) 5 nummod - * * * *
review NN ) 45 obl - * * * *
of IN (PP
9 case - * * * *
(NP
9 punct - * * * *
The DT (NP* 9 det - * * * *
Misanthrope NN ) 5 nmod - * * * *
'' '' * 9 punct - * * * *
at IN (PP
15 case - * * * *
Chicago NNP (NP(NP* 15 nmod:poss - * * * *
's POS ) 12 case - * * * *
Goodman NNP * 15 compound - * * * *
Theatre NNP ))))) 9 nmod - * * * *
( -LRB- (NP
20 punct - * * * *
* 20 punct - * * * *
Revitalized VBN (S(NP
19 amod revitalize (V*) (ARG0* * *
Classics NNS ) 20 nsubj - (ARG1) ) * *
Take VBP (VP
5 dep take * (V*) * *
the DT (NP* 22 det - * (ARG1* * *
Stage NN ) 20 obj - * ) * *
in IN (PP
25 case - * (ARGM-LOC
* *
Windy NNP (NP* 25 compound - * * * *
City NNP )))) 20 obl - * ) * *
, , * 20 punct - * * * *
'' '' * 20 punct - * * * *
Leisure NN (NP
20 dep - * * * *
& CC * 30 cc - * * * *
Arts NNS ) 28 conj - * * * *
) -RRB- ))) 20 punct - * * * )
, , * 45 punct - * * * *
the DT (NP(NP(NP
34 det - * * (ARG1
(ARG1

role NN ) 45 nsubj:pass - * * * *
of IN (PP
36 case - * * * *
Celimene NNP (NP
))) 34 nmod - * * ) *
, , * 34 punct - * * * *
played VBN (VP
34 acl play * * (V*) *
by IN (PP* 41 case - * * (ARG0* *
Kim NNP (NP* 41 compound - * * * *
Cattrall NNP ))) 38 obl - * * ) *
, , ) 34 punct - * * * )
was VBD (VP
45 aux:pass - * * * *
mistakenly RB (ADVP
) 45 advmod - * * * (ARGM-MNR
)
attributed VBN (VP
0 root attribute * * * (V*)
to TO (PP* 48 case - * * * (ARG2*
Christina NNP (NP* 48 compound - * * * *
Haag NNP *)))) 45 obl - * * * *)
. . *)) 45 punct - * * * *

Ms. NNP (TOP(S(NP* 2 compound - (ARG0*
Haag NNP ) 3 nsubj - )
plays VBZ (VP
0 root play (V
)
Elianti NNP (NP*)) 3 obj - (ARG1*)
. . *)) 3 punct - *

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants