Commit
Showing 7 changed files with 112 additions and 0 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
{ | ||
"Name": "Affect in Tweets PT", | ||
"URL": "https://hdl.handle.net/21.11129/0000-000E-75BA-D", | ||
"Family": "Manually annotated corpora", | ||
"Description": "This is a data set of Portuguese tweets labelled with the emotion conveyed in the tweet.\nEach tweet is labelled with an emotion (i.e., anger, fear, joy, sadness).\nThe corpus is available from PORTULAN.", | ||
"Language": ["por"], | ||
"Licence": "CC BY", | ||
"Size": ["11,219 tweets"], | ||
"Annotation": ["sentiment analysis"], | ||
"Infrastructure": "CLARIN", | ||
"Group": ["Sentiment analysis"], | ||
"Access": { | ||
"Download": "https://hdl.handle.net/21.11129/0000-000E-75BA-D" | ||
}, | ||
"Publication": "" | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
{ | ||
"Name": "DeepBankPT", | ||
"URL": "https://hdl.handle.net/21.11129/0000-000B-D350-C", | ||
"Family": "Manually annotated corpora", | ||
"Description": "This is a corpus of grammatical analyses conforming to the <a href=\"https://en.wikipedia.org/wiki/Head-driven_phrase_structure_grammar\">Head Driven Phrase Structure Grammar</a> framework.\nThe sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.", | ||
"Language": ["por"], | ||
"Licence": "CC BY", | ||
"Size": ["3,406 sentences", "44,598 tokens"], | ||
"Annotation": ["grammatical structure"], | ||
"Infrastructure": "CLARIN", | ||
"Group": ["Syntactic parsing"], | ||
"Access": { | ||
"Download": "https://hdl.handle.net/21.11129/0000-000B-D350-C" | ||
}, | ||
"Publication": "" | ||
} |
corpora/manually-annotated-corpora/dependency-bank-pt.json
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
{ | ||
"Name": "DependencyBankPT", | ||
"URL": "https://hdl.handle.net/21.11129/0000-000B-D34C-2", | ||
"Family": "Manually annotated corpora", | ||
"Description": "This is a corpus of syntactic dependencies.\nThe sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.", | ||
"Language": ["por"], | ||
"Licence": "CC BY", | ||
"Size": ["3,406 sentences", "44,598 tokens"], | ||
"Annotation": ["grammatical structure"], | ||
"Infrastructure": "CLARIN", | ||
"Group": ["Syntactic parsing"], | ||
"Access": { | ||
"Download": "https://hdl.handle.net/21.11129/0000-000B-D34C-2" | ||
}, | ||
"Publication": "" | ||
} |
corpora/manually-annotated-corpora/logical-form-bank-pt.json
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
{ | ||
"Name": "LogicalFormBankPT", | ||
"URL": "https://hdl.handle.net/21.11129/0000-000B-D34E-0", | ||
"Family": "Manually annotated corpora", | ||
"Description": "This is a corpus of sentences annotated with logical forms. The sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.", | ||
"Language": ["por"], | ||
"Licence": "CC BY", | ||
"Size": ["3,406 sentences", "44,598 tokens"], | ||
"Annotation": ["Semantic tags"], | ||
"Infrastructure": "CLARIN", | ||
"Group": ["Other annotation layers"], | ||
"Access": { | ||
"Download": "https://hdl.handle.net/21.11129/0000-000B-D34E-0" | ||
}, | ||
"Publication": "" | ||
} |
corpora/manually-annotated-corpora/manual-for-teaching.json
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
{ | ||
"Name": "Manually annotated corpora for teaching and learning purposes of Brazilian Portuguese, Dutch, Estonian, and Slovene", | ||
"URL": "https://hdl.handle.net/21.11129/0000-0010-05DA-3", | ||
"Family": "Manually annotated corpora", | ||
"Description": "These are manually annotated corpora for the teaching and learning of Brazilian Portuguese, Dutch, Estonian, and Slovene.\nEach sentence is labelled “problematic” or “non-problematic” with respect to its suitability for pedagogical use.\nThe corpus is available from PORTULAN.", | ||
"Language": ["est", "nld", "slv", "por"], | ||
"Licence": "CC BY", | ||
"Size": ["10,000 sentences"], | ||
"Annotation": ["error tagging"], | ||
"Infrastructure": "CLARIN", | ||
"Group": ["Other annotation layers"], | ||
"Access": { | ||
"Download": "https://hdl.handle.net/21.11129/0000-0010-05DA-3" | ||
}, | ||
"Publication": "" | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
{ | ||
"Name": "PropBankPT", | ||
"URL": "https://hdl.handle.net/21.11129/0000-000B-D34B-3", | ||
"Family": "Manually annotated corpora", | ||
"Description": "This is a corpus of sentences annotated with their constituency structure and semantic role tags. The sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.", | ||
"Language": ["por"], | ||
"Licence": "CC BY", | ||
"Size": ["3,406 sentences", "44,598 tokens"], | ||
"Annotation": ["Syntactic parsing", "Semantic role tags"], | ||
"Infrastructure": "CLARIN", | ||
"Group": ["Syntactic parsing", "Other annotation layers"], | ||
"Access": { | ||
"Download": "https://hdl.handle.net/21.11129/0000-000B-D34B-3" | ||
}, | ||
"Publication": "" | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
{ | ||
"Name": "TreeBankPT", | ||
"URL": "https://hdl.handle.net/21.11129/0000-000B-D34B-3", | ||
"Family": "Manually annotated corpora", | ||
"Description": "This is a corpus of syntactic constituency trees. The sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.", | ||
"Language": ["por"], | ||
"Licence": "CC BY", | ||
"Size": ["3,406 sentences", "44,598 tokens"], | ||
"Annotation": ["Syntactic parsing"], | ||
"Infrastructure": "CLARIN", | ||
"Group": ["Syntactic parsing"], | ||
"Access": { | ||
"Download": "https://hdl.handle.net/21.11129/0000-000B-D34B-3" | ||
}, | ||
"Publication": "" | ||
} |