Skip to content

Commit

Permalink
Add files via upload
Browse files Browse the repository at this point in the history
  • Loading branch information
jakoble authored Oct 28, 2024
1 parent e3dc89d commit 3595f90
Show file tree
Hide file tree
Showing 7 changed files with 112 additions and 0 deletions.
16 changes: 16 additions & 0 deletions corpora/manually-annotated-corpora/affect-in-tweets.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
{
"Name": "Affect in Tweets PT",
"URL": "https://hdl.handle.net/21.11129/0000-000E-75BA-D",
"Family": "Manually annotated corpora",
"Description": "This is a data set of Portuguese tweets labelled with the emotion conveyed in the tweet.\nEach tweet is labelled with an emotion (i.e., anger, fear, joy, sadness).\nThe corpus is available from PORTULAN.",
"Language": ["por"],
"Licence": "CC BY",
"Size": ["11,219 tweets"],
"Annotation": ["sentiment analysis"],
"Infrastructure": "CLARIN",
"Group": ["Sentiment analysis"],
"Access": {
"Download": "https://hdl.handle.net/21.11129/0000-000E-75BA-D"
},
"Publication": ""
}
16 changes: 16 additions & 0 deletions corpora/manually-annotated-corpora/deepbankpt.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
{
"Name": "DeepBankPT",
"URL": "https://hdl.handle.net/21.11129/0000-000B-D350-C",
"Family": "Manually annotated corpora",
"Description": "This is a corpus of grammatical analyses conforming to the <a href=\"https://en.wikipedia.org/wiki/Head-driven_phrase_structure_grammar\">Head Driven Phrase Structure Grammar</a> framework.\nThe sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.",
"Language": ["por"],
"Licence": "CC BY",
"Size": ["3,406 sentences", "44,598 tokens"],
"Annotation": ["grammatical structure"],
"Infrastructure": "CLARIN",
"Group": ["Syntactic parsing"],
"Access": {
"Download": "https://hdl.handle.net/21.11129/0000-000B-D350-C"
},
"Publication": ""
}
16 changes: 16 additions & 0 deletions corpora/manually-annotated-corpora/dependency-bank-pt.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
{
"Name": "DependencyBankPT",
"URL": "https://hdl.handle.net/21.11129/0000-000B-D34C-2",
"Family": "Manually annotated corpora",
"Description": "This is a corpus of syntactic dependencies.\nThe sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.",
"Language": ["por"],
"Licence": "CC BY",
"Size": ["3,406 sentences", "44,598 tokens"],
"Annotation": ["grammatical structure"],
"Infrastructure": "CLARIN",
"Group": ["Syntactic parsing"],
"Access": {
"Download": "https://hdl.handle.net/21.11129/0000-000B-D34C-2"
},
"Publication": ""
}
16 changes: 16 additions & 0 deletions corpora/manually-annotated-corpora/logical-form-bank-pt.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
{
"Name": "LogicalFormBankPT",
"URL": "https://hdl.handle.net/21.11129/0000-000B-D34E-0",
"Family": "Manually annotated corpora",
"Description": "This is a corpus of sentences annotated with logical forms. The sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.",
"Language": ["por"],
"Licence": "CC BY",
"Size": ["3,406 sentences", "44,598 tokens"],
"Annotation": ["Semantic tags"],
"Infrastructure": "CLARIN",
"Group": ["Other annotation layers"],
"Access": {
"Download": "https://hdl.handle.net/21.11129/0000-000B-D34E-0"
},
"Publication": ""
}
16 changes: 16 additions & 0 deletions corpora/manually-annotated-corpora/manual-for-teaching.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
{
"Name": "Manually annotated corpora for teaching and learning purposes of Brazilian Portuguese, Dutch, Estonian, and Slovene",
"URL": "https://hdl.handle.net/21.11129/0000-0010-05DA-3 ",
"Family": "Manually annotated corpora",
"Description": "These are manually annotated corpora for teaching and learning purposes of Brazilian Portuguese, Dutch, Estonian, and Slovene.\nSentences are annotated with “problematic” or “non-problematic” labels, from the point of usage for pedagogical purposes.\nThe corpus is available from PORTULAN.",
"Language": ["est", "nld", "slv", "por"],
"Licence": "CC BY",
"Size": ["10,000 sentences"],
"Annotation": ["error tagging"],
"Infrastructure": "CLARIN",
"Group": ["Other annotation layers"],
"Access": {
"Download": "https://hdl.handle.net/21.11129/0000-0010-05DA-3"
},
"Publication": ""
}
16 changes: 16 additions & 0 deletions corpora/manually-annotated-corpora/propbankpt.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
{
"Name": "PropBankPT",
"URL": "https://hdl.handle.net/21.11129/0000-000B-D34B-3",
"Family": "Manually annotated corpora",
"Description": "This is a corpus of sentences annotated with their constituency structure and semantic role tags. The sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.",
"Language": ["por"],
"Licence": "CC BY",
"Size": ["3,406 sentences", "44,598 tokens"],
"Annotation": ["Syntactic parsing", "Semantic role tags"],
"Infrastructure": "CLARIN",
"Group": ["Syntactic parsing", "Other annotation layers"],
"Access": {
"Download": "https://hdl.handle.net/21.11129/0000-000B-D34B-3"
},
"Publication":""
}
16 changes: 16 additions & 0 deletions corpora/manually-annotated-corpora/treebank-pt.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
{
"Name": "TreeBankPT",
"URL": "https://hdl.handle.net/21.11129/0000-000B-D34B-3",
"Family": "Manually annotated corpora",
"Description": "This is a corpus of syntactic constituency trees. The sentences are translations from the Wall Street Journal.\nThe corpus is available from PORTULAN.",
"Language": ["por"],
"Licence": "CC BY",
"Size": ["3,406 sentences", "4,598 tokens"],
"Annotation": ["Syntactic parsing"],
"Infrastructure": "CLARIN",
"Group": ["Syntactic parsing"],
"Access": {
"Download": "https://hdl.handle.net/21.11129/0000-000B-D34B-3"
},
"Publication": ""
}

0 comments on commit 3595f90

Please sign in to comment.