Skip to content
Change the repository type filter

All

    Repositories list

    • LINDAT/CLARIN digital repository based on DSpace
      Java
      BSD 3-Clause "New" or "Revised" License
      1.3k0120Updated Nov 12, 2024Nov 12, 2024
    • Repo for tracking resources for the Mezzanine project
      0000Updated Nov 12, 2024Nov 12, 2024
    • An ever-expanding overview of the knowledge on large language models (LLMs), speech technologies, and other NLP technologies for Slovenian language.
      0000Updated Oct 30, 2024Oct 30, 2024
    • STARK

      Public
      Python
      Apache License 2.0
      1401Updated Oct 30, 2024Oct 30, 2024
    • Repository for SloBench evaluation docker images
      Perl
      3100Updated Oct 29, 2024Oct 29, 2024
    • mte-msd

      Public
      MULTEXT-East morphosyntactic specifications
      HTML
      11000Updated Sep 23, 2024Sep 23, 2024
    • Code for ParlaSent research note
      Jupyter Notebook
      GNU General Public License v3.0
      1001Updated Sep 21, 2024Sep 21, 2024
    • Editor for normalising learner texts (error annotation and tagging.)
      TypeScript
      MIT License
      3000Updated Sep 4, 2024Sep 4, 2024
    • Tool for extracting linguistic features with highest (known) variation among the HBS standards
      Python
      0000Updated Jul 17, 2024Jul 17, 2024
    • A two-mode (standard, nonstandard) tokeniser for South Slavic languages
      Python
      Apache License 2.0
      7521Updated Jul 9, 2024Jul 9, 2024
    • rsdo_gos

      Public
      Software for the GOS corpus of spoken Slovenian
      C#
      0000Updated May 9, 2024May 9, 2024
    • Data for the DIALECT-COPA unshared task of dialectal causal common-sense reasoning
      0200Updated Apr 23, 2024Apr 23, 2024
    • classla

      Public
      CLASSLA Fork of the Official Stanford NLP Python Library for Many Human Languages
      Python
      Other
      8933821Updated Apr 10, 2024Apr 10, 2024
    • Python
      Apache License 2.0
      0000Updated Apr 3, 2024Apr 3, 2024
    • drevesnik

      Public
      Web portal for searching and displaying syntacically annotated corpora
      JavaScript
      0100Updated Mar 1, 2024Mar 1, 2024
    • Python
      0100Updated Feb 29, 2024Feb 29, 2024
    • A converter that converts Slovene words to their IPA and/or SAMPA transcriptions.
      Python
      Apache License 2.0
      2001Updated Jan 30, 2024Jan 30, 2024
    • Slovene text normalization tool
      Python
      Apache License 2.0
      2101Updated Jan 24, 2024Jan 24, 2024
    • benchich

      Public
      BENCHić - the benchmark for Bosnian, Croatian, Montenegrin, Serbian (and friends)
      Python
      0210Updated Jan 10, 2024Jan 10, 2024
    • cordex

      Public
      Python
      MIT License
      0110Updated Dec 15, 2023Dec 15, 2023
    • Benchmarking NLP tools on Slovene, Croatian and Serbian
      Python
      3710Updated Dec 7, 2023Dec 7, 2023
    • Python
      0000Updated Nov 24, 2023Nov 24, 2023
    • TypeScript
      Apache License 2.0
      0000Updated Nov 22, 2023Nov 22, 2023
    • Python
      Apache License 2.0
      0000Updated Nov 22, 2023Nov 22, 2023
    • SemSex

      Public
      HTML
      1000Updated Nov 12, 2023Nov 12, 2023
    • Python
      MIT License
      0000Updated Sep 29, 2023Sep 29, 2023
    • Apache License 2.0
      0000Updated Sep 12, 2023Sep 12, 2023
    • Neural Machine Translation tool
      Python
      Apache License 2.0
      1301Updated Sep 6, 2023Sep 6, 2023
    • Apache License 2.0
      0000Updated Aug 7, 2023Aug 7, 2023
    • Automatic Speech Recognition tool
      Python
      Apache License 2.0
      21621Updated Aug 5, 2023Aug 5, 2023