Extractive Nepali Question Answering System

There is a noticeable gap in language processing tools and resources for Nepali, a language spoken by millions yet significantly underrepresented in computational linguistics. To help close this gap, we have developed a web application and a browser extension that let you ask questions in Nepali and get answers extracted from your documents or from information on the web. Whether it’s students looking up facts for school or professionals searching for news and updates, our project aims to make information easier to access, thereby empowering Nepali speakers to thrive in the digital age.

Here's our Demo Video.

Methodology & Architecture

Because dedicated Nepali question answering datasets are scarce, we build our training data by translating existing datasets into Nepali. Conventional translation methods often fail to keep answer spans intact, so we employ translation-invariant tokens to preserve answer spans across languages, which improves the fidelity of the translated data. To further boost the model's performance, we also translate the data into multiple Indo-Aryan languages, leveraging their similar linguistic structures. We assess the quality of the translated dataset qualitatively through human evaluation and quantitatively using an LLM.
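As a rough illustration of how marker tokens can carry an answer span through translation, here is a minimal sketch. The `[[` and `]]` markers, the function names, and the `toy_translate` lookup are all hypothetical stand-ins chosen for this example; the actual tokens and translation system used in the project are not shown here.

```python
# Hypothetical marker tokens, assumed to pass through translation unchanged.
OPEN, CLOSE = "[[", "]]"


def mark_answer(context: str, answer_start: int, answer_text: str) -> str:
    """Wrap the answer span in marker tokens before translation."""
    answer_end = answer_start + len(answer_text)
    if context[answer_start:answer_end] != answer_text:
        raise ValueError("answer_start does not point at answer_text")
    return context[:answer_start] + OPEN + answer_text + CLOSE + context[answer_end:]


def recover_answer(translated: str):
    """Strip the markers and return (context, answer_start, answer_text).

    Returns None if the markers were lost or reordered, so the sample
    can be discarded instead of being given a wrong span.
    """
    start = translated.find(OPEN)
    end = translated.find(CLOSE)
    if start == -1 or end == -1 or end < start:
        return None
    answer = translated[start + len(OPEN):end]
    context = translated[:start] + answer + translated[end + len(CLOSE):]
    return context, start, answer


def toy_translate(marked: str) -> str:
    """Stand-in for a real translation system (a fixed lookup for the demo)."""
    lookup = {
        "The capital of Nepal is [[Kathmandu]].": "नेपालको राजधानी [[काठमाडौं]] हो।",
    }
    return lookup[marked]


english = "The capital of Nepal is Kathmandu."
marked = mark_answer(english, english.index("Kathmandu"), "Kathmandu")
context, answer_start, answer_text = recover_answer(toy_translate(marked))
print(context, answer_start, answer_text)
```

Returning `None` when the markers do not survive lets a pipeline filter out unreliable samples rather than guess where the answer ended up.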

We fine-tune MuRIL to predict the start and end tokens of the answer given a passage and a question. Passages longer than the model’s context length are segmented into blocks that fit within it. We then compare candidate answers across blocks using their token probabilities and select the best one.
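The block segmentation and cross-block selection described above can be sketched as follows. This is an illustration under assumptions, not the project's code: the overlapping stride, the `max_answer_len` limit, and scoring a span as the product of its start and end probabilities are common choices that we assume here, and the probabilities in the demo are made up rather than produced by a model.

```python
from typing import List, Tuple


def split_into_blocks(tokens: List[str], block_len: int, stride: int) -> List[Tuple[int, List[str]]]:
    """Cover the passage with overlapping windows; return (offset, block) pairs."""
    if not 0 < stride <= block_len:
        raise ValueError("stride must be in (0, block_len]")
    blocks, offset = [], 0
    while True:
        blocks.append((offset, tokens[offset:offset + block_len]))
        if offset + block_len >= len(tokens):
            break
        offset += stride
    return blocks


def best_span(start_probs: List[float], end_probs: List[float],
              max_answer_len: int = 30) -> Tuple[float, int, int]:
    """Best (score, start, end) within one block, with start <= end.

    Assumed scoring: score = p_start * p_end.
    """
    best = (0.0, 0, 0)
    for s, p_start in enumerate(start_probs):
        for e in range(s, min(s + max_answer_len, len(end_probs))):
            score = p_start * end_probs[e]
            if score > best[0]:
                best = (score, s, e)
    return best


def select_answer(block_outputs, max_answer_len: int = 30) -> Tuple[float, int, int]:
    """Pick the best span over all blocks, in passage-level token indices.

    block_outputs holds (offset, start_probs, end_probs) for each block.
    """
    best = (0.0, 0, 0)
    for offset, start_probs, end_probs in block_outputs:
        score, s, e = best_span(start_probs, end_probs, max_answer_len)
        if score > best[0]:
            best = (score, offset + s, offset + e)
    return best


tokens = list("abcdefghij")  # placeholder tokens for a 10-token passage
blocks = split_into_blocks(tokens, block_len=4, stride=3)

# Made-up per-block probabilities standing in for model output.
fake_outputs = [
    (blocks[0][0], [0.1, 0.2, 0.3, 0.4], [0.4, 0.3, 0.2, 0.1]),
    (blocks[1][0], [0.05, 0.8, 0.1, 0.05], [0.05, 0.1, 0.8, 0.05]),
    (blocks[2][0], [0.25, 0.25, 0.25, 0.25], [0.25, 0.25, 0.25, 0.25]),
]
score, start, end = select_answer(fake_outputs)
print(score, tokens[start:end + 1])
```

Overlapping the windows reduces the chance that an answer is cut in half at a block boundary, at the cost of running the model on more blocks.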

Yunika-Bajracharya/Extractive-Nepali-QA
