Performed as part of the COMS6998 course at Columbia University.
Samanantar : https://huggingface.co/datasets/ai4bharat/samanantar
IndicSentenceSummarization : https://huggingface.co/datasets/ai4bharat/IndicSentenceSummarization
Bloomz-560m : https://huggingface.co/bigscience/bloomz-560m
mBART-large : https://huggingface.co/facebook/mbart-large-50-many-to-many-mmt
IndicBART-XXEN : https://huggingface.co/ai4bharat/IndicBART-XXEN
For each model, the existing code base available at https://huggingface.co/ is refactored to perform the following tasks:
- Machine Translation
- Summarization
Each output file is further processed to compute the mean value of each of the following metrics:
- METEOR
- BLEU
- ROUGE
- BERTScore
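
The averaging step could be sketched roughly as follows. This is only an illustration, not the project's actual script: it assumes each output file is in JSON Lines form with one record per example, and that each record already holds per-example scores under the hypothetical field names `meteor`, `bleu`, `rouge`, and `bertscore`. The function name `mean_metrics` is likewise made up for this example.

```python
import json
from collections import defaultdict
from statistics import mean

# Hypothetical field names; the real output files may use different keys.
METRICS = ("meteor", "bleu", "rouge", "bertscore")


def mean_metrics(lines, metrics=METRICS):
    """Average per-example scores from an iterable of JSON Lines strings."""
    scores = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        for name in metrics:
            if name in record:  # tolerate records that lack a metric
                scores[name].append(float(record[name]))
    return {name: mean(values) for name, values in scores.items()}


# Two made-up records standing in for one output file.
sample = [
    '{"meteor": 0.5, "bleu": 0.25, "rouge": 0.5, "bertscore": 0.75}',
    '{"meteor": 0.75, "bleu": 0.75, "rouge": 0.25, "bertscore": 1.0}',
]
print(mean_metrics(sample))
```

Because the function accepts any iterable of lines, an open file handle can be passed in directly, for example `with open(path) as f: mean_metrics(f)`, where `path` points to one output file.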