Optimizing Sukoon's Evaluation Pipeline with Deep Learning Models #8
Replies: 3 comments 1 reply
-
Current Approach We are currently using an analysis script that makes API calls with the following prompt:
Example Discussion Points: I invite everyone to share their insights, experiences, and suggestions on the following: Technical Approaches: Any ideas on model architectures, training methods, or pipelines that could work for us. |
Beta Was this translation helpful? Give feedback.
-
I found some relevant kaggle dataset for empathy judgement https://www.kaggle.com/datasets/atharvjairath/empathetic-dialogues-facebook-ai/data But the problem is, this is trained on 1-2 liner reponses, which also seems just. The bigger the response the harder it will be for our model to tell the empathy score. Also summarising the bots output will make it lose its context and the pattern in the text thus hampering our task again |
Beta Was this translation helpful? Give feedback.
-
https://www.canva.com/design/DAGSI3wDVfY/VKBQYBWo1_vERGVXqpuxXA/edit?utm_content=DAGSI3wDVfY&utm_campaign=designshare&utm_medium=link2&utm_source=sharebutton |
Beta Was this translation helpful? Give feedback.
-
Hi Devs,
We are working on enhancing the evaluation pipeline for Sukoon, our mental health chatbot. Currently, we use an analysis script that makes API calls with a specific prompt to evaluate conversations and generate JSON-formatted analyses. We want to try out some deep learning models for the same.
Objective:
Automate and Scale: Reduce reliance on external API calls to improve scalability.
Enhance Analysis: Utilize deep learning models for more nuanced and accurate evaluations.
Seamless Integration: Incorporate these capabilities directly into our pipeline.
Current Approach: Our script analyzes user messages and Sukoon's responses to produce a JSON output that includes:
Implementing Structured Output:
How to map model predictions directly to our required JSON format.
Best practices for generating structured data from model outputs.
Evaluation Metrics:
Appropriate metrics to assess model performance in this context.
Methods to validate the accuracy and helpfulness of the analyses.
Your Expertise Matters!
Your insights and suggestions will be invaluable in helping us improve Sukoon's evaluation pipeline. Any resources, experiences, or ideas you can share would be greatly appreciated.
Thank you and let's work together!
Beta Was this translation helpful? Give feedback.
All reactions