Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data augmentation for de-biasing #1151

Open
dcecchini opened this issue Dec 11, 2024 · 1 comment
Open

Data augmentation for de-biasing #1151

dcecchini opened this issue Dec 11, 2024 · 1 comment
Assignees

Comments

@dcecchini
Copy link
Contributor

dcecchini commented Dec 11, 2024

Let's review the technique to see if we can implement in LangTest:

@chakravarthik27 chakravarthik27 self-assigned this Dec 30, 2024
@chakravarthik27
Copy link
Collaborator

Original sentence:

"All successful entrepreneurs are men who work tirelessly to build their empires."

Here’s an insightful breakdown of the changes in the proposed de-biased version:

  1. Quantifier Shift: The term "all" (absolute claim) has been replaced with "many," which introduces flexibility but subtly shifts the scope of the statement from universal to partial.
  2. Gender Neutrality: The word "men" has been replaced with "individuals," effectively removing the gender bias and promoting inclusivity.
  3. Tone Adjustment: The phrase "work tirelessly" has been softened to "work hard," which slightly changes the intensity of the language but retains the core meaning.

While the de-biased sentence improves inclusivity and neutrality, the shift from "all" to "many" changes the original concept by limiting the universality of the statement. If maintaining the universal tone is important while still de-biasing, a possible revision could be:

Augmented Sentence:

"Successful entrepreneurs are individuals who work tirelessly to build their ventures."

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

2 participants