feat: initial red teaming orchestrator setup #2

Merged (1 commit) on Apr 30, 2024

Conversation

QDAP-Fred
wip - generated prompts
wip - add scenarios and handle db

This pull request introduces red teaming functionality to the codebase. The main changes are new environment variables, a red teaming orchestrator in src/red_teaming_orchestrator.py, and new prompts in src/scenarios/prompts.json.

Environment Variables:

  • .env_example: Added the DATABASE_NAME and MAX_CONVERSATION_TURN environment variables, which set the database name and cap the number of conversation turns, respectively.
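
As an illustration of how the two variables might be consumed, here is a minimal sketch using only the standard library. The `Settings` class, the `load_settings` helper, and both default values are hypothetical and not taken from this PR; only the variable names DATABASE_NAME and MAX_CONVERSATION_TURN come from .env_example.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    """Hypothetical container for the two settings named in .env_example."""
    database_name: str
    max_conversation_turn: int


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Read both variables from a mapping (defaults to os.environ)."""
    env = os.environ if env is None else env
    # The fallback values below are assumptions for illustration only.
    database_name = env.get("DATABASE_NAME", "red_teaming.db")
    # int() raises ValueError if the value is not a whole number.
    max_turns = int(env.get("MAX_CONVERSATION_TURN", "5"))
    if max_turns < 1:
        raise ValueError("MAX_CONVERSATION_TURN must be a positive integer")
    return Settings(database_name, max_turns)
```

Passing the mapping explicitly keeps the helper easy to test without touching the real process environment.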

Red Teaming Orchestrator:

  • src/red_teaming_orchestrator.py: This new file contains the red teaming logic. It reads prompts from a JSON file, sets up a red teaming orchestrator, and applies an attack strategy until the conversation objective is met or the maximum number of turns is reached. It also sets up logging and starts a new thread for each prompt.
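
The control flow described above (loop until the objective is met or the turn limit is hit, one thread per prompt) can be sketched roughly as follows. This is not the code from src/red_teaming_orchestrator.py: `run_conversation`, `run_all`, `send_turn`, and `objective_reached` are hypothetical names, and the target and scorer are stand-in callables supplied by the caller.

```python
import logging
import threading
from typing import Callable, List, Tuple

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("red_teaming_sketch")


def run_conversation(
    prompt: str,
    send_turn: Callable[[str], str],
    objective_reached: Callable[[str], bool],
    max_turns: int,
) -> Tuple[bool, int]:
    """Return (objective met?, turns used) for a single prompt."""
    message = prompt
    for turn in range(1, max_turns + 1):
        reply = send_turn(message)  # hypothetical call to the target
        logger.info("prompt %r: finished turn %d", prompt, turn)
        if objective_reached(reply):
            return True, turn
        message = reply  # feed the reply into the next turn
    return False, max_turns


def run_all(
    prompts: List[str],
    send_turn: Callable[[str], str],
    objective_reached: Callable[[str], bool],
    max_turns: int,
) -> List[Tuple[bool, int]]:
    """Start one thread per prompt and collect results in prompt order."""
    results: List[Tuple[bool, int]] = [(False, 0)] * len(prompts)

    def worker(index: int, prompt: str) -> None:
        # Each thread writes only to its own slot, so no lock is needed.
        results[index] = run_conversation(
            prompt, send_turn, objective_reached, max_turns
        )

    threads = [
        threading.Thread(target=worker, args=(i, p))
        for i, p in enumerate(prompts)
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results
```

Returning a `(met, turns)` pair distinguishes a conversation that succeeded on its last allowed turn from one that ran out of turns.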

Prompts:

  • src/scenarios/prompts.json: Added new prompts for the red teaming functionality. These prompts are read by the red teaming orchestrator and used to guide the conversation.
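
The structure of prompts.json is not shown in this description, so the following sketch assumes the simplest possible shape: a JSON array of prompt strings. The `parse_prompts` and `load_prompts` helpers are hypothetical, and the real file may use a different schema.

```python
import json
from pathlib import Path
from typing import List, Union


def parse_prompts(text: str) -> List[str]:
    """Parse JSON text, assuming (hypothetically) an array of strings."""
    data = json.loads(text)
    if not isinstance(data, list) or not all(
        isinstance(item, str) for item in data
    ):
        raise ValueError("expected a JSON array of prompt strings")
    return data


def load_prompts(path: Union[str, Path]) -> List[str]:
    """Read and validate a prompts file at the given path."""
    return parse_prompts(Path(path).read_text(encoding="utf-8"))
```

For example, `load_prompts("src/scenarios/prompts.json")` would return the list of prompts if the file matches the assumed shape, and raise `ValueError` otherwise.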

@keith-oak keith-oak merged commit 2aef012 into QDAP-DATAAI:main Apr 30, 2024
2 checks passed