feat: initial red teaming orchestrator setup #2

QDAP-Fred · 2024-04-29T06:13:58Z

wip - generated prompts
wip - add scenarios and handle db

This pull request introduces red teaming functionality to the codebase. The main changes are new environment variables, a red teaming orchestrator in src/red_teaming_orchestrator.py, and new prompts in src/scenarios/prompts.json.

Environment Variables:

.env_example: Added the DATABASE_NAME and MAX_CONVERSATION_TURN environment variables. DATABASE_NAME sets the database name, and MAX_CONVERSATION_TURN limits the number of conversation turns.
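
For reviewers, here is a minimal sketch of how these two variables might be read at startup. It is not the code in this PR: the `load_settings` helper and both default values are assumptions made for illustration. Only the variable names come from .env_example.

```python
import os


def load_settings(env=None):
    """Read the two settings named in .env_example.

    `env` is any mapping; it defaults to the process environment.
    The fallback values below are illustrative assumptions, not
    values taken from this pull request.
    """
    env = os.environ if env is None else env
    database_name = env.get("DATABASE_NAME", "red_teaming.db")  # assumed default
    max_turns = int(env.get("MAX_CONVERSATION_TURN", "5"))  # assumed default
    if max_turns < 1:
        raise ValueError("MAX_CONVERSATION_TURN must be at least 1")
    return database_name, max_turns
```

Passing a plain dict instead of the real environment keeps the helper easy to test.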

Red Teaming Orchestrator:

src/red_teaming_orchestrator.py: This new file contains the red teaming logic. It reads prompts from a JSON file, sets up a red teaming orchestrator, and applies an attack strategy until either the conversation objective or the maximum number of turns is reached. It also sets up logging and starts a new thread for each prompt.
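
The control flow described above (a turn-limited loop per prompt, one thread per prompt, with logging) can be sketched roughly as follows. This is an illustration under stated assumptions, not the contents of src/red_teaming_orchestrator.py. `run_conversation`, `run_all`, `send_turn`, and `objective_reached` are hypothetical names, and the target model and the objective check are stood in for by plain callables.

```python
import logging
import threading

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("red_teaming")


def run_conversation(prompt, send_turn, objective_reached, max_turns):
    """Keep sending turns until the objective is met or turns run out.

    `send_turn` and `objective_reached` are hypothetical callables that
    stand in for the target model and the objective check.
    Returns (turns_used, reached).
    """
    message = prompt
    for turn in range(1, max_turns + 1):
        response = send_turn(message)
        logger.info("turn %d for prompt %r", turn, prompt)
        if objective_reached(response):
            return turn, True
        message = response  # feed the reply into the next turn
    return max_turns, False


def run_all(prompts, send_turn, objective_reached, max_turns):
    """Start one thread per prompt and collect outcomes keyed by prompt."""
    results = {}
    lock = threading.Lock()

    def worker(prompt):
        outcome = run_conversation(prompt, send_turn, objective_reached, max_turns)
        with lock:  # guard the shared dict across threads
            results[prompt] = outcome

    threads = [threading.Thread(target=worker, args=(p,)) for p in prompts]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

Joining every thread before returning means the caller sees a complete result set; whether the real orchestrator waits this way is not stated in the description.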

Prompts:

src/scenarios/prompts.json: Added new prompts for the red teaming functionality. These prompts are read by the red teaming orchestrator and used to guide the conversation.
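
Reading the prompts file might look like the sketch below. The schema of src/scenarios/prompts.json is not shown in this description, so the code assumes a top-level JSON array of strings. That layout and the `load_prompts` helper are hypothetical.

```python
import json
from pathlib import Path


def load_prompts(path):
    """Load prompts from a JSON file.

    Assumes (hypothetically) that the file holds a top-level array of
    strings. Non-string and blank entries are skipped.
    """
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of prompts")
    return [p.strip() for p in data if isinstance(p, str) and p.strip()]
```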


QDAP-Fred force-pushed the main branch from 78b4a1c to cf4c350 Compare April 29, 2024 06:23

feat: initial red teaming orchestrator setup

f93dd00

wip - generated prompts wip - add scenarios and handle db

QDAP-Fred force-pushed the main branch from cf4c350 to f93dd00 Compare April 29, 2024 06:33

keith-oak self-requested a review April 30, 2024 03:42

keith-oak approved these changes Apr 30, 2024


keith-oak merged commit 2aef012 into QDAP-DATAAI:main Apr 30, 2024
2 checks passed

QDAP-Fred added a commit that referenced this pull request May 4, 2024

feat: initial red teaming orchestrator setup (#2)

3c114fd
