attn

Here are 4 public repositories matching this topic...

kyegomez / MultiModalCrossAttn

An open-source implementation of the cross-attention mechanism from the paper "Jointly Training Large Autoregressive Multimodal Models".

artificial-intelligence attention attention-mechanism attention-is-all-you-need multimodal multimodal-deep-learning gpt4 attn

Updated Mar 11, 2024
Python

zer0int / ComfyUI-CLIP-Flux-Layer-Shuffle

Comfy Nodes (and a CLI script) for shuffling around layers in transformer models, creating a curious confusion.

flux research experimental layer attention mlp shuffle attn flux1

Updated Oct 14, 2024
Python

zer0int / ComfyUI-GPT2-Layer-Shuffle-Prompting

Shuffle GPT-2's layers and have it prompt an image. The node works with any model: Flux, SD3, SDXL...

flux research experimental layer attention mlp shuffle gpt-2 gpt2 sd3 llm prompting comfyui sdxl attn flux1

Updated Oct 13, 2024
Python

Agora-Lab-AI / HydraNet

HydraNet is a state-of-the-art transformer architecture that combines Multi-Query Attention (MQA), Mixture of Experts (MoE), and continuous learning capabilities.

transformers moe attention agora attn liquid-models lfms agoralabs

Updated Dec 23, 2024
Shell
