Skip to content
View mihirp1998's full-sized avatar
  • Pittsburgh

Highlights

  • Pro

Block or report mihirp1998

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. AlignProp AlignProp Public

    AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods…

    Python 228 7

  2. VADER VADER Public

    Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

    Python 196 15

  3. Diffusion-TTA Diffusion-TTA Public

    Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.

    Python 49 4

  4. Slot-TTA Slot-TTA Public

    Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.

    Python 23 3

  5. Disentangling-3D-Prototypical-Nets Disentangling-3D-Prototypical-Nets Public

    We present neural architectures that disentangle RGB-D images into objects' shapes and styles and a map of the background scene, and explore their applications for few-shot 3D object detection and …

    Python 11

  6. huggingface/trl huggingface/trl Public

    Train transformer language models with reinforcement learning.

    Python 9.3k 1.2k