Skip to content

ReaFly/Awesome-Vision-Mamba

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 

Repository files navigation

Awesome-Vision-Mamba

✨✨Latest Papers on Vision Mamba and Related Areas

Survey

  • Mamba in Vision: A Comprehensive Survey of Techniques and Applications [arxiv]
  • Vision Mamba: A Comprehensive Survey and Taxonomy [arxiv]
  • A Survey on Vision Mamba: Models, Applications and Challenges [arxiv]
  • Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges [arxiv]
  • A Survey on Visual Mamba [arxiv]
  • State Space Model for New-Generation Network Alternative to Transformers: A Survey [arxiv]

Computer Vision

  • Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion [arxiv] [code]
  • Mamba-MOC: A Multicategory Remote Object Counting via State Space Model [arxiv] [code]
  • MS-Temba: Multi-Scale Temporal Mamba for Efficient Temporal Action Detection [arxiv] [code]
  • MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification [arxiv] [code]
  • Detail Matters: Mamba-Inspired Joint Unfolding Network for Snapshot Spectral Compressive Imaging [arxiv] [codee]
  • DepthMamba with Adaptive Fusion [arxiv]
  • MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration [arxiv]
  • MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing [arxiv]
  • STNMamba: Mamba-based Spatial-Temporal Normality Learning for Video Anomaly Detection [arxiv]
  • PTQ4VM: Post-Training Quantization for Visual Mamba [arxiv] [code]
  • Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization [arxiv]
  • COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection [arxiv] [code]
  • V“Mean”ba: Visual State Space Models only need 1 hidden dimension [arxiv]
  • FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation [arxiv]
  • Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor [arxiv]
  • Trusted Mamba Contrastive Network for Multi-View Clustering [arxiv]
  • Multi-dimensional Visual Prompt Enhanced Image Restoration via Mamba-Transformer Aggregation [arxiv] [code]
  • Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks [arxiv]
  • Efficient Self-Supervised Video Hashing with Selective State Spaces [arxiv] [code]
  • Robust Tracking via Mamba-based Context-aware Token Learning [arxiv] [code]
  • MambaLCT: Boosting Tracking via Long-term Context State Space Model [arxiv] [code]
  • Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training [arxiv]
  • MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt [arxiv] [code]
  • SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation [arxiv] [code]
  • Image Forgery Localization with State Space Models [arxiv]
  • XYScanNet: An Interpretable State Space Model for Perceptual Image Deblurring [arxiv]
  • Selective Visual Prompting in Vision Mamba [arxiv] [code]
  • MPSI: Mamba enhancement model for pixel-wise sequential interaction Image Super-Resolution [arxiv]
  • LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba [arxiv]
  • Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence [arxiv] [code]
  • MSCrackMamba: Leveraging Vision Mamba for Crack Detection in Fused Multispectral Imagery [arxiv]
  • MambaNUT: Nighttime UAV Tracking via Mamba and Adaptive Curriculum Learning [arxiv]
  • AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment [arxiv]
  • MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection [arxiv]
  • Vision Mamba Distillation for Low-resolution Fine-grained Image Classification [arxiv] [code]
  • BadScan: An Architectural Backdoor Attack on Visual State Space Models [arxiv]
  • FTMoMamba: Motion Generation with Frequency and Text State Space Models [arxiv]
  • TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba [arxiv] [code]
  • Deformable Mamba for Wide Field of View Segmentation [arxiv] [code]
  • MobileMamba: Lightweight Multi-Receptive Visual Mamba Network [arxiv] [code]
  • MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking [arxiv]
  • Mamba-CL: Optimizing Selective State Space Model in Null Space for Continual Learning [arxiv]
  • MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking [arxiv]
  • Event USKT : U-State Space Model in Knowledge Transfer for Event Cameras [arxiv]
  • EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality [arxiv] [code]
  • OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction [arxiv]
  • MambaIRv2: Attentive State Space Restoration [arxiv] [code]
  • MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection [arxiv]
  • M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction [arxiv] [code]
  • S3Mamba: Arbitrary-Scale Super-Resolution via Scaleable State Space Model [arxiv] [code]
  • RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model [arxiv]
  • MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba [arxiv]
  • ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal [arxiv]
  • Adaptive Multi Scale Document Binarisation Using Vision Mamba [arxiv]
  • ECMamba: Consolidating Selective State Space Model with Retinex Guidance for Efficient Multiple Exposure Correction [arxiv] [code]
  • SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition [arxiv] [code]
  • MambaSOD: Dual Mamba-Driven Cross-Modal Fusion Network for RGB-D Salient Object Detection [arxiv] [code]
  • Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion [arxiv] [code]
  • MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering [arxiv]
  • START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation [arxiv] [code]
  • MambaSCI: Efficient Mamba-UNet for Quad-Bayer Patterned Video Snapshot Compressive Imaging [arxiv] [code]
  • RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images [arxiv]
  • MambaPainter: Neural Stroke-Based Rendering in a Single Step [arxiv] [code]
  • MambaBEV: An efficient 3D detection model with Mamba2 [arxiv]
  • Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution [arxiv]
  • GlobalMamba: Global Image Serialization for Vision Mamba [arxiv] [code]
  • V2M: Visual 2-Dimensional Mamba for Image Representation Learning [arxiv] [code]
  • CountMamba: Exploring Multi-directional Selective State-Space Models for Plant Counting [arxiv]
  • MatMamba: A Matryoshka State Space Model [arxiv] [code]
  • EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment [arxiv]
  • Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching [arxiv]
  • QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model [arxiv] [code]
  • HRVMamba: High-Resolution Visual State Space Model for Dense Prediction [arxiv] [code]
  • Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection [arxiv] [code]
  • IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification [arxiv]
  • Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking [arxiv] [code]
  • Exploring Token Pruning in Vision State Space Models [arxiv]
  • Hybrid Mamba for Few-Shot Segmentation [arxiv] [code]
  • MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation [arxiv]
  • MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining [arxiv]
  • Path-adaptive Spatio-Temporal State Space Model for Event-based Recognition with Arbitrary Duration [arxiv] [code]
  • DepMamba: Progressive Fusion Mamba for Multimodal Depression Detection [arxiv] [code]
  • GraspMamba: A Mamba-based Language-driven Grasp Detection Framework with Hierarchical Feature Learning [arxiv]
  • Mamba Fusion: Learning Actions Through Questioning [arxiv] [code]
  • PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba [arxiv] [code]
  • SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks [arxiv] [code]
  • Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion [arxiv]
  • Mamba-ST: State Space Model for Efficient Style Transfer [arxiv] [code]
  • SITSMamba for Crop Classification based on Satellite Image Time Series [arxiv] [code]
  • CoMamba: Real-time Cooperative Perception Unlocked with State Space Models [arxiv]
  • Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation [arxiv]
  • Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection [arxiv]
  • CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model [arxiv]
  • Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models [arxiv] [code]
  • PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation [arxiv]
  • Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling [arxiv]
  • Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations [arxiv] [code]
  • Why mamba is effective? Exploit Linear Transformer-Mamba Network for Multi-Modality Image Fusion [arxiv]
  • UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images [arxiv]
  • Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion [arxiv]
  • FMRFT: Fusion Mamba and DETR for Query Time Sequence Intersection Fish Tracking [arxiv]
  • EDCSSM: Edge Detection with Convolutional State Space Model [arxiv]
  • Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training [arxiv] [code]
  • MambaPlace:Text-to-Point-Cloud Cross-Modal Place Recognition with Attention Mamba Mechanisms [arxiv] [code]
  • ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning [arxiv] [code]
  • MTMamba++: Enhancing Multi-Task Dense Scene Understanding via Mamba-Based Decoders [arxiv] [code]
  • PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model [arxiv]
  • O-Mamba: O-shape State-Space Model for Underwater Image Enhancement [arxiv] [code]
  • MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering [arxiv] [code]
  • UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images [arxiv] [code]
  • Exploring Robustness of Visual State Space model against Backdoor Attacks [arxiv]
  • MambaCSR: Dual-Interleaved Scanning for Compressed Image Super-Resolution With SSMs [arxiv]
  • MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model [arxiv]
  • ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement [arxiv] [code]
  • MambaLoc: Efficient Camera Localisation via State Space Model [arxiv]
  • OccMamba: Semantic Occupancy Prediction with State Space Models [arxiv]
  • Multi-Scale Representation Learning for Image Restoration with State-Space Model [arxiv]
  • MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling [arxiv]
  • MambaEVT: Event Stream based Visual Object Tracking using State Space Model [arxiv] [code]
  • MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval [arxiv] [code]
  • DemMamba: Alignment-free Raw Video Demoireing with Frequency-assisted Spatio-Temporal Mamba [arxiv]
  • QMambaBSR: Burst Image Super-Resolution with Query State Space Model [arxiv]
  • RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba [arxiv]
  • ColorMamba: Towards High-quality NIR-to-RGB Spectral Translation with Mamba [arxiv] [code]
  • MambaVT: Spatio-Temporal Contextual Modeling for robust RGB-T Tracking [arxiv]
  • PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model [arxiv]
  • Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network [arxiv]
  • JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Language Model [arxiv]
  • DeMansia: Mamba Never Forgets Any Tokens [arxiv] [code]
  • LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba [arxiv]
  • MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection [arxiv] [code]
  • PhysMamba: Leveraging Dual-Stream Cross-Attention SSD for Remote Physiological Measurement [arxiv]
  • WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification [arxiv] [code]
  • Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement [arxiv] [code]
  • Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification [arxiv] [code]
  • MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection [arxiv]
  • RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining [arxiv] [code]
  • ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2 [arxiv] [code]
  • VSSD: Vision Mamba with Non-Casual State Space Duality [arxiv] [code]
  • LION: Linear Group RNN for 3D Object Detection in Point Clouds [arxiv] [code]
  • ALMRR: Anomaly Localization Mamba on Industrial Textured Surface with Feature Reconstruction and Refinement [arxiv] [code]
  • Mamba meets crack segmentation [arxiv]
  • Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model [arxiv]
  • GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [arxiv] [code]
  • InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation [arxiv] [code]
  • OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting [arxiv]
  • A Mamba-based Siamese Network for Remote Sensing Change Detection [arxiv] [code]
  • HTD-Mamba: Efficient Hyperspectral Target Detection with Pyramid State Space Model [arxiv] [code]
  • MambaVision: A Hybrid Mamba-Transformer Vision Backbone [arxiv] [code]
  • DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing [arxiv] [code]
  • GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification [arxiv] [code]
  • VideoMamba: Spatio-Temporal Selective State Space Model [arxiv]
  • Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning [arxiv] [code]
  • QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024 [arxiv]
  • MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders [arxiv] [code]
  • VFIMamba: Video Frame Interpolation with State Space Models [arxiv] [code]
  • Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model [arxiv] [code]
  • VideoMambaPro: A Leap Forward for Mamba in Video Understanding [arxiv] [code]
  • Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model [arxiv]
  • SUM: Saliency Unification through Mamba for Visual Attention Modeling [arxiv] [code]
  • Vision Mamba-based autonomous crack segmentation on concrete, asphalt, and masonry surfaces [arxiv]
  • LFMamba: Light Field Image Super-Resolution with State Space Model [arxiv]
  • Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection [arxiv] [code]
  • PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery [arxiv] [code]
  • Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment [arxiv]
  • PixMamba: Leveraging State Space Models in a Dual-Level Architecture for Underwater Image Enhancement [arxiv] [code]
  • Towards Evaluating the Robustness of Visual State Space Models [arxiv] [code]
  • DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification [arxiv]
  • Autoregressive Pretraining with Mamba in Vision [arxiv] [code]
  • MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation [arxiv] [code]
  • Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs [arxiv]
  • MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba [arxiv] [code]
  • HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model [arxiv] [code]
  • Mamba YOLO: SSMs-Based YOLO For Object Detection [arxiv] [code]
  • MVGamba: Unify 3D Content Generation as State Space Sequence Modeling [arxiv]
  • RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation [arxiv] [code]
  • GrootVL: Tree Topology is All You Need in State Space Model [arxiv] [code]
  • CDMamba: Remote Sensing Image Change Detection with Mamba [arxiv] [code]
  • LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network [arxiv]
  • Dimba: Transformer-Mamba Diffusion Models [arxiv] [code]
  • S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion [arxiv]
  • DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark [arxiv] [code]
  • FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining [arxiv]
  • Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain [arxiv] [code]
  • MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space [arxiv] [code]
  • Image Deraining with Frequency-Enhanced State Space Model [arxiv]
  • Demystify Mamba in Vision: A Linear Attention Perspective [arxiv] [code]
  • MambaVC: Learned Visual Compression with Selective State Spaces [arxiv]
  • PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis [arxiv] [code]
  • Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models [arxiv] [code]
  • Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model [arxiv] [code]
  • DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis [arxiv]
  • MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models [arxiv]
  • Scalable Visual State Space Model with Fractal Scanning [arxiv]
  • Efficient Visual State Space Model for Image Deblurring [arxiv]
  • Mamba®: Vision Mamba ALSO Needs Registers [arxiv] [code]
  • 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification [arxiv]
  • Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification [arxiv] [code]
  • CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation [arxiv] [code]
  • IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model [arxiv] [code]
  • RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing [arxiv]
  • WaterMamba: Visual State Space Model for Underwater Image Enhancement [arxiv]
  • Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study [arxiv]
  • MambaOut: Do We Really Need Mamba for Vision? [arxiv] [code]
  • OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition [arxiv]
  • Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba [arxiv]
  • Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution [arxiv]
  • StyleMamba: State Space Model for Efficient Text-driven Image Style Transfer [arxiv]
  • VMambaCC: A Visual State Space Model for Crowd Counting [arxiv]
  • DVMSR: Distillated Vision Mamba for Efficient Super-Resolution [arxiv] [code]
  • SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion [arxiv]
  • Matten: Video Generation with Mamba-Attention [arxiv]
  • Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement [arxiv] [code]
  • MemoryMamba: Memory-Augmented State Space Model for Defect Recognition [arxiv]
  • SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising [arxiv] [code]
  • FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space [arxiv] [code]
  • CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation [arxiv] [code]
  • Mamba-FETrack: Frame-Event Tracking via State Space Model [arxiv] [code]
  • S2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification [arxiv] [code]
  • Spectral-Spatial Mamba for Hyperspectral Image Classification [arxiv]
  • RSCaMa: Remote Sensing Image Change Captioning with State Space Model [arxiv] [code]
  • Sparse Reconstruction of Optical Doppler Tomography Based on State Space Model [arxiv]
  • CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions [arxiv] [code]
  • Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model [arxiv]
  • MambaUIE: Unraveling the Ocean's Secrets with Only 2.8 FLOPs [arxiv] [code]
  • MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model [arxiv] [code]
  • CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration [arxiv]
  • MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye tracking [arxiv]
  • Text-controlled Motion Mamba: Text-Instructed Temporal Grounding of Human Motion [arxiv]
  • A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion [arxiv]
  • Fusion-Mamba for Cross-modality Object Detection [arxiv]
  • FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining [arxiv]
  • HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising [arxiv]
  • MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion [arxiv]
  • SpectralMamba: Efficient Mamba for Hyperspectral Image Classification [arxiv] [code]
  • Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos [arxiv]
  • DGMamba: Domain Generalization via Generalized State Space Model [arxiv] [code]
  • FusionMamba: Efficient Image Fusion with State Space Model [arxiv]
  • MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection [arxiv] [code]
  • 3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion [arxiv]
  • RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos [arxiv] [code]
  • Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation [arxiv] [code]
  • ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model [arxiv] [code]
  • InsectMamba: Insect Pest Classification with State Space Model [arxiv]
  • RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation [arxiv] [code]
  • RS-Mamba for Large Remote Sensing Image Dense Prediction [arxiv] [code]
  • Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model [arxiv] [code]
  • HSIMamba: Hyperpsectral Imaging Efficient Feature Learning with Bidirectional State Space for Classification [arxiv]
  • SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding [arxiv]
  • MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection [arxiv] [code]
  • Aggregating Local and Global Features via Selective State Spaces Model for Efficient Image Deblurring [arxiv]
  • HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM [arxiv]
  • RSMamba: Remote Sensing Image Classification with State Space Model [arxiv] [code]
  • Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction [arxiv]
  • Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion [arxiv]
  • PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition [arxiv] [code]
  • ReMamber: Referring Image Segmentation with Mamba Twister [arxiv]
  • VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting [arxiv] [code]
  • SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series [arxiv] [code]
  • Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference [arxiv] [code]
  • VL-Mamba: Exploring State Space Models for Multimodal Learning [arxiv]
  • ZigMa: Zigzag Mamba Diffusion Model [arxiv] [code]
  • VmambaIR: Visual State Space Model for Image Restoration [arxiv] [code]
  • LocalMamba: Visual State Space Model with Windowed Selective Scan [arxiv] [code]
  • MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models [arxiv]
  • Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding [arxiv] [code]
  • Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM [arxiv] [code]
  • Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy [arxiv] [code]
  • VideoMamba: State Space Model for Efficient Video Understanding [arxiv] [code]
  • MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection [arxiv] [code]
  • Point Could Mamba: Point Cloud Learning via State Space Model [arxiv] [code]
  • Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning [arxiv] [code]
  • MambaIR: A Simple Baseline for Image Restoration with State-Space Model [arxiv] [code]
  • Pan-Mamba: Effective pan-sharpening with State Space Model [arxiv] [code]
  • PointMamba: A Simple State Space Model for Point Cloud Analysis [arxiv] [code]
  • Scalable Diffusion Models with State Space Backbone [arxiv] [code]
  • Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data [arxiv]
  • Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model [arxiv] [code]
  • VMamba: Visual State Space Model [arxiv] [code]
  • U-shaped Vision Mamba for Single Image Dehazing [arxiv] [code]

Medical Imaging

  • MSV-Mamba: A Multiscale Vision Mamba Network for Echocardiography Segmentation [arxiv]
  • Merging Context Clustering with Visual State Space Models for Medical Image Segmentation [arxiv] [code]
  • HCMA-UNet: A Hybrid CNN-Mamba UNet with Inter-Slice Self-Attention for Efficient Breast Cancer Segmentation [arxiv] [code]
  • S3-Mamba: Small-Size-Sensitive Mamba for Lesion Segmentation [arxiv] [code]
  • SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation [arxiv] [code]
  • 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification [arxiv] [code]
  • MambaU-Lite: A Lightweight Model based on Mamba and Integrated Channel-Spatial Attention for Skin Lesion Segmentation [arxiv]
  • KAN-Mamba FusionNet: Redefining Medical Image Segmentation with Non-Linear Modeling [arxiv]
  • Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation [arxiv] [code]
  • MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation [arxiv] [code]
  • R2Gen-Mamba: A Selective State Space Model for Radiology Report Generation [arxiv] [code]
  • Taming Mambas for Voxel Level 3D Medical Image Segmentation [arxiv] [code]
  • UMambaAdj: Advancing GTV Segmentation for Head and Neck Cancer in MRI-Guided RT with UMamba and nnU-Net ResEnc Planner [arxiv]
  • MambaEviScrib: Mamba and Evidence-Guided Consistency Make CNN Work Robustly for Scribble-Based Weakly Supervised Ultrasound Image Segmentation [arxiv] [code]
  • DenoMamba: A fused state-space model for low-dose CT denoising [arxiv]
  • MambaRecon: MRI Reconstruction with Structured State Space Models [arxiv] [code]
  • MambaClinix: Hierarchical Gated Convolution and Mamba-Based U-Net for Enhanced 3D Medical Image Segmentation [arxiv] [code]
  • SPRMamba: Surgical Phase Recognition for Endoscopic Submucosal Dissection with Mamba [arxiv]
  • SkinMamba: A Precision Skin Lesion Segmentation Architecture with Cross-Scale Global State Modeling and Frequency Boundary Guidance [arxiv] [code]
  • MedSegMamba: 3D CNN-Mamba Hybrid Architecture for Brain Segmentation [arxiv]
  • Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images [arxiv] [code]
  • OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation [arxiv] [code]
  • Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters [arxiv] [code]
  • MpoxMamba: A Grouped Mamba-based Lightweight Hybrid Network for Mpox Detection [arxiv]
  • Serp-Mamba: Advancing High-Resolution Retinal Vessel Segmentation with Selective State-Space Model [arxiv]
  • Mamba2MIL: State Space Duality Based Multiple Instance Learning for Computational Pathology [arxiv] [code]
  • MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation [arxiv] [code]
  • ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation [arxiv]
  • LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation [arxiv] [code]
  • MambaMIM: Pre-training Mamba with State Space Token-interpolation [arxiv] [code]
  • BioMamba: A Pre-trained Biomedical Language Representation Model Leveraging Mamba [arxiv] [code]
  • Mamba? Catch The Hype Or Rethink What Really Helps for Image Registration [arxiv]
  • GFE-Mamba: Mamba-based AD Multi-modal Progression Assessment via Generative Feature Extraction from MCI [arxiv] [code]
  • SliceMamba for Medical Image Segmentation [arxiv]
  • SR-Mamba: Effective Surgical Phase Recognition with State Space Model [arxiv] [code]
  • Deform-Mamba Network for MRI Super-Resolution [arxiv]
  • Vision Mamba for Classification of Breast Ultrasound Images [arxiv]
  • MMR-Mamba: Multi-Contrast MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion [arxiv]
  • Soft Masked Mamba Diffusion Model for CT to MRI Conversion [arxiv] [code]
  • SEDMamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in Robot-Assisted Surgery [arxiv]
  • Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans [arxiv]
  • Convolution and Attention-Free Mamba-based Cardiac Image Segmentation [arxiv]
  • MUCM-Net: A Mamba Powered UCM-Net for Skin Lesion Segmentation [arxiv] [code]
  • I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling [arxiv]
  • VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis [arxiv]
  • HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation [arxiv]
  • AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation [arxiv] [code]
  • Vim4Path: Self-Supervised Vision Mamba for Histopathology Images [arxiv] [code]
  • FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba [arxiv] [code]
  • ViM-UNet: Vision Mamba for Biomedical Segmentation [arxiv] [code]
  • VMambaMorph: a Visual Mamba-based Framework with Cross-Scan Module for Deformable 3D Image Registration [arxiv] [code]
  • T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation [arxiv] [code]
  • Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation [arxiv]
  • H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation [arxiv] [code]
  • ProMamba: Prompt-Mamba for polyp segmentation [arxiv]
  • VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation [arxiv] [code]
  • MD-Dose: A diffusion model based on the Mamba for radiation dose prediction [arxiv] [code]
  • Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention [arxiv] [code]
  • MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology [arxiv] [code]
  • LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation [arxiv] [code]
  • MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models [arxiv]
  • MedMamba: Vision Mamba for Medical Image Classification [arxiv] [code]
  • MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation [arxiv] [code]
  • Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation [arxiv] [code]
  • P-Mamba: Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation [arxiv]
  • Semi-Mamba-UNet: Pixel-Level Contrastive Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation [arxiv] [code]
  • FD-Vision Mamba for Endoscopic Exposure Correction [arxiv] [code]
  • MambaMorph: a Mamba-based Backbone with Contrastive Feature Learning for Deformable MR-CT Registration [arxiv] [code]
  • Vivim: a Video Vision Mamba for Medical Video Object Segmentation [arxiv] [code]
  • U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation [arxiv] [code]
  • Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining [arxiv] [code]
  • nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model [arxiv] [code]
  • SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation [arxiv] [code]
  • VM-UNet: Vision Mamba UNet for Medical Image Segmentation [arxiv] [code]
  • Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation [arxiv] [code]

About

✨✨Latest Papers on Vision Mamba and Related Areas

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published