Awesome-Vision-Mamba

✨✨Latest Papers on Vision Mamba and Related Areas

Survey

Mamba in Vision: A Comprehensive Survey of Techniques and Applications [arxiv]
Vision Mamba: A Comprehensive Survey and Taxonomy [arxiv]
A Survey on Vision Mamba: Models, Applications and Challenges [arxiv]
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges [arxiv]
A Survey on Visual Mamba [arxiv]
State Space Model for New-Generation Network Alternative to Transformers: A Survey [arxiv]

Computer Vision

Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion [arxiv] [code]
Mamba-MOC: A Multicategory Remote Object Counting via State Space Model [arxiv] [code]
MS-Temba: Multi-Scale Temporal Mamba for Efficient Temporal Action Detection [arxiv] [code]
MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification [arxiv] [code]
Detail Matters: Mamba-Inspired Joint Unfolding Network for Snapshot Spectral Compressive Imaging [arxiv] [code]
DepthMamba with Adaptive Fusion [arxiv]
MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration [arxiv]
MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing [arxiv]
STNMamba: Mamba-based Spatial-Temporal Normality Learning for Video Anomaly Detection [arxiv]
PTQ4VM: Post-Training Quantization for Visual Mamba [arxiv] [code]
Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization [arxiv]
COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection [arxiv] [code]
V“Mean”ba: Visual State Space Models only need 1 hidden dimension [arxiv]
FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation [arxiv]
Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor [arxiv]
Trusted Mamba Contrastive Network for Multi-View Clustering [arxiv]
Multi-dimensional Visual Prompt Enhanced Image Restoration via Mamba-Transformer Aggregation [arxiv] [code]
Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks [arxiv]
Efficient Self-Supervised Video Hashing with Selective State Spaces [arxiv] [code]
Robust Tracking via Mamba-based Context-aware Token Learning [arxiv] [code]
MambaLCT: Boosting Tracking via Long-term Context State Space Model [arxiv] [code]
Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training [arxiv]
MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt [arxiv] [code]
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation [arxiv] [code]
Image Forgery Localization with State Space Models [arxiv]
XYScanNet: An Interpretable State Space Model for Perceptual Image Deblurring [arxiv]
Selective Visual Prompting in Vision Mamba [arxiv] [code]
MPSI: Mamba enhancement model for pixel-wise sequential interaction Image Super-Resolution [arxiv]
LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba [arxiv]
Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence [arxiv] [code]
MSCrackMamba: Leveraging Vision Mamba for Crack Detection in Fused Multispectral Imagery [arxiv]
MambaNUT: Nighttime UAV Tracking via Mamba and Adaptive Curriculum Learning [arxiv]
AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment [arxiv]
MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection [arxiv]
Vision Mamba Distillation for Low-resolution Fine-grained Image Classification [arxiv] [code]
BadScan: An Architectural Backdoor Attack on Visual State Space Models [arxiv]
FTMoMamba: Motion Generation with Frequency and Text State Space Models [arxiv]
TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba [arxiv] [code]
Deformable Mamba for Wide Field of View Segmentation [arxiv] [code]
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network [arxiv] [code]
MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking [arxiv]
Mamba-CL: Optimizing Selective State Space Model in Null Space for Continual Learning [arxiv]
MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking [arxiv]
Event USKT: U-State Space Model in Knowledge Transfer for Event Cameras [arxiv]
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality [arxiv] [code]
OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction [arxiv]
MambaIRv2: Attentive State Space Restoration [arxiv] [code]
MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection [arxiv]
M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction [arxiv] [code]
S3Mamba: Arbitrary-Scale Super-Resolution via Scaleable State Space Model [arxiv] [code]
RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model [arxiv]
MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba [arxiv]
ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal [arxiv]
Adaptive Multi Scale Document Binarisation Using Vision Mamba [arxiv]
ECMamba: Consolidating Selective State Space Model with Retinex Guidance for Efficient Multiple Exposure Correction [arxiv] [code]
SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition [arxiv] [code]
MambaSOD: Dual Mamba-Driven Cross-Modal Fusion Network for RGB-D Salient Object Detection [arxiv] [code]
Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion [arxiv] [code]
MBPU: A Plug-and-Play State Space Model for Point Cloud Upsampling with Fast Point Rendering [arxiv]
START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation [arxiv] [code]
MambaSCI: Efficient Mamba-UNet for Quad-Bayer Patterned Video Snapshot Compressive Imaging [arxiv] [code]
RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images [arxiv]
MambaPainter: Neural Stroke-Based Rendering in a Single Step [arxiv] [code]
MambaBEV: An efficient 3D detection model with Mamba2 [arxiv]
Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution [arxiv]
GlobalMamba: Global Image Serialization for Vision Mamba [arxiv] [code]
V2M: Visual 2-Dimensional Mamba for Image Representation Learning [arxiv] [code]
CountMamba: Exploring Multi-directional Selective State-Space Models for Plant Counting [arxiv]
MatMamba: A Matryoshka State Space Model [arxiv] [code]
EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment [arxiv]
Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching [arxiv]
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model [arxiv] [code]
HRVMamba: High-Resolution Visual State Space Model for Dense Prediction [arxiv] [code]
Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection [arxiv] [code]
IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification [arxiv]
Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking [arxiv] [code]
Exploring Token Pruning in Vision State Space Models [arxiv]
Hybrid Mamba for Few-Shot Segmentation [arxiv] [code]
MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation [arxiv]
MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining [arxiv]
Path-adaptive Spatio-Temporal State Space Model for Event-based Recognition with Arbitrary Duration [arxiv] [code]
DepMamba: Progressive Fusion Mamba for Multimodal Depression Detection [arxiv] [code]
GraspMamba: A Mamba-based Language-driven Grasp Detection Framework with Hierarchical Feature Learning [arxiv]
Mamba Fusion: Learning Actions Through Questioning [arxiv] [code]
PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba [arxiv] [code]
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks [arxiv] [code]
Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion [arxiv]
Mamba-ST: State Space Model for Efficient Style Transfer [arxiv] [code]
SITSMamba for Crop Classification based on Satellite Image Time Series [arxiv] [code]
CoMamba: Real-time Cooperative Perception Unlocked with State Space Models [arxiv]
Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation [arxiv]
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection [arxiv]
CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model [arxiv]
Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models [arxiv] [code]
PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation [arxiv]
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling [arxiv]
Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations [arxiv] [code]
Why mamba is effective? Exploit Linear Transformer-Mamba Network for Multi-Modality Image Fusion [arxiv]
UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images [arxiv]
Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion [arxiv]
FMRFT: Fusion Mamba and DETR for Query Time Sequence Intersection Fish Tracking [arxiv]
EDCSSM: Edge Detection with Convolutional State Space Model [arxiv]
Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training [arxiv] [code]
MambaPlace: Text-to-Point-Cloud Cross-Modal Place Recognition with Attention Mamba Mechanisms [arxiv] [code]
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning [arxiv] [code]
MTMamba++: Enhancing Multi-Task Dense Scene Understanding via Mamba-Based Decoders [arxiv] [code]
PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model [arxiv]
O-Mamba: O-shape State-Space Model for Underwater Image Enhancement [arxiv] [code]
MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering [arxiv] [code]
UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images [arxiv] [code]
Exploring Robustness of Visual State Space model against Backdoor Attacks [arxiv]
MambaCSR: Dual-Interleaved Scanning for Compressed Image Super-Resolution With SSMs [arxiv]
MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model [arxiv]
ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement [arxiv] [code]
MambaLoc: Efficient Camera Localisation via State Space Model [arxiv]
OccMamba: Semantic Occupancy Prediction with State Space Models [arxiv]
Multi-Scale Representation Learning for Image Restoration with State-Space Model [arxiv]
MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling [arxiv]
MambaEVT: Event Stream based Visual Object Tracking using State Space Model [arxiv] [code]
MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval [arxiv] [code]
DemMamba: Alignment-free Raw Video Demoireing with Frequency-assisted Spatio-Temporal Mamba [arxiv]
QMambaBSR: Burst Image Super-Resolution with Query State Space Model [arxiv]
RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba [arxiv]
ColorMamba: Towards High-quality NIR-to-RGB Spectral Translation with Mamba [arxiv] [code]
MambaVT: Spatio-Temporal Contextual Modeling for robust RGB-T Tracking [arxiv]
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model [arxiv]
Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network [arxiv]
JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Language Model [arxiv]
DeMansia: Mamba Never Forgets Any Tokens [arxiv] [code]
LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba [arxiv]
MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection [arxiv] [code]
PhysMamba: Leveraging Dual-Stream Cross-Attention SSD for Remote Physiological Measurement [arxiv]
WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification [arxiv] [code]
Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement [arxiv] [code]
Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification [arxiv] [code]
MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection [arxiv]
RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining [arxiv] [code]
ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2 [arxiv] [code]
VSSD: Vision Mamba with Non-Causal State Space Duality [arxiv] [code]
LION: Linear Group RNN for 3D Object Detection in Point Clouds [arxiv] [code]
ALMRR: Anomaly Localization Mamba on Industrial Textured Surface with Feature Reconstruction and Refinement [arxiv] [code]
Mamba meets crack segmentation [arxiv]
Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model [arxiv]
GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [arxiv] [code]
InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation [arxiv] [code]
OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting [arxiv]
A Mamba-based Siamese Network for Remote Sensing Change Detection [arxiv] [code]
HTD-Mamba: Efficient Hyperspectral Target Detection with Pyramid State Space Model [arxiv] [code]
MambaVision: A Hybrid Mamba-Transformer Vision Backbone [arxiv] [code]
DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing [arxiv] [code]
GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification [arxiv] [code]
VideoMamba: Spatio-Temporal Selective State Space Model [arxiv]
Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning [arxiv] [code]
QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024 [arxiv]
MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders [arxiv] [code]
VFIMamba: Video Frame Interpolation with State Space Models [arxiv] [code]
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model [arxiv] [code]
VideoMambaPro: A Leap Forward for Mamba in Video Understanding [arxiv] [code]
Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model [arxiv]
SUM: Saliency Unification through Mamba for Visual Attention Modeling [arxiv] [code]
Vision Mamba-based autonomous crack segmentation on concrete, asphalt, and masonry surfaces [arxiv]
LFMamba: Light Field Image Super-Resolution with State Space Model [arxiv]
Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection [arxiv] [code]
PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery [arxiv] [code]
Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment [arxiv]
PixMamba: Leveraging State Space Models in a Dual-Level Architecture for Underwater Image Enhancement [arxiv] [code]
Towards Evaluating the Robustness of Visual State Space Models [arxiv] [code]
DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification [arxiv]
Autoregressive Pretraining with Mamba in Vision [arxiv] [code]
MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation [arxiv] [code]
Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs [arxiv]
MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba [arxiv] [code]
HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model [arxiv] [code]
Mamba YOLO: SSMs-Based YOLO For Object Detection [arxiv] [code]
MVGamba: Unify 3D Content Generation as State Space Sequence Modeling [arxiv]
RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation [arxiv] [code]
GrootVL: Tree Topology is All You Need in State Space Model [arxiv] [code]
CDMamba: Remote Sensing Image Change Detection with Mamba [arxiv] [code]
LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network [arxiv]
Dimba: Transformer-Mamba Diffusion Models [arxiv] [code]
S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion [arxiv]
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark [arxiv] [code]
FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining [arxiv]
Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain [arxiv] [code]
MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space [arxiv] [code]
Image Deraining with Frequency-Enhanced State Space Model [arxiv]
Demystify Mamba in Vision: A Linear Attention Perspective [arxiv] [code]
MambaVC: Learned Visual Compression with Selective State Spaces [arxiv]
PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis [arxiv] [code]
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models [arxiv] [code]
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model [arxiv] [code]
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis [arxiv]
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models [arxiv]
Scalable Visual State Space Model with Fractal Scanning [arxiv]
Efficient Visual State Space Model for Image Deblurring [arxiv]
Mamba®: Vision Mamba ALSO Needs Registers [arxiv] [code]
3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification [arxiv]
Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification [arxiv] [code]
CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation [arxiv] [code]
IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model [arxiv] [code]
RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing [arxiv]
WaterMamba: Visual State Space Model for Underwater Image Enhancement [arxiv]
Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study [arxiv]
MambaOut: Do We Really Need Mamba for Vision? [arxiv] [code]
OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition [arxiv]
Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba [arxiv]
Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution [arxiv]
StyleMamba: State Space Model for Efficient Text-driven Image Style Transfer [arxiv]
VMambaCC: A Visual State Space Model for Crowd Counting [arxiv]
DVMSR: Distillated Vision Mamba for Efficient Super-Resolution [arxiv] [code]
SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion [arxiv]
Matten: Video Generation with Mamba-Attention [arxiv]
Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement [arxiv] [code]
MemoryMamba: Memory-Augmented State Space Model for Defect Recognition [arxiv]
SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising [arxiv] [code]
FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space [arxiv] [code]
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation [arxiv] [code]
Mamba-FETrack: Frame-Event Tracking via State Space Model [arxiv] [code]
S2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification [arxiv] [code]
Spectral-Spatial Mamba for Hyperspectral Image Classification [arxiv]
RSCaMa: Remote Sensing Image Change Captioning with State Space Model [arxiv] [code]
Sparse Reconstruction of Optical Doppler Tomography Based on State Space Model [arxiv]
CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions [arxiv] [code]
Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model [arxiv]
MambaUIE: Unraveling the Ocean's Secrets with Only 2.8 FLOPs [arxiv] [code]
MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model [arxiv] [code]
CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration [arxiv]
MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye tracking [arxiv]
Text-controlled Motion Mamba: Text-Instructed Temporal Grounding of Human Motion [arxiv]
A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion [arxiv]
Fusion-Mamba for Cross-modality Object Detection [arxiv]
FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining [arxiv]
HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising [arxiv]
MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion [arxiv]
SpectralMamba: Efficient Mamba for Hyperspectral Image Classification [arxiv] [code]
Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos [arxiv]
DGMamba: Domain Generalization via Generalized State Space Model [arxiv] [code]
FusionMamba: Efficient Image Fusion with State Space Model [arxiv]
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection [arxiv] [code]
3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion [arxiv]
RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos [arxiv] [code]
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation [arxiv] [code]
ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model [arxiv] [code]
InsectMamba: Insect Pest Classification with State Space Model [arxiv]
RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation [arxiv] [code]
RS-Mamba for Large Remote Sensing Image Dense Prediction [arxiv] [code]
Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model [arxiv] [code]
HSIMamba: Hyperspectral Imaging Efficient Feature Learning with Bidirectional State Space for Classification [arxiv]
SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding [arxiv]
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection [arxiv] [code]
Aggregating Local and Global Features via Selective State Spaces Model for Efficient Image Deblurring [arxiv]
HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM [arxiv]
RSMamba: Remote Sensing Image Classification with State Space Model [arxiv] [code]
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction [arxiv]
Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Lesion [arxiv]
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition [arxiv] [code]
ReMamber: Referring Image Segmentation with Mamba Twister [arxiv]
VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting [arxiv] [code]
SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series [arxiv] [code]
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference [arxiv] [code]
VL-Mamba: Exploring State Space Models for Multimodal Learning [arxiv]
ZigMa: Zigzag Mamba Diffusion Model [arxiv] [code]
VmambaIR: Visual State Space Model for Image Restoration [arxiv] [code]
LocalMamba: Visual State Space Model with Windowed Selective Scan [arxiv] [code]
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models [arxiv]
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding [arxiv] [code]
Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM [arxiv] [code]
Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy [arxiv] [code]
VideoMamba: State Space Model for Efficient Video Understanding [arxiv] [code]
MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection [arxiv] [code]
Point Cloud Mamba: Point Cloud Learning via State Space Model [arxiv] [code]
Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning [arxiv] [code]
MambaIR: A Simple Baseline for Image Restoration with State-Space Model [arxiv] [code]
Pan-Mamba: Effective pan-sharpening with State Space Model [arxiv] [code]
PointMamba: A Simple State Space Model for Point Cloud Analysis [arxiv] [code]
Scalable Diffusion Models with State Space Backbone [arxiv] [code]
Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data [arxiv]
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model [arxiv] [code]
VMamba: Visual State Space Model [arxiv] [code]
U-shaped Vision Mamba for Single Image Dehazing [arxiv] [code]

Medical Imaging

MSV-Mamba: A Multiscale Vision Mamba Network for Echocardiography Segmentation [arxiv]
Merging Context Clustering with Visual State Space Models for Medical Image Segmentation [arxiv] [code]
HCMA-UNet: A Hybrid CNN-Mamba UNet with Inter-Slice Self-Attention for Efficient Breast Cancer Segmentation [arxiv] [code]
S3-Mamba: Small-Size-Sensitive Mamba for Lesion Segmentation [arxiv] [code]
SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation [arxiv] [code]
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification [arxiv] [code]
MambaU-Lite: A Lightweight Model based on Mamba and Integrated Channel-Spatial Attention for Skin Lesion Segmentation [arxiv]
KAN-Mamba FusionNet: Redefining Medical Image Segmentation with Non-Linear Modeling [arxiv]
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation [arxiv] [code]
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation [arxiv] [code]
R2Gen-Mamba: A Selective State Space Model for Radiology Report Generation [arxiv] [code]
Taming Mambas for Voxel Level 3D Medical Image Segmentation [arxiv] [code]
UMambaAdj: Advancing GTV Segmentation for Head and Neck Cancer in MRI-Guided RT with UMamba and nnU-Net ResEnc Planner [arxiv]
MambaEviScrib: Mamba and Evidence-Guided Consistency Make CNN Work Robustly for Scribble-Based Weakly Supervised Ultrasound Image Segmentation [arxiv] [code]
DenoMamba: A fused state-space model for low-dose CT denoising [arxiv]
MambaRecon: MRI Reconstruction with Structured State Space Models [arxiv] [code]
MambaClinix: Hierarchical Gated Convolution and Mamba-Based U-Net for Enhanced 3D Medical Image Segmentation [arxiv] [code]
SPRMamba: Surgical Phase Recognition for Endoscopic Submucosal Dissection with Mamba [arxiv]
SkinMamba: A Precision Skin Lesion Segmentation Architecture with Cross-Scale Global State Modeling and Frequency Boundary Guidance [arxiv] [code]
MedSegMamba: 3D CNN-Mamba Hybrid Architecture for Brain Segmentation [arxiv]
Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images [arxiv] [code]
OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation [arxiv] [code]
Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters [arxiv] [code]
MpoxMamba: A Grouped Mamba-based Lightweight Hybrid Network for Mpox Detection [arxiv]
Serp-Mamba: Advancing High-Resolution Retinal Vessel Segmentation with Selective State-Space Model [arxiv]
Mamba2MIL: State Space Duality Based Multiple Instance Learning for Computational Pathology [arxiv] [code]
MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation [arxiv] [code]
ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation [arxiv]
LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation [arxiv] [code]
MambaMIM: Pre-training Mamba with State Space Token-interpolation [arxiv] [code]
BioMamba: A Pre-trained Biomedical Language Representation Model Leveraging Mamba [arxiv] [code]
Mamba? Catch The Hype Or Rethink What Really Helps for Image Registration [arxiv]
GFE-Mamba: Mamba-based AD Multi-modal Progression Assessment via Generative Feature Extraction from MCI [arxiv] [code]
SliceMamba for Medical Image Segmentation [arxiv]
SR-Mamba: Effective Surgical Phase Recognition with State Space Model [arxiv] [code]
Deform-Mamba Network for MRI Super-Resolution [arxiv]
Vision Mamba for Classification of Breast Ultrasound Images [arxiv]
MMR-Mamba: Multi-Contrast MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion [arxiv]
Soft Masked Mamba Diffusion Model for CT to MRI Conversion [arxiv] [code]
SEDMamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in Robot-Assisted Surgery [arxiv]
Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans [arxiv]
Convolution and Attention-Free Mamba-based Cardiac Image Segmentation [arxiv]
MUCM-Net: A Mamba Powered UCM-Net for Skin Lesion Segmentation [arxiv] [code]
I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling [arxiv]
VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis [arxiv]
HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation [arxiv]
AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation [arxiv] [code]
Vim4Path: Self-Supervised Vision Mamba for Histopathology Images [arxiv] [code]
FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba [arxiv] [code]
ViM-UNet: Vision Mamba for Biomedical Segmentation [arxiv] [code]
VMambaMorph: a Visual Mamba-based Framework with Cross-Scan Module for Deformable 3D Image Registration [arxiv] [code]
T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation [arxiv] [code]
Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation [arxiv]
H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation [arxiv] [code]
ProMamba: Prompt-Mamba for polyp segmentation [arxiv]
VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation [arxiv] [code]
MD-Dose: A diffusion model based on the Mamba for radiation dose prediction [arxiv] [code]
Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention [arxiv] [code]
MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology [arxiv] [code]
LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation [arxiv] [code]
MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models [arxiv]
MedMamba: Vision Mamba for Medical Image Classification [arxiv] [code]
MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation [arxiv] [code]
Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation [arxiv] [code]
P-Mamba: Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation [arxiv]
Semi-Mamba-UNet: Pixel-Level Contrastive Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation [arxiv] [code]
FD-Vision Mamba for Endoscopic Exposure Correction [arxiv] [code]
MambaMorph: a Mamba-based Backbone with Contrastive Feature Learning for Deformable MR-CT Registration [arxiv] [code]
Vivim: a Video Vision Mamba for Medical Video Object Segmentation [arxiv] [code]
U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation [arxiv] [code]
Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining [arxiv] [code]
nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model [arxiv] [code]
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation [arxiv] [code]
VM-UNet: Vision Mamba UNet for Medical Image Segmentation [arxiv] [code]
Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation [arxiv] [code]

ReaFly/Awesome-Vision-Mamba