"LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection"

The podcast on this paper is generated with Google's Illuminate.

SAM (Segment Anything Model) meets Mamba for multi-focus light field image processing

Combining SAM's strong features with Mamba's linear-complexity sequence modeling yields stronger salient object detection

https://arxiv.org/abs/2411.06652

Original Problem 🎯:

Light field cameras capture multi-focus images for 3D scene reconstruction, but detecting salient objects across these varying focal depths remains challenging. Current methods struggle with effectively integrating information from multiple focal slices while maintaining computational efficiency.

-----

Solution in this Paper 🛠️:

→ Introduces LFSamba - a two-stream encoder-decoder that combines SAM (Segment Anything Model) with Mamba architecture

→ Uses frozen SAM encoder with fine-tuned adapters to extract features from both focal slices and all-focus images

→ Implements Inter-Slice Mamba to model relationships between focal slices across different depth levels

→ Employs Inter-Modal Mamba to fuse focal slice features with all-focus features

→ Develops scribble annotation dataset for weakly supervised learning
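The Inter-Slice Mamba step above boils down to a linear-time scan over the ordered focal slices, so each slice's features absorb context from all depth levels without quadratic attention. A toy NumPy sketch of that idea (the function name, shapes, and gating rule are illustrative assumptions, not the paper's implementation):

```python
import numpy as np

def selective_scan(slice_feats, decay=0.9):
    """Toy linear-time scan over N focal-slice feature vectors.

    Each step mixes a running state with the current slice, so every
    output position sees all earlier slices at O(N) cost. This is the
    property that lets Mamba-style blocks replace O(N^2) attention
    when relating focal slices across depth levels.

    slice_feats: (num_slices, dim) array.
    Returns: (num_slices, dim) fused features.
    """
    state = np.zeros(slice_feats.shape[1])
    out = np.empty_like(slice_feats)
    for i, x in enumerate(slice_feats):
        # Input-dependent gate: a crude stand-in for Mamba's
        # selective (input-conditioned) state-space parameters.
        gate = 1.0 / (1.0 + np.exp(-x.mean()))
        state = decay * gate * state + (1.0 - decay * gate) * x
        out[i] = state
    return out

# Example: 4 focal slices, 8-dim features.
feats = np.random.default_rng(0).normal(size=(4, 8))
fused = selective_scan(feats)
```

The cost is one pass over the slices, so the sequence length (number of focal slices) enters linearly, which is the efficiency argument the paper makes for Mamba over attention.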

-----

Key Insights 🔍:

→ Mamba architecture effectively models long-range dependencies in focal slices with linear complexity

→ Combining SAM with adapters reduces computation while maintaining feature discrimination

→ Inter-Modal fusion between focal slices and all-focus images enhances mutual feature complementarity

→ Scribble annotations can replace dense pixel-level supervision
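Scribble supervision, as in the last insight above, usually works by restricting the loss to annotated pixels only. A minimal sketch of such a partial binary cross-entropy (the scribble encoding and function name here are assumptions; the paper's exact loss may differ):

```python
import numpy as np

def partial_bce(pred, scribble):
    """Binary cross-entropy over scribble-annotated pixels only.

    pred: predicted saliency probabilities in (0, 1), any shape.
    scribble: same shape; 1 = foreground stroke, 0 = background stroke,
              -1 = unlabeled. Unlabeled pixels contribute nothing to the
              loss, which is why sparse scribbles can stand in for dense
              pixel-level masks.
    """
    mask = scribble >= 0
    p = np.clip(pred[mask], 1e-7, 1 - 1e-7)
    y = scribble[mask].astype(float)
    return float(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))

pred = np.array([[0.9, 0.2],
                 [0.5, 0.5]])
scrib = np.array([[1, 0],
                  [-1, -1]])  # only the top two pixels are labeled
loss = partial_bce(pred, scrib)
```

Note that the two unlabeled pixels (predicted at 0.5) do not affect the loss at all; the network is only penalized where the annotator drew a stroke.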

-----

Results 📊:

→ Outperforms 15 fully-supervised methods across all metrics

→ 31% reduction in MAE values compared to other weakly supervised methods

→ 14% improvement in MAE using Mamba compared to alternative architectures

→ Maintains high performance while being computationally efficient
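MAE in the results above is the standard saliency-evaluation metric: the mean absolute difference between the predicted saliency map and the ground-truth mask, both in [0, 1], lower is better. A quick sketch:

```python
import numpy as np

def mae(pred, gt):
    """Mean Absolute Error between a predicted saliency map and the
    ground-truth mask, both valued in [0, 1]. Lower is better."""
    return float(np.mean(np.abs(pred - gt)))

# A 31% relative reduction, as reported above, means
# mae_new = 0.69 * mae_old (the absolute values are dataset-dependent
# and not quoted in this summary).
```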
