Question 1

What types of prompts does SAM support?

Accepted Answer

SAM accepts three kinds of prompts: foreground/background points, bounding boxes, and masks. Text prompts are explored in the SAM paper, but that capability has not been released.
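To make the point and box prompts concrete, here is a minimal sketch in plain Python. The `Prompt` class and its field names are hypothetical and are not part of SAM's API. The conventions it assumes (label 1 for a foreground point, 0 for a background point, boxes given as x_min, y_min, x_max, y_max in pixels) match the public `segment_anything` package as far as I know, but check its documentation before relying on them. Mask prompts are left out to keep the sketch short.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

FOREGROUND, BACKGROUND = 1, 0  # assumed label convention


@dataclass
class Prompt:
    """Hypothetical container for point and box prompts."""
    points: List[Tuple[float, float]] = field(default_factory=list)  # (x, y) in pixels
    labels: List[int] = field(default_factory=list)                  # one label per point
    box: Optional[Tuple[float, float, float, float]] = None          # x_min, y_min, x_max, y_max

    def validate(self) -> None:
        """Raise ValueError if the prompt is malformed."""
        if len(self.points) != len(self.labels):
            raise ValueError("each point needs exactly one label")
        if any(label not in (FOREGROUND, BACKGROUND) for label in self.labels):
            raise ValueError("labels must be 1 (foreground) or 0 (background)")
        if self.box is not None:
            x0, y0, x1, y1 = self.box
            if x1 <= x0 or y1 <= y0:
                raise ValueError("box must have positive width and height")
        if not self.points and self.box is None:
            raise ValueError("provide at least one point or a box")


# One click on the object, one click on the background, plus a rough box.
prompt = Prompt(
    points=[(320.0, 240.0), (50.0, 60.0)],
    labels=[FOREGROUND, BACKGROUND],
    box=(100.0, 80.0, 500.0, 400.0),
)
prompt.validate()
```

Combining a box with a background point, as above, is a common way to exclude a region that the box alone would include.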

Question 2

What is the structure of the SAM model?

Accepted Answer

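SAM has three components: a ViT-H image encoder that runs once per image and outputs an image embedding, a prompt encoder that embeds input prompts such as clicks or boxes, and a lightweight transformer-based mask decoder that predicts object masks from the image and prompt embeddings.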

Question 3

Can SAM integrate with other systems?

Accepted Answer

Yes, SAM can take input prompts from other systems such as gaze tracking from AR/VR headsets or bounding box prompts from object detectors.
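As a rough illustration of the object-detector case, the sketch below turns detector output into box prompts. Everything here is an assumption for the sake of the example: the detection format (a dict with a `bbox` in x, y, width, height and a `score`), the function names, and the 0.5 score threshold are all hypothetical. The only SAM-related assumption is that box prompts are expected in x_min, y_min, x_max, y_max order.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]


def xywh_to_xyxy(box: Box) -> Box:
    """Convert (x, y, width, height) to (x_min, y_min, x_max, y_max)."""
    x, y, w, h = box
    return (x, y, x + w, y + h)


def detections_to_box_prompts(detections: List[Dict], min_score: float = 0.5) -> List[Box]:
    """Keep confident detections and convert their boxes to corner format."""
    return [
        xywh_to_xyxy(d["bbox"])
        for d in detections
        if d["score"] >= min_score
    ]


# Hypothetical detector output: one confident detection, one weak one.
detections = [
    {"bbox": (10.0, 20.0, 100.0, 50.0), "score": 0.9},
    {"bbox": (0.0, 0.0, 30.0, 30.0), "score": 0.2},
]
box_prompts = detections_to_box_prompts(detections)
```

Each resulting box would then be passed to the segmentation model as a separate prompt.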

Question 4

Is SAM capable of zero-shot generalization?

Accepted Answer

Yes, SAM can generalize to unfamiliar objects and images without requiring additional training.

Question 5

What kind of training data was used for SAM?

Accepted Answer

SAM was trained on the SA-1B dataset, which includes over 1.1 billion segmentation masks from approximately 11 million images.
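A quick calculation puts those figures in perspective. Both numbers are the rounded values quoted above, so the result is only an order-of-magnitude estimate.

```python
# Rounded figures from the answer above.
total_masks = 1_100_000_000   # "over 1.1 billion"
total_images = 11_000_000     # "approximately 11 million"

masks_per_image = total_masks / total_images
print(f"about {masks_per_image:.0f} masks per image on average")
```

In other words, each image in SA-1B carries on the order of a hundred masks.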

Question 6

How long does it take for SAM to perform inference?

Accepted Answer

The image encoder takes about 0.15 seconds per image on an NVIDIA A100 GPU. The prompt encoder and mask decoder together take about 50 ms per prompt on a CPU.
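The effect of these two numbers on interactive use can be estimated with a small calculation. This sketch assumes the encoder runs once per image and the decoder runs once per prompt, and it ignores data transfer and any other overhead; the function names are made up for the example.

```python
ENCODER_MS = 150  # one-time image encoding, figure quoted above
DECODER_MS = 50   # prompt encoder + mask decoder, per prompt


def total_latency_ms(num_prompts: int) -> int:
    """Total time to encode one image and answer num_prompts prompts."""
    return ENCODER_MS + num_prompts * DECODER_MS


def average_latency_ms(num_prompts: int) -> float:
    """Average time per prompt, with the encoder cost spread across prompts."""
    return total_latency_ms(num_prompts) / num_prompts


for n in (1, 10, 100):
    print(n, total_latency_ms(n), round(average_latency_ms(n), 1))
```

As the number of prompts on the same image grows, the average cost per prompt approaches the decoder time alone.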

Question 7

Does SAM work on videos?

Accepted Answer

Not directly. SAM currently supports only still images, so video can be handled only by segmenting individual frames one at a time.

Question 8

How is SAM's model designed for efficiency?

Accepted Answer

SAM is split into two stages: an image encoder that runs once per image, and a lightweight mask decoder that runs once per prompt. The decoder is small enough to run in a web browser in a few milliseconds per prompt.
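The design can be sketched as a cache around the expensive step. This is not SAM's code: `CachedSegmenter`, `fake_encode`, and `fake_decode` are hypothetical stand-ins that only show the pattern of encoding an image once and reusing the embedding for every prompt.

```python
from typing import Any, Callable, Dict, Hashable


class CachedSegmenter:
    """Run a costly encoder once per image, then a cheap decoder per prompt."""

    def __init__(self, encode: Callable[[Any], Any], decode: Callable[[Any, Any], Any]):
        self._encode = encode
        self._decode = decode
        self._embeddings: Dict[Hashable, Any] = {}
        self.encoder_calls = 0  # counted only to demonstrate the caching

    def segment(self, image_id: Hashable, image: Any, prompt: Any) -> Any:
        # Encode only the first time this image is seen.
        if image_id not in self._embeddings:
            self._embeddings[image_id] = self._encode(image)
            self.encoder_calls += 1
        return self._decode(self._embeddings[image_id], prompt)


# Stand-ins for the real models (assumptions, for illustration only).
def fake_encode(image):
    return sum(image)


def fake_decode(embedding, prompt):
    return (embedding, prompt)


segmenter = CachedSegmenter(fake_encode, fake_decode)
results = [segmenter.segment("img-0", [1, 2, 3], p) for p in [(1, 1), (2, 2), (3, 3)]]
```

Three prompts on the same image trigger a single encoder call; only the decoder runs three times.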

Question 9

What platforms support SAM?

Accepted Answer

The image encoder is implemented in PyTorch and needs a GPU for efficient inference. The prompt encoder and mask decoder can run directly in PyTorch, or be converted to ONNX and run on CPU or GPU on any platform that supports ONNX Runtime.

Question 10

What is the size of the SAM model?

Accepted Answer

The image encoder has 632 million parameters. The prompt encoder and mask decoder together have 4 million parameters.
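Parameter counts translate into an approximate weight size once a numeric precision is chosen. The sketch below assumes 4 bytes per parameter (32-bit floats) or 2 bytes (16-bit floats); the actual precision and on-disk checkpoint size are not stated above, so treat the results as estimates. The function name is made up for the example.

```python
IMAGE_ENCODER_PARAMS = 632_000_000
PROMPT_AND_DECODER_PARAMS = 4_000_000


def weights_size_mb(num_params: int, bytes_per_param: int = 4) -> float:
    """Approximate weight size in megabytes (10**6 bytes)."""
    return num_params * bytes_per_param / 1_000_000


encoder_fp32 = weights_size_mb(IMAGE_ENCODER_PARAMS)        # assumes float32
encoder_fp16 = weights_size_mb(IMAGE_ENCODER_PARAMS, 2)     # assumes float16
decoder_fp32 = weights_size_mb(PROMPT_AND_DECODER_PARAMS)   # assumes float32

encoder_share = IMAGE_ENCODER_PARAMS / (IMAGE_ENCODER_PARAMS + PROMPT_AND_DECODER_PARAMS)
print(encoder_fp32, encoder_fp16, decoder_fp32, f"{encoder_share:.1%}")
```

Under these assumptions the image encoder accounts for more than 99% of the parameters, which is why only the small prompt encoder and mask decoder are practical to run on a CPU.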

Segment Anything By Meta

Use Cases

Graphic Designers

Video Editors

AR/VR Developers

Researchers

3D Modelers

AI Developers

Photographers

Digital Artists

Social Media Managers

Educators
