Segment objects in images using text prompts
Analyze images to detect objects, points, keypoints, or text