Dynamic stimuli are ubiquitous in natural viewing conditions implying that grouping operations need to operate, not only in space, but also jointly in space and time. Moreover, in natural viewing, attention plays an important role in controlling how resources are allocated. We investigated how attention interacts with spatio-temporal perceptual grouping by using a bistable stimulus, called the Ternus-Pikler display. Ternus-Pikler displays can give rise to two different motion percepts, called Element Motion (EM) and Group Motion (GM), the former dominating at short Inter-Stimulus Intervals (ISIs) and the latter at long ISIs. Our results indicate that GM grouping requires more attentional resources than EM grouping. Different theoretical accounts of perceptual grouping and attention are discussed and evaluated in the light of the current results. (C) 2011 Elsevier Ltd. All rights reserved.