Abstract. Summary extract of heterogeneous multimedia documents is an important content and requirement of information management at the Internet era. Though some summary extract approaches have been proposed in recent years, in general they can only be used in a single-type media. More important is that the existed approaches of multimedia summary extract had no satisfactory effects, and can't be brought into practice. Aiming at these problems, an approach based on folksonomy with incentive and quality assurance mechanisms is put forward. The proven folksonomy ensures the practicality of the approach, the incentive and quality assurance mechanisms avoid the innate problems of folksonomy: cold boot and inaccurate tagging. The experiment shows that the approach is reasonable and effective.