This paper aims to pre-study on finding events embedded in recent video datasets and transforming them into verbs. To this end, we need to look over conventional video datasets for human action and activity and then analyze the events embedded in video datasets. Finally we should also allow for transformation from events to verbs. As an early stage for this purpose, we investigate conventional and recently available visual datasets and analyze activities or actions embedded in those datasets in this paper