In this paper, we propose “Multimedia Montage”, which is the structural synthesis in time and space of multimedia components, such as movies and sounds, as a new image expression method for communications. In this study, we introduce the counterpoint theory in music to compose movie structures, and we use scripts to describe the structures. The nature of counterpoints lies in the parallelism of autonomous elements and they fit today's multimedia technologies very well. We confirm the effectiveness of our method by making example movies based on the counterpoint theory, developing a prototype system, and conducting movie synthesis experiments using the prototype. Our next goal is to achieve automatic synchronization among multimedia components based on their internal rhythms, which we have observed in experiments.