技術摘要(英)
A multimodal method for detecting video includes following step of: receiving a message to be detected so as to obtain a multimodal result by a processor, which the message to be detected is corresponding to a video to be detected; generating a plurality of detecting conditions according to the multimodal result by the processor; searching a plurality of videos in a video detection database so as to obtain at least one target video in the plurality of videos according to the plurality of detecting conditions by the processor, which each of the plurality of videos includes a plurality of video paragraphs respectively, which each of the plurality of video paragraphs includes a piece of corresponding multimodal related data respectively; comparing the plurality of detecting conditions and the piece of corresponding multimodal related data of the plurality of video paragraphs so as to obtain a matching video paragraph and use a video corresponding to the matching video paragraph as the at least one target video by the processor; and outputting the at least one target video and the video to be detected to a display device for display by the processor.