Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows

Bibliographic Information

Other Title
  • 調理動作後の物体の視覚的状態予測を目指した Visual Recipe Flow データセットの構築と評価

Description

<p>We present a new multimodal dataset called Visual Recipe Flow, which enables us to learn each cooking action result in a recipe text. The dataset consists of object state changes and the workflow of the recipe text. The state change is represented as an image pair, while the workflow is represented as a recipe flow graph (r-FG). We explain the data collection and annotation procedure and evaluate the dataset by measuring the inter-annotator agreement. Finally, we investigate the importance of each annotation component by conducting multi-modal information retrieval experiments. </p>

Journal

References(16)*help

See more

Details 詳細情報について

Report a problem

Back to top