|Title||Performance database: capturing data for optimizing distributed streaming workflows|
|Publication Type||Journal Article|
|Year of Publication||2011|
|Authors||Liew, CS, Atkinson, MP, Ostrowski, R, Cole, M, van Hemert, JI, Han, L|
|Journal Title||Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences|
|Keywords||measurement framework; performance data; streaming workflows|
The performance database (PDB) stores performance-related data gathered during workflow enactment. We argue that by carefully understanding and manipulating this data, we can improve efficiency when enacting workflows. This paper describes the rationale behind the PDB, and proposes a systematic way to implement it. The prototype is built as part of the Advanced Data Mining and Integration Research for Europe project. We use workflows from real-world experiments to demonstrate the usage of PDB.