M. Banikazemi, V. Moorthy, and D. K. Panda, Efficient collective communication on heterogeneous networks of workstations, Proceedings of the 27th International Conference on Parallel Processing (ICPP'98), 1998.

O. Beaumont, A. Legrand, L. Marchal, and Y. Robert, Assessing the impact and limits of steady-state scheduling for mixed task and data parallelism on heterogeneous platforms, HeteroPar'2004: International Conference on Heterogeneous Computing, jointly published with ISPDC'2004: International Symposium on Parallel and Distributed Computing, pp.296-302, 2004.
URL : https://hal.archives-ouvertes.fr/hal-00789444

A. Benoit and Y. Robert, Mapping pipeline skeletons onto heterogeneous platforms, J. Parallel Distributed Computing, vol.68, issue.6, pp.790-808, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00118921

M. Beynon, A. Sussman, U. Catalyurek, T. Kurc, and J. Saltz, Performance optimization for data intensive grid applications, PProceedings of the Third Annual International Workshop on Active Middleware Services (AMS'01), 2001.

M. D. Beynon, T. Kurc, A. Sussman, and J. Saltz, Optimizing execution of component-based applications using group instances, Future Generation Computer Systems, vol.18, issue.4, pp.435-448, 2002.

P. Bhat, C. Raghavendra, and V. Prasanna, Efficient collective communication in distributed heterogeneous systems, ICDCS'99 19th International Conference on Distributed Computing Systems, pp.15-24, 1999.

P. Bhat, C. Raghavendra, and V. Prasanna, Efficient collective communication in distributed heterogeneous systems, Journal of Parallel and Distributed Computing, vol.63, pp.251-263, 2003.

C. Consel, H. Hamdi, L. Réveilì-ere, L. Singaravelu, H. Yu et al., Spidle: a DSL approach to specifying streaming applications, Proc. 2nd Int. Conf. on Generative Programming and Component Engineering, pp.1-17, 2003.

, DataCutter Project: Middleware for Filtering Large Archival Scientific Datasets in a Grid Environment

M. R. Garey and D. S. Johnson, Computers and Intractability, a Guide to the Theory of NP-Completeness, 1979.

W. Gropp, E. Lusk, N. Doss, and A. Skjellum, A high-performance, portable implementation of the MPI message passing interface standard, Parallel Computing, vol.22, issue.6, pp.789-828, 1996.

J. Gummaraju, J. Coburn, Y. Turner, and M. Rosenblum, Streamware: programming generalpurpose multicore processors using streams, Proc. 13th Int. Conf. on Architectural Support for Programming Languages and Operating Systems ASPLOS, pp.297-307, 2008.

B. Hong and V. Prasanna, Bandwidth-aware resource allocation for heterogeneous computing systems to maximize throughput, Proceedings of the 32th International Conference on Parallel Processing, 2003.

N. T. Karonis, B. Toonen, and I. Foster, Mpich-g2: A grid-enabled implementation of the message passing interface, J.Parallel and Distributed Computing, vol.63, issue.5, pp.551-563, 2003.

S. Khuller and Y. Kim, On broadcasting in heterogenous networks, Proceedings of the fifteenth annual ACM-SIAM symposium on Discrete algorithms, pp.1011-1020, 2004.

B. Kreaseck, L. Carter, H. Casanova, J. Ferrante, and S. Nandy, Interference-aware scheduling, International Journal of High Performance Computing Applications, vol.20, issue.1, pp.45-59, 2006.

Y. Kwok and I. Ahmad, Static scheduling algorithms for allocating directed task graphs to multiprocessors, ACM Computing Surveys, vol.31, issue.4, pp.406-471, 1999.

P. Liu, Broadcast scheduling optimization for heterogeneous cluster systems, Journal of Algorithms, vol.42, issue.1, pp.135-152, 2002.

R. Newton, L. Girod, M. Craig, S. Madden, and G. Morrisett, Wavescript: A case-study in applying a distributed stream-processing language, 2008.

T. Saif and M. Parashar, Understanding the behavior and performance of non-blocking communications in MPI, Proceedings of Euro-Par 2004: Parallel Processing, vol.3149, pp.173-182, 2004.

M. Spencer, R. Ferreira, M. Beynon, T. Kurc, U. Catalyurek et al., Executing multiple pipelined data analysis operations in the grid, 2002 ACM/IEEE Supercomputing Conference, 2002.

U. Srivastava, K. Munagala, J. Widom, and R. Motwani, Query optimization over web services, VLDB '06: Proceedings of the 32nd Int. Conference on Very Large Data Bases, pp.355-366, 2006.

R. Stephens, A survey of stream processing, Acta Informatica, vol.34, issue.7, pp.491-541, 1997.

J. Subhlok and G. Vondran, Optimal mapping of sequences of data parallel tasks, Proc. 5th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'95, pp.134-143, 1995.

J. Subhlok and G. Vondran, Optimal latency-throughput tradeoffs for data parallel pipelines, ACM Symposium on Parallel Algorithms and Architectures SPAA'96, pp.62-71, 1996.

K. Taura and A. A. Chien, A heuristic algorithm for mapping communicating tasks on heterogeneous resources, Heterogeneous Computing Workshop, pp.102-115, 2000.

W. Thies, M. Karczmarek, and S. Amarasinghe, Streamit: a language for streaming applications, Proceedings of 11th Int. Conf. on Compiler Construction, 2002.

H. Topcuoglu, S. Hariri, and M. Y. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Trans. Parallel Distributed Systems, vol.13, issue.3, pp.260-274, 2002.

N. Vydyanathan, U. Catalyurek, T. Kurc, P. Saddayappan, and J. Saltz, An approach for optimizing latency under throughput constraints for application workflows on clusters, 2007.

N. Vydyanathan, U. Catalyurek, T. Kurc, P. Saddayappan, and J. Saltz, Optimizing latency and throughput of application workflows on clusters, 2007.

T. Yang and A. Gerasoulis, DSC: Scheduling parallel tasks on an unbounded number of processors, IEEE Trans. Parallel and Distributed Systems, vol.5, issue.9, pp.951-967, 1994.