[1] THIES W, KARCZMAREK M, AMARASINGHE S. StreamIt: A language for streaming applications[C]// Proceedings of the 11th International Conference on Compiler Construction. London, UK: ACM Press, 2002: 179-196. [2] BUCK I, FOLEY T, HOM D, et al. Brook for GPUs: Stream computing on graphics hardware[J]. ACM Transactions on Graphics, 2004, 23(3): 777-786. [3] MARK W R, GLANVILLE R S, AKELEY K, et al. Cg: A system for programming graphics hardware in a C-like language[J]. ACM Transactions on Graphics, 2003, 22(3): 896-907. [4] KUDLUR M, MAHLKE S. Orchestrating the execution of stream programs on multicore platforms[J]. ACM SIGPLAN Notices, 2008, 43(6): 114-124. [5] GORDON M, THIES W, AMARASINGHE S. Exploiting coarse-grained task, data, and pipeline parallelism in stream programs[C]// Proceedings of 14th International Conference on Architectural Support for Programming Languages and Operating Systems. San Jose, USA: ACM Press, 2006: 151-162. [6] ZHURAVLEV S, BLAGODUROV S, FEDOROVA A. Addressing shared resource contention in multicore processors via scheduling[C]// Proceedings of the 15th Edition of ASPLOS on Architectural Support for Programming Languages and Operating Systems. New York, USA: ACM Press, 2010: 129-142. [7] 张维维,魏海涛,于俊清, 等. COstream:一种面向数据流的编程语言和编译器实现[J].计算机学报,2013,36(10): 1993-2006. ZHANG Weiwei, WEI Haitao, YU Junqing, et al. COstream: A language for datafloe application and complier[J]. Chinese Journal of Computers, 2013, 36(10): 1993-2006. [8] THIELE L, BACIVAROV I, HAID W, et al. Mapping applications to tiled multiprocessor embedded systems[C]// Proceedings of the 7th Application of Concurrency to System Design. Bratislava, Slovak: ACM Press, 2007: 29-40. [9] ABOU-RJEILI A, KARYPIS G. Multilevel algorithms for partitioning power-law graphs[C]// Proceedings of 20th International Parallel and Distributed Processing Symposium. Washington USA: IEEE Press, 2006: 1-10. [10] MOLKA D, HACKENBERG D, SCHONE R, et al. Memory performance and cache coherency effects on an Intel Nehalem multiprocessor system[C]// Proceedings of the 18th International Conference on Parallel Architectures and Compilation Techniques. Raleigh, USA: IEEE Press, 2009: 261-270. [11] TORRELLAS J, LAM H S, HENNESSY J L. False sharing and spatial locality in multiprocessor caches[J]. IEEE Transactions on Computers, 1994, 43(6): 651-663. [12] MELLOR-CRUMMEY J, SCOTT M. L Algorithms for scalable synchronization on shared-memory multiprocessors[J]. ACM Transactions on Computer Systems, 1991, 9(1): 21-65.
() () |